Data Types and Schemas¶
Factory Functions¶
These should be used to create Arrow data types and schemas.
null () |
Create instance of null type |
bool_ () |
Create instance of boolean type |
int8 () |
Create instance of signed int8 type |
int16 () |
Create instance of signed int16 type |
int32 () |
Create instance of signed int32 type |
int64 () |
Create instance of signed int64 type |
uint8 () |
Create instance of unsigned int8 type |
uint16 () |
Create instance of unsigned uint16 type |
uint32 () |
Create instance of unsigned uint32 type |
uint64 () |
Create instance of unsigned uint64 type |
float16 () |
Create half-precision floating point type |
float32 () |
Create single-precision floating point type |
float64 () |
Create double-precision floating point type |
time32 (unit) |
Create instance of 32-bit time (time of day) type with unit resolution |
time64 (unit) |
Create instance of 64-bit time (time of day) type with unit resolution |
timestamp (unit[, tz]) |
Create instance of timestamp type with resolution and optional time zone |
date32 () |
Create instance of 32-bit date (days since UNIX epoch 1970-01-01) |
date64 () |
Create instance of 64-bit date (milliseconds since UNIX epoch 1970-01-01) |
binary (int length=-1) |
Create variable-length binary type |
string () |
Create UTF8 variable-length string type |
utf8 () |
Alias for string() |
decimal128 (int precision, int scale=0) |
Create decimal type with precision and scale and 128bit width |
list_ (value_type) |
Create ListType instance from child data type or field |
struct (fields) |
Create StructType instance from fields |
dictionary (DataType index_type, …) |
Dictionary (categorical, or simply encoded) type |
field (name, type, bool nullable=True[, metadata]) |
Create a pyarrow.Field instance |
schema (fields[, metadata]) |
Construct pyarrow.Schema from collection of fields |
from_numpy_dtype (dtype) |
Convert NumPy dtype to pyarrow.DataType |
Type Classes¶
Do not instantiate these classes directly. Instead, call one of the factory functions above.
DataType () |
Base class of all Arrow data types. |
DictionaryType |
Concrete class for dictionary data types. |
ListType |
Concrete class for list data types. |
StructType |
Concrete class for struct data types. |
UnionType |
Concrete class for struct data types. |
TimestampType |
Concrete class for timestamp data types. |
Time32Type |
Concrete class for time32 data types. |
Time64Type |
Concrete class for time64 data types. |
FixedSizeBinaryType |
Concrete class for fixed-size binary data types. |
Decimal128Type |
Concrete class for decimal128 data types. |
Field () |
A named field, with a data type, nullability, and optional metadata. |
Schema () |
Type Checking¶
These functions are predicates to check whether a DataType
instance
represents a given data type (such as int32
) or general category
(such as “is a signed integer”).
is_boolean (t) |
Return True if value is an instance of a boolean type |
is_integer (t) |
Return True if value is an instance of any integer type |
is_signed_integer (t) |
Return True if value is an instance of any signed integer type |
is_unsigned_integer (t) |
Return True if value is an instance of any unsigned integer type |
is_int8 (t) |
Return True if value is an instance of an int8 type |
is_int16 (t) |
Return True if value is an instance of an int16 type |
is_int32 (t) |
Return True if value is an instance of an int32 type |
is_int64 (t) |
Return True if value is an instance of an int64 type |
is_uint8 (t) |
Return True if value is an instance of an uint8 type |
is_uint16 (t) |
Return True if value is an instance of an uint16 type |
is_uint32 (t) |
Return True if value is an instance of an uint32 type |
is_uint64 (t) |
Return True if value is an instance of an uint64 type |
is_floating (t) |
Return True if value is an instance of a floating point numeric type |
is_float16 (t) |
Return True if value is an instance of an float16 (half-precision) type |
is_float32 (t) |
Return True if value is an instance of an float32 (single precision) type |
is_float64 (t) |
Return True if value is an instance of an float64 (double precision) type |
is_decimal (t) |
Return True if value is an instance of a decimal type |
is_list (t) |
Return True if value is an instance of a list type |
is_struct (t) |
Return True if value is an instance of a struct type |
is_union (t) |
Return True if value is an instance of a union type |
is_nested (t) |
Return True if value is an instance of a nested type |
is_temporal (t) |
Return True if value is an instance of a temporal (date, time, timestamp) type |
is_timestamp (t) |
Return True if value is an instance of a timestamp type |
is_date (t) |
Return True if value is an instance of a date type |
is_date32 (t) |
Return True if value is an instance of a date32 (days) type |
is_date64 (t) |
Return True if value is an instance of a date64 (milliseconds) type |
is_time (t) |
Return True if value is an instance of a time type |
is_time32 (t) |
Return True if value is an instance of a time32 type |
is_time64 (t) |
Return True if value is an instance of a time64 type |
is_null (t) |
Return True if value is an instance of a null type |
is_binary (t) |
Return True if value is an instance of a variable-length binary type |
is_unicode (t) |
Alias for is_string |
is_string (t) |
Return True if value is an instance of string (utf8 unicode) type |
is_fixed_size_binary (t) |
Return True if value is an instance of a fixed size binary type |
is_map (t) |
Return True if value is an instance of a map logical type |
is_dictionary (t) |
Return True if value is an instance of a dictionary-encoded type |