Supported Iceberg Data Types
Bodo supports most of the data types defined in the Apache Iceberg specification. The following table shows how each supported Iceberg data type is represented in Python and in SQL.
| Iceberg Data Type | Equivalent Python / Pandas Array Type | Equivalent SQL Column Type |
|---|---|---|
| boolean | bool[pyarrow] | BOOL |
| int | int32[pyarrow] | INT |
| long | int64[pyarrow] | BIGINT |
| float | float32[pyarrow] | FLOAT |
| double | float64[pyarrow] | DOUBLE |
| decimal(P, S) | decimal128(P, S)[pyarrow] | DECIMAL(P, S) |
| date | date32[pyarrow] | DATE |
| time | time32[pyarrow] | TIME |
| timestamp | timestamp[us][pyarrow] | TIMESTAMP |
| timestamptz | timestamp[us, tz=UTC][pyarrow] | TIMESTAMPTZ |
| string | large_string[pyarrow] | STRING |
| binary | binary[pyarrow] | BINARY |
| struct<...> | struct<...>[pyarrow] | STRUCT |
| list | large_list<E>[pyarrow] | LIST |
| map | map<K, V>[pyarrow] | MAP |
Bodo does not support these Iceberg data types yet:
- fixed
- uuid
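
The mapping above can also be written down as a small lookup helper, which may be handy when validating a table schema before reading it. The sketch below is illustrative only: `lookup_iceberg_type`, `PRIMITIVE_TYPES`, and `UNSUPPORTED_TYPES` are hypothetical names and not part of the Bodo API. It covers the primitive types and `decimal(P, S)` from the table; nested types (struct, list, map) are left out for brevity.

```python
import re

# Hypothetical helper, not part of Bodo: encodes the primitive rows of the
# table as Iceberg type -> (Python / Pandas array type, SQL column type).
PRIMITIVE_TYPES = {
    "boolean": ("bool[pyarrow]", "BOOL"),
    "int": ("int32[pyarrow]", "INT"),
    "long": ("int64[pyarrow]", "BIGINT"),
    "float": ("float32[pyarrow]", "FLOAT"),
    "double": ("float64[pyarrow]", "DOUBLE"),
    "date": ("date32[pyarrow]", "DATE"),
    "time": ("time32[pyarrow]", "TIME"),
    "timestamp": ("timestamp[us][pyarrow]", "TIMESTAMP"),
    "timestamptz": ("timestamp[us, tz=UTC][pyarrow]", "TIMESTAMPTZ"),
    "string": ("large_string[pyarrow]", "STRING"),
    "binary": ("binary[pyarrow]", "BINARY"),
}

# Iceberg types listed as not supported yet.
UNSUPPORTED_TYPES = {"fixed", "uuid"}

# Matches parameterized decimals such as "decimal(10, 2)".
_DECIMAL_RE = re.compile(r"decimal\(\s*(\d+)\s*,\s*(\d+)\s*\)")


def lookup_iceberg_type(iceberg_type: str) -> tuple[str, str]:
    """Return (Python / Pandas array type, SQL column type) for an Iceberg type."""
    name = iceberg_type.strip().lower()
    if name in PRIMITIVE_TYPES:
        return PRIMITIVE_TYPES[name]

    match = _DECIMAL_RE.fullmatch(name)
    if match:
        precision, scale = match.groups()
        return (
            f"decimal128({precision}, {scale})[pyarrow]",
            f"DECIMAL({precision}, {scale})",
        )

    # Drop any parameter suffix, e.g. "fixed(16)" -> "fixed".
    base = re.split(r"[\[(<]", name, maxsplit=1)[0]
    if base in UNSUPPORTED_TYPES:
        raise NotImplementedError(f"Iceberg type {base!r} is not supported yet")
    raise ValueError(f"Iceberg type {iceberg_type!r} is not covered by this sketch")


print(lookup_iceberg_type("long"))
print(lookup_iceberg_type("decimal(10, 2)"))
```

Raising `NotImplementedError` for `fixed` and `uuid` keeps "not supported yet" distinct from "unknown type name", so a caller can report the two cases differently.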