Parquet Format
The Apache Parquet format allows reading and writing Parquet data.
The following example creates a table using the Filesystem connector and the Parquet format.
CREATE TABLE user_behavior (
  item_id BIGINT,
  category_id BIGINT,
  ts TIMESTAMP(3),
  dt STRING
) WITH (
  'connector' = 'filesystem',
  'path' = '/tmp/user_behavior',
  'format' = 'parquet'
)
Currently, the Parquet format type mapping is compatible with Apache Hive but differs from Apache Spark:
- Timestamp: the timestamp type is mapped to int96 regardless of its precision.
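To make the int96 mapping more concrete, the sketch below packs and unpacks a timestamp in the 12-byte layout commonly used by Hive-compatible writers: an 8-byte little-endian nanoseconds-of-day value followed by a 4-byte little-endian Julian day number. This layout is an assumption based on that convention, not something stated on this page, and the helper names `encode_int96` and `decode_int96` are hypothetical. UTC is used only to keep the arithmetic unambiguous.

```python
import struct
from datetime import datetime, timedelta, timezone

# Julian day number of 1970-01-01 (the Unix epoch).
JULIAN_DAY_OF_UNIX_EPOCH = 2440588
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def encode_int96(ts: datetime) -> bytes:
    """Pack a UTC datetime into 12 bytes (assumed Hive-style int96 layout)."""
    delta = ts - EPOCH
    julian_day = JULIAN_DAY_OF_UNIX_EPOCH + delta.days
    # timedelta keeps seconds/microseconds non-negative, so this is the
    # time elapsed since midnight of that day, expressed in nanoseconds.
    nanos_of_day = (delta.seconds * 1_000_000 + delta.microseconds) * 1_000
    # "<qi": little-endian, no padding, int64 followed by int32 = 12 bytes.
    return struct.pack("<qi", nanos_of_day, julian_day)


def decode_int96(raw: bytes) -> datetime:
    """Unpack 12 bytes back into a UTC datetime (microsecond resolution)."""
    nanos_of_day, julian_day = struct.unpack("<qi", raw)
    days = julian_day - JULIAN_DAY_OF_UNIX_EPOCH
    return EPOCH + timedelta(days=days, microseconds=nanos_of_day // 1_000)


# A TIMESTAMP(3) value: millisecond precision still occupies the full 12 bytes.
sample = datetime(2020, 1, 1, 12, 0, 0, 123000, tzinfo=timezone.utc)
raw = encode_int96(sample)
roundtrip = decode_int96(raw)
```

Because the stored value always carries nanoseconds of day, the declared precision of the column (for example `TIMESTAMP(3)`) does not change the physical width under this layout.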