PyIceberg 0.2.1: Iceberg ❤️ PyArrow & DuckDB

January 23, 2023

This video demonstrates the new features in PyIceberg 0.2.1.

For the demo, we use the docker-spark-iceberg setup that’s available here: https://github.com/tabular-io/docker-…

After spinning up the docker-compose setup, the Jupyter notebook will be available at http://localhost:8888/

The notebook PyIceberg – Getting Started.ipynb will guide you through how to read data into PyArrow, and then Pandas.

Lastly, we demonstrate how to query the Pandas dataset using DuckDB.

PyIceberg 0.2.1: Iceberg ❤️ PyArrow & DuckDB

Related Videos

Demonstrating PyIceberg

[Webinar] Apache Hive to Apache Iceberg Migration

Iceberg: Copy on Write vs Merge on Read