From CDC to Apache Iceberg in Minutes 🚀 | OLake + Trino Demo ft. Lester Martin (Starburst)

Modern data pipelines break at scale not because of tools, but because of complexity, fragmentation, and performance bottlenecks.

In this session, we show how to go from raw database changes to fully queryable Apache Iceberg tables using OLake, and how to query and optimize those tables efficiently with Trino.

Huge thanks to Lester Martin (Developer Advocate at Starburst) for sharing deep insights into Iceberg internals, query performance, and the small file problem.

What we’ll cover:
1. High-throughput ingestion with Full Load + CDC using OLake
2. Iceberg fundamentals: metadata, snapshots, and file structure
3. The small file problem and its impact on performance
4. Querying Iceberg with Trino
5. Compaction and optimization for faster queries

The session includes a live demo of end-to-end data replication and query optimization, with practical insights for building scalable, open lakehouse pipelines.
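
To give a rough feel for the small file problem and compaction mentioned above, here is a minimal Python sketch. It is not the algorithm Iceberg, OLake, or Trino actually use; it is a simple first-fit bin-packing plan under assumed settings. The function names, the 128 MB target size, and the 32 MB small-file threshold are all hypothetical choices made for illustration.

```python
MB = 1024 * 1024


def plan_compaction(file_sizes, target_size=128 * MB, small_file_threshold=32 * MB):
    """Group small data files so each rewritten file stays within target_size.

    Illustrative first-fit-decreasing bin packing; the thresholds are assumptions,
    not defaults taken from any real engine.
    """
    # Only files below the threshold are candidates for rewriting.
    small = sorted((s for s in file_sizes if s < small_file_threshold), reverse=True)
    groups: list[list[int]] = []
    for size in small:
        for group in groups:
            if sum(group) + size <= target_size:
                group.append(size)
                break
        else:
            # No existing group has room, so start a new one.
            groups.append([size])
    # Rewriting a single file on its own gains nothing, so drop such groups.
    return [g for g in groups if len(g) > 1]


def files_after_compaction(file_sizes, groups):
    """Count data files once every group has been rewritten into one file."""
    merged = sum(len(g) for g in groups)
    return len(file_sizes) - merged + len(groups)


if __name__ == "__main__":
    # 100 small CDC commits of 4 MB each, plus one large file from the full load.
    sizes = [4 * MB] * 100 + [256 * MB]
    plan = plan_compaction(sizes)
    print(len(sizes), "files before,", files_after_compaction(sizes, plan), "after")
```

Fewer, larger files mean fewer file opens and less metadata for a query engine to plan over, which is why frequent CDC commits usually go hand in hand with periodic compaction.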

OLake website - https://olake.io
OLake GitHub - https://github.com/datazip-inc/olake
Join OLake Slack - https://olake.io/slack
Find us on LinkedIn - https://www.linkedin.com/company/datazipio


olake apache-iceberg lakehouse cdc debezium


Video information

April 6, 2026, 11:02:26

00:43:13

OLake

