Lessons Learned From Running Apache Iceberg at Petabyte Scale // Subsurface 2020

Anton Okolnychyi, Apache Iceberg PMC Member and Apache Spark Contributor, presents "Lessons Learned From Running Apache Iceberg at Petabyte Scale" at Subsurface Summer 2020 -- the first-ever cloud data lake conference.

Apache Iceberg is an open table format that allows data engineers and data scientists to build efficient and reliable data lakes with features that are normally present only in data warehouses. Specifically, Iceberg enables ACID compliance on any object store or distributed system, boosts the performance of highly selective queries, provides reliable schema evolution, and offers time travel and rollback capabilities. Iceberg lets companies simplify their current architectures as well as unlock new use cases on top of data lakes.

This talk describes how to keep Iceberg tables in optimal shape while running at petabyte scale. In particular, the presentation focuses on how to efficiently compact metadata and data in Iceberg tables with millions of files without affecting concurrent readers and writers.
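
To make the idea of data compaction concrete, here is a minimal sketch of how small files might be grouped into larger rewrite units. It is an illustration only, not Iceberg's actual planning logic and not the method presented in the talk: the names `FileGroup` and `plan_compaction`, the file names, and the sizes are all hypothetical, and the grouping uses a plain greedy first-fit-decreasing bin-packing heuristic.

```python
from dataclasses import dataclass, field


@dataclass
class FileGroup:
    """A hypothetical set of small files to be rewritten as one larger file."""
    files: list = field(default_factory=list)
    total_bytes: int = 0


def plan_compaction(file_sizes, target_bytes):
    """Group small files into bins of at most target_bytes.

    file_sizes maps a file name to its size in bytes. Files that already
    meet the target size are left alone. Uses first-fit-decreasing, which
    is a simple heuristic and not guaranteed to be optimal.
    """
    groups = []
    # Visit the largest files first so big files claim bins early.
    ordered = sorted(file_sizes.items(), key=lambda kv: kv[1], reverse=True)
    for name, size in ordered:
        if size >= target_bytes:
            continue  # already large enough, nothing to compact
        for group in groups:
            if group.total_bytes + size <= target_bytes:
                group.files.append(name)
                group.total_bytes += size
                break
        else:
            # No existing bin has room, so open a new one.
            groups.append(FileGroup([name], size))
    # A bin holding a single file gains nothing from being rewritten.
    return [g for g in groups if len(g.files) > 1]


if __name__ == "__main__":
    # Made-up sizes, for illustration only.
    sizes = {"a.parquet": 600, "b.parquet": 300, "c.parquet": 100,
             "d.parquet": 1000, "e.parquet": 50}
    for group in plan_compaction(sizes, target_bytes=1000):
        print(group.files, group.total_bytes)
```

In a real table format, the plan would only be the first half of the job: the rewritten files would then be swapped in with an atomic metadata commit, which is what allows concurrent readers and writers to continue unaffected.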

--------------------------------------------------------------------------------------------
Subsurface Is The Industry’s First Cloud Data Lake Conference
Presented by Dremio

Expand your technical knowledge and hear from your peers and industry experts about cloud data lake use cases and architectures at Subsurface™, where we explore what’s below the surface of the data lake. Hear firsthand from open source and technology leaders at companies about their experiences spearheading open source projects and building modern data lakes. Explore real-world use cases, from data warehousing and BI to data science and advanced analytics.

Connect with us!
Event Page https://bit.ly/33Ym5rh
Twitter https://bit.ly/2CqKhHt
Summer 2020 https://bit.ly/3iH160u
Dremio https://bit.ly/2XmtEnN

Video: Lessons Learned From Running Apache Iceberg at Petabyte Scale // Subsurface 2020, from the Dremio channel