
Mid-October DataEng Meetup

Hi All, to keep you going with your monthly fill of data engineering, we will be bringing you an online edition this month.

💬 Join our Slack Group here: https://goo.gl/forms/DVNazDmNBg1FFm2X2

Speakers:
🎤 Mary Law, Principal Specialist (Data & Analytics, APJC), AWS
Amazon Redshift powers the lake house architecture
The Pareto principle (the “vital few”) applies to analytics solutions such as data warehouses, which makes the data lake an important component of an enterprise analytics solution. In this talk, we’ll see how Amazon Redshift Spectrum powers the lake house architecture by querying your data across Redshift, the lake house, operational databases, open data formats and now open source Apache Hudi and Delta Lake, without any need for ETL or loading data. Combine it with materialized views to significantly boost query performance for repeated and predictable analytical workloads while providing cost optimization and cost control.
Mary Law is a Principal Specialist, Big Data & Analytics at AWS. She enjoys working with customers to build, scale and achieve operational excellence, optimize costs and scale workloads efficiently through workload reviews, best practices workshops and deep dives. She focuses specifically on AWS data services such as Amazon Athena, Amazon Redshift, AWS Glue and QuickSight. Outside of work, she loves spending time mastering her patisserie skills.

🎤 Mahdi Askari, Solutions Architect, Databricks
Making Data Lakes performant and reliable with DELTA
Apache Spark™ is the dominant processing framework for big data. Delta Lake adds reliability to Spark so your analytics and machine learning initiatives have ready access to quality, reliable data. This webinar covers the use of Delta Lake to enhance data reliability for Spark environments.

🎤 George Pongracz, Senior Data Engineer, SEEK
How SEEK “Lakehouses” in AWS
This talk takes a look at the Data Pipeline and Serverless Data Platform our team has built to load CRM data into Amazon Redshift with Delta Lake, Apache Spark and Apache Airflow.
George is a Senior Data Engineer in the Enterprise DataOps Team at SEEK who enjoys building distributed systems that simplify working with data in AWS. Out of hours, he enjoys spending time with his family and overclocking the GPU on his son’s PC to boost his framerate in Fortnite.

Here is George's Medium post as well: https://medium.com/seek-blog/data-lakehousing-in-aws-7c76577ed88f

Video: Mid-October DataEng Meetup, from the DataEngBytes channel
Video information
21 October 2020, 20:04:26
01:52:48