Mid-October DataEng Meetup
Hi All, to keep you going with your monthly fill of data engineering, we will be bringing you an online edition this month.
💬 Join our Slack Group here: https://goo.gl/forms/DVNazDmNBg1FFm2X2
Speakers:
🎤 Mary Law, Principal Specialist (Data & Analytics, APJC), AWS
Amazon Redshift powers the lake house architecture
The Pareto principles (the “vital few”) applies to analytics solutions such as Data warehouse which makes data lake an important component of an enterprise analytics solution. In this talk, we’ll see how Amazon Redshift Spectrum powers the lake house architecture by querying your data across Redshift, lake house, operational databases, open data formats and now open source Apache Hudi and Delta Lake without any need for ETL or loading data. Combine with Materialized view to significantly boost query performance for repeated and predictable analytical workloads while providing cost optimization & cost control.
Mary Law is a Principal Specialist, Big Data & Analytics at AWS. She enjoys working with customers in building, scaling and achieving operational excellence, optimize costs and scale workloads efficiently through workload reviews, best practices workshops and deep dives. Specifically focused on AWS Data Services such as Amazon Athena, Amazon Redshift, AWS Glue and QuickSight. Outside of work, she loves spending time in mastering her patisserie skills.
🎤 Mahdi Askari, Solutions Architect, Databricks
Making Data Lakes performant and reliable with DELTA
Apache Spark™ is the dominant processing framework for big data. Delta Lake adds reliability to Spark so your analytics and machine learning initiatives have ready access to quality, reliable data. This webinar covers the use of Delta Lake to enhance data reliability for Spark environments.
🎤 George Pongracz, Senior Data Engineer, SEEK
How SEEK “Lakehouses” in AWS
This talk takes a look at the Data Pipeline and Serverless Data Platform our team has built to load CRM data into Amazon Redshift with Delta Lake, Apache Spark and Apache Airflow.
George is a Senior Data Engineer in the Enterprise DataOps Team at SEEK, who enjoys building distributed systems that simplify working with data in AWS. Out of hours, he enjoys spending time together with his family and overclocking the GPU on his son’s PC to boost his framerate in Fortnite.
Here is George's medium post as well: https://medium.com/seek-blog/data-lakehousing-in-aws-7c76577ed88f
Видео Mid-October DataEng Meetup канала DataEngBytes
💬 Join our Slack Group here: https://goo.gl/forms/DVNazDmNBg1FFm2X2
Speakers:
🎤 Mary Law, Principal Specialist (Data & Analytics, APJC), AWS
Amazon Redshift powers the lake house architecture
The Pareto principles (the “vital few”) applies to analytics solutions such as Data warehouse which makes data lake an important component of an enterprise analytics solution. In this talk, we’ll see how Amazon Redshift Spectrum powers the lake house architecture by querying your data across Redshift, lake house, operational databases, open data formats and now open source Apache Hudi and Delta Lake without any need for ETL or loading data. Combine with Materialized view to significantly boost query performance for repeated and predictable analytical workloads while providing cost optimization & cost control.
Mary Law is a Principal Specialist, Big Data & Analytics at AWS. She enjoys working with customers in building, scaling and achieving operational excellence, optimize costs and scale workloads efficiently through workload reviews, best practices workshops and deep dives. Specifically focused on AWS Data Services such as Amazon Athena, Amazon Redshift, AWS Glue and QuickSight. Outside of work, she loves spending time in mastering her patisserie skills.
🎤 Mahdi Askari, Solutions Architect, Databricks
Making Data Lakes performant and reliable with DELTA
Apache Spark™ is the dominant processing framework for big data. Delta Lake adds reliability to Spark so your analytics and machine learning initiatives have ready access to quality, reliable data. This webinar covers the use of Delta Lake to enhance data reliability for Spark environments.
🎤 George Pongracz, Senior Data Engineer, SEEK
How SEEK “Lakehouses” in AWS
This talk takes a look at the Data Pipeline and Serverless Data Platform our team has built to load CRM data into Amazon Redshift with Delta Lake, Apache Spark and Apache Airflow.
George is a Senior Data Engineer in the Enterprise DataOps Team at SEEK, who enjoys building distributed systems that simplify working with data in AWS. Out of hours, he enjoys spending time together with his family and overclocking the GPU on his son’s PC to boost his framerate in Fortnite.
Here is George's medium post as well: https://medium.com/seek-blog/data-lakehousing-in-aws-7c76577ed88f
Видео Mid-October DataEng Meetup канала DataEngBytes
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![DataEngBytes 2023 - SYD-T3-02 - Benjamin Boyter](https://i.ytimg.com/vi/ovXPxdJJSU8/default.jpg)
![DataEngBytes 2023 - SYD-T1-07 - Lizzie Macneill](https://i.ytimg.com/vi/krbS2MlCJy8/default.jpg)
![DataEngBytes 2023 - SYD-T1-03 - Chad Sanderson](https://i.ytimg.com/vi/M4RwU8w1ng8/default.jpg)
![DataEngBytes 2023 - SYD-T2-03 - Akanksha Malik](https://i.ytimg.com/vi/YaLAkK3cyXc/default.jpg)
![DataEngBytes 2023 - SYD-T1-11 - Joe Reis](https://i.ytimg.com/vi/Tkr3JUewKkc/default.jpg)
![DataEngBytes 2023 - SYD-T3-06 - James Weakley](https://i.ytimg.com/vi/VIryRDUxR9E/default.jpg)
![DataEngBytes 2023 - SYD-T3-04 - Abhinav Goyal](https://i.ytimg.com/vi/iWGfyfQ-Rxg/default.jpg)
![DataEngBytes 2023 - SYD-T2-05 - Arezou Soltani](https://i.ytimg.com/vi/7iOuwfS2qw8/default.jpg)
![DataEngBytes 2023 - SYD-T2-06 - Panel Discussion](https://i.ytimg.com/vi/OkEGxZ_vyg4/default.jpg)
![DataEngBytes 2023 - MEL-T1-03 - Adric Streatfeild](https://i.ytimg.com/vi/0rXYH_oILS4/default.jpg)
![DataEngBytes 2023 - MEL-T3-06 - David Colls](https://i.ytimg.com/vi/2fp_f4R4OVc/default.jpg)
![DataEngBytes 2023 - SYD-T1-06 - Sana Sanai](https://i.ytimg.com/vi/nsDir4Yhkgw/default.jpg)
![DataEngBytes 2023 - MEL-T1-08 - Troy Sellers](https://i.ytimg.com/vi/6Z21kLcrwUM/default.jpg)
![DataEngBytes 2023 - MEL-T1-04 - Michael Hyatt](https://i.ytimg.com/vi/dmlE60s8l9k/default.jpg)
![DataEngBytes 2023 - MEL-T1-05 - Izzudin Hafiz](https://i.ytimg.com/vi/1UUnzHszblI/default.jpg)
![DataEngBytes 2023 - SYD-T3-01 - Scott Eade](https://i.ytimg.com/vi/loy6po3wfAY/default.jpg)
![DataEngBytes 2023 - MEL-T3-04 - Sarah Young](https://i.ytimg.com/vi/36InhW3no3A/default.jpg)
![DataEngBytes 2023 - SYD-T1-05 - Patrick Cuba](https://i.ytimg.com/vi/zk3aqhRMyD8/default.jpg)
![DataEngBytes 2023 - MEL-T1-06 - Lizzie Macneill](https://i.ytimg.com/vi/tlEDWOof04o/default.jpg)
![DataEngBytes 2023 - MEL-T2-01 - Mikiko Bazeley](https://i.ytimg.com/vi/NqZ2fnczASI/default.jpg)
![DataEngBytes Conference 2023 Melbourne Akanksha Malik v1](https://i.ytimg.com/vi/6ZzBSviMuow/default.jpg)