Загрузка страницы

Spark 3.0 Features | Adaptive Query Execution(AQE) | Part 1 - Optimizing SKEW Joins

Data Skewness is handled using Key Salting Technique in spark 2.x versions. In spark 3.0, there is a cool feature to do it automatically using Adaptive query Executions.

One of the biggest problem in parallel computational systems is data skewness. Data Skewness in Spark happens due to joining on a key that is not evenly distributed across the cluster, causing some partitions to be very large and not allowing Spark to process data in parallel.

This feature will address the above issue automatically by enabling the below configuration:
spark.conf.set(“spark.sql.adaptive.enabled”,”true”)
Medium Blog https://medium.com/@jeevan.madhur22/spark-3-0-features-demo-data-skewness-aqe-a5c237d3d5db

Handling the Data Skewness using Key Salting Technique for Spark 2.x versions:
https://www.youtube.com/watch?v=d41_X78ojCg

Content By - Jeevan Madhur [LinkedIn - https://www.linkedin.com/in/jeevan-madhur-225a3a86]
Editing By - Sivaraman Ravi [LinkedIn - https://www.linkedin.com/in/sivaraman-ravi-791838114/]
Facebook Page - https://www.facebook.com/Tech-Island-113793100393638/?modal=admin_todo_tour

Please SUBSCRIBE to our channel :)

Share your feedback with us.
techieeisland@gmail.com

Видео Spark 3.0 Features | Adaptive Query Execution(AQE) | Part 1 - Optimizing SKEW Joins канала Tech Island
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
19 июля 2020 г. 18:42:21
00:13:56
Яндекс.Метрика