New Developments in the Open Source Ecosystem: Apache Spark 3 0, Delta Lake, and Koalas
In this talk, we will highlight major efforts happening in the Spark ecosystem. In particular, we will dive into the details of adaptive and static query optimizations in Spark 3.0 to make Spark easier to use and faster to run. We will also demonstrate how new features in Koalas, an open source library that provides Pandas-like API on top of Spark, helps data scientists gain insights from their data quicker.
About: Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.
Read more here: https://databricks.com/product/unified-data-analytics-platform
Connect with us:
Website: https://databricks.com
Facebook: https://www.facebook.com/databricksinc
Twitter: https://twitter.com/databricks
LinkedIn: https://www.linkedin.com/company/databricks
Instagram: https://www.instagram.com/databricksinc/
Видео New Developments in the Open Source Ecosystem: Apache Spark 3 0, Delta Lake, and Koalas канала Databricks
About: Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.
Read more here: https://databricks.com/product/unified-data-analytics-platform
Connect with us:
Website: https://databricks.com
Facebook: https://www.facebook.com/databricksinc
Twitter: https://twitter.com/databricks
LinkedIn: https://www.linkedin.com/company/databricks
Instagram: https://www.instagram.com/databricksinc/
Видео New Developments in the Open Source Ecosystem: Apache Spark 3 0, Delta Lake, and Koalas канала Databricks
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Spark + Parquet In Depth: Spark Summit East talk by: Emily Curtin and Robbie StricklandDelta Lake for apache Spark | How does it work | How to use delta lake | Delta Lake for Spark ACIDSimplifying Model Management with MLflow - Matei Zaharia (Databricks) Corey Zumar (Databricks)Introduction to Stateful Stream Processing with Apache Flink • Robert Metzger • GOTO 2019Predicting the Future of the Web Development (2020 and 2025)Top 20 Projects Completing in 2020Apache Spark Core—Deep Dive—Proper Optimization Daniel Tomes DatabricksAnnouncing Delta Lake Open Source Project | Ali Ghodsi (Databricks), Michael Armbrust (Databricks)Apache Spark 3 | Design | Architecture | New Features | Interview QuestionArtificial intelligence and algorithms: pros and cons | DW Documentary (AI documentary)Intro to Apache Spark for Java and Scala Developers - Ted Malaska (Cloudera)Azure Databricks - What's new! | DB102Delta Lake for Apache Spark - Why do we need Delta Lake for Spark?A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets - Jules DamjiGet Rid of Traditional ETL, Move to Spark! (Bas Geerdink)Physical Plans in Spark SQL - David Vrba (Socialbakers)The Fascinating Truth About Gravity | Jim Al-Khalili: Gravity and Me | SparkTop 5 Mistakes When Writing Spark ApplicationsTop 20 Apache Spark Interview Questions and Answers | Hadoop Interview Questions and Answers