SQL Analytics and the Lakehouse Architecture | Ali Ghodsi | Keynote Data + AI Summit EU 2020
The Rise of the Lakehouse:
Data warehouses have a long history in decision support and business intelligence applications. But, data warehouses were not well suited to dealing with the unstructured, semi-structured, and streaming data common in modern enterprises. This led to organizations building data lakes of raw data about a decade ago. But, they also lacked important capabilities. The need for a better solution has given rise to lakehouse architecture, which implements similar data structures and data management features to those in a data warehouse, directly on the kind of low cost storage used for data lakes.
This keynote by Databricks CEO, Ali Ghodsi, explains how the open source Delta Lake project allows the industry to realize the full potential of lakehouse architecture. Additionally, Ali will discuss the newly announced SQL Analytics service that allows users to run traditional analytics on their data lake, instead of moving data out to data warehouses, without sacrificing performance, security, or quality. This service completes the vision of lakehouse architecture to allow the data lake to be a single source of truth of all data workloads.
Speaker: Ali Ghodsi, Co-founder & CEO, Original Creator of Apache Spark, Databricks
Discussion with Tableau Software
Francois Ajenstat
Chief Product Officer, Tableau Software
Demo: SQL Analytics and the Lakehouse Architecture
Brooke Wenig,
Machine Learning Practice Lead, Databricks
How SQL Analytics Makes Lakehouse Fast
Reynold Xin
Co-founder & Chief Architect, Databricks
In this keynote, Reynold Xin, Co-founder and Chief Architect at Databricks, will explore how SQL Analytics brings a new level of performance to data lakes for analytics workloads. Traditionally, data lakes have struggled with analytics, because they struggle to deliver the fast query performance wiht low latency at high user concurrency. Reynold will provide a techical deep dive of how Databricks has addresssed these challenges. First, Delta Engine, Databricks’ polymorphic vectorized execution engine, delivers extremely fast single query throughput. Second, the new auto-scaling SQL-optimized clusters in SQL Analytics make it easy to match compute capacity to user load. And third, optimizations in the new SQL Analytics Endpoints reduce the time required to get query results by up to 6x. Altogether, SQL Analytics is able to provide users with data warehousing performance at data lake economics for their analytics workloads.
Discussion with Peter Boncz
Professor, CWI & Vrije Universiteit Amsterdam
Discussion with Unilever
Phinean Woodward
Head of Architecture, Information and Analytics, Unilever
In this talk, we’ll discuss how the Lakehouse architecture has become a critical part of Unilever’s information management infrastructure to limit traditional enterprise data silos, and enable agile access to data both up and downstream that’s needed for faster decision making. As a result, IT is helping Unilever to deliver higher quality predictions in many areas of the business, thereby building trust in AI throughout the company.
About:
Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.
Read more here: https://databricks.com/product/unifie...
Connect with us:
Website: https://databricks.com
Facebook: https://www.facebook.com/databricksinc
Twitter: https://twitter.com/databricks
LinkedIn: https://www.linkedin.com/company/data...
Instagram: https://www.instagram.com/databricksinc/
Видео SQL Analytics and the Lakehouse Architecture | Ali Ghodsi | Keynote Data + AI Summit EU 2020 канала Databricks
Data warehouses have a long history in decision support and business intelligence applications. But, data warehouses were not well suited to dealing with the unstructured, semi-structured, and streaming data common in modern enterprises. This led to organizations building data lakes of raw data about a decade ago. But, they also lacked important capabilities. The need for a better solution has given rise to lakehouse architecture, which implements similar data structures and data management features to those in a data warehouse, directly on the kind of low cost storage used for data lakes.
This keynote by Databricks CEO, Ali Ghodsi, explains how the open source Delta Lake project allows the industry to realize the full potential of lakehouse architecture. Additionally, Ali will discuss the newly announced SQL Analytics service that allows users to run traditional analytics on their data lake, instead of moving data out to data warehouses, without sacrificing performance, security, or quality. This service completes the vision of lakehouse architecture to allow the data lake to be a single source of truth of all data workloads.
Speaker: Ali Ghodsi, Co-founder & CEO, Original Creator of Apache Spark, Databricks
Discussion with Tableau Software
Francois Ajenstat
Chief Product Officer, Tableau Software
Demo: SQL Analytics and the Lakehouse Architecture
Brooke Wenig,
Machine Learning Practice Lead, Databricks
How SQL Analytics Makes Lakehouse Fast
Reynold Xin
Co-founder & Chief Architect, Databricks
In this keynote, Reynold Xin, Co-founder and Chief Architect at Databricks, will explore how SQL Analytics brings a new level of performance to data lakes for analytics workloads. Traditionally, data lakes have struggled with analytics, because they struggle to deliver the fast query performance wiht low latency at high user concurrency. Reynold will provide a techical deep dive of how Databricks has addresssed these challenges. First, Delta Engine, Databricks’ polymorphic vectorized execution engine, delivers extremely fast single query throughput. Second, the new auto-scaling SQL-optimized clusters in SQL Analytics make it easy to match compute capacity to user load. And third, optimizations in the new SQL Analytics Endpoints reduce the time required to get query results by up to 6x. Altogether, SQL Analytics is able to provide users with data warehousing performance at data lake economics for their analytics workloads.
Discussion with Peter Boncz
Professor, CWI & Vrije Universiteit Amsterdam
Discussion with Unilever
Phinean Woodward
Head of Architecture, Information and Analytics, Unilever
In this talk, we’ll discuss how the Lakehouse architecture has become a critical part of Unilever’s information management infrastructure to limit traditional enterprise data silos, and enable agile access to data both up and downstream that’s needed for faster decision making. As a result, IT is helping Unilever to deliver higher quality predictions in many areas of the business, thereby building trust in AI throughout the company.
About:
Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.
Read more here: https://databricks.com/product/unifie...
Connect with us:
Website: https://databricks.com
Facebook: https://www.facebook.com/databricksinc
Twitter: https://twitter.com/databricks
LinkedIn: https://www.linkedin.com/company/data...
Instagram: https://www.instagram.com/databricksinc/
Видео SQL Analytics and the Lakehouse Architecture | Ali Ghodsi | Keynote Data + AI Summit EU 2020 канала Databricks
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![How to Build a Cloud Data Platform Part 1- Architecture](https://i.ytimg.com/vi/uhVpLwjEOKU/default.jpg)
![Data + AI Summit 2021 - Full Thursday AM Keynote on Apache Spark, Data Science + Machine Learning](https://i.ytimg.com/vi/VLLTuMBARss/default.jpg)
![Ali Ghodsi, Co-Founder and CEO of Databricks](https://i.ytimg.com/vi/-GYbqZRJeFM/default.jpg)
![What is the difference between Database vs. Data lake vs. Warehouse?](https://i.ytimg.com/vi/E49BFhThC3U/default.jpg)
![Foundations - Slab vs. Pier and Beam - Which is better?](https://i.ytimg.com/vi/q4nGeEOXm8k/default.jpg)
![Realizing the Vision of the Data Lakehouse | Ali Ghodsi | Keynote Spark + AI Summit 2020](https://i.ytimg.com/vi/g11y-kJHr3I/default.jpg)
![AWS re:Invent 2020: The lake house approach to data warehousing with Amazon Redshift](https://i.ytimg.com/vi/35wXL0Q1Dcc/default.jpg)
![Keynote - Data Mesh by Zhamak Dehghani](https://i.ytimg.com/vi/L_-fHo0ZkAo/default.jpg)
![Azure Synapse Analytics: A Data Lakehouse - James Serra - PASS Virtual Summit 2020](https://i.ytimg.com/vi/mbHFKS8yf48/default.jpg)
![Artificial intelligence and algorithms: pros and cons | DW Documentary (AI documentary)](https://i.ytimg.com/vi/s0dMTAQM4cw/default.jpg)
![What is Lake House and why it matters | AWS Events](https://i.ytimg.com/vi/vx9hW0ZUzOA/default.jpg)
![The Parquet Format and Performance Optimization Opportunities Boudewijn Braams (Databricks)](https://i.ytimg.com/vi/1j8SdS7s_NY/default.jpg)
![Overcoming the Data Glut with Snowflake, Databricks and Portworx](https://i.ytimg.com/vi/Xi70_iTY3FY/default.jpg)
![Tech Talk | Using Delta as a Change Data Capture Source](https://i.ytimg.com/vi/7y0AAQ6qX5w/default.jpg)
![Azure Databricks Tutorial | Data transformations at scale](https://i.ytimg.com/vi/M7t1T1Q5MNc/default.jpg)
![What is Databricks? The Data Lakehouse You've Never Heard Of](https://i.ytimg.com/vi/ThrmPaleEiI/default.jpg)
![Workshop | Managing the Complete Machine Learning Lifecycle with MLflow: 1 of 3](https://i.ytimg.com/vi/x3cxvsUFVZA/default.jpg)
![Data Mesh in Practice: How Europe's Leading Online Platform for Fashion Goes Beyond the Data Lake](https://i.ytimg.com/vi/eiUhV56uVUc/default.jpg)
![Spark + AI Summit 2020 NA - Wednesday Morning Keynotes](https://i.ytimg.com/vi/OLJKIogf2nU/default.jpg)
![Making Apache Spark™ Better with Delta Lake](https://i.ytimg.com/vi/LJtShrQqYZY/default.jpg)