Apache Arrow: A New Gold Standard for Dataset Transport // Subsurface Summer 2020
Wes McKinney, Director @ Ursa Labs, presents "Apache Arrow: A New Gold Standard for Dataset Transport" at Subsurface Summer 2020 -- the first-ever cloud data lake conference.
Session Abstract: This talk will discuss the role that Apache Arrow and Arrow Flight play in disrupting previous approaches to creating data services that transport large datasets. We'll look at the technical details of why the Arrow protocol is an attractive choice and share specific examples of where Arrow has been employed for better performance and resource efficiency. We'll also discuss the implications for the upcoming generation of data systems.
--------------------------------------------------------------------------------------------
Subsurface Is The Industry’s First Cloud Data Lake Conference
Presented by Dremio
Expand your technical knowledge and hear from your peers and industry experts about cloud data lake use cases and architectures at Subsurface™, where we explore what’s below the surface of the data lake. Hear firsthand from open source and technology leaders at companies about their experiences spearheading open source projects and building modern data lakes. Explore real-world use cases, from data warehousing and BI to data science and advanced analytics.
Connect with us!
Event Page https://bit.ly/33Ym5rh
Twitter https://bit.ly/2CqKhHt
Summer 2020 https://bit.ly/3iH160u
Dremio https://bit.ly/2XmtEnN
Видео Apache Arrow: A New Gold Standard for Dataset Transport // Subsurface Summer 2020 канала Dremio
Session Abstract: This talk will discuss the role that Apache Arrow and Arrow Flight play in disrupting previous approaches to creating data services that transport large datasets. We'll look at the technical details of why the Arrow protocol is an attractive choice and share specific examples of where Arrow has been employed for better performance and resource efficiency. We'll also discuss the implications for the upcoming generation of data systems.
--------------------------------------------------------------------------------------------
Subsurface Is The Industry’s First Cloud Data Lake Conference
Presented by Dremio
Expand your technical knowledge and hear from your peers and industry experts about cloud data lake use cases and architectures at Subsurface™, where we explore what’s below the surface of the data lake. Hear firsthand from open source and technology leaders at companies about their experiences spearheading open source projects and building modern data lakes. Explore real-world use cases, from data warehousing and BI to data science and advanced analytics.
Connect with us!
Event Page https://bit.ly/33Ym5rh
Twitter https://bit.ly/2CqKhHt
Summer 2020 https://bit.ly/3iH160u
Dremio https://bit.ly/2XmtEnN
Видео Apache Arrow: A New Gold Standard for Dataset Transport // Subsurface Summer 2020 канала Dremio
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Wes McKinney - Apache Arrow: Leveling Up the Data Science StackApache Beam Explained in 12 MinutesScale By The Bay 2019: Jacques Nadeau, Vectorized Query Processing for CPUs using Apache ArrowThree reasons why Dremio"Apache Arrow and the Future of Data Frames" with Wes McKinneyPandas Limitations - Pandas vs Dask vs PySpark - DataMites CoursesIntroduction to Self-Service Data with DremioWhat is Dremio and Apache Arrow?Wes McKinney and Dr. Neal Richardson - Speeding Up Data Access in R with Apache ArrowApache Arrow: In Theory, In Practice // Apache Arrow Meetup SFInfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in Apache ArrowThe columnar roadmap: Apache Parquet and Apache ArrowTransforming Code into Beautiful, Idiomatic PythonUsing Apache Arrow with Spark in RExtending Pandas using Apache Arrow and Numba - Uwe L Kornapply() Conference 2021 | Apache Arrow and the Next Generation of Data Analytics SystemsWant Access to a High-Performance Jupyter Notebook? BlazingSQL Notebooks (Powered by NVIDIA GPUs)Wes McKinney - Apache Arrow: Present & FutureDelta Lake for Apache Spark - Why do we need Delta Lake for Spark?Improving Python and Spark Performance and Interoperability with Apache Arrow