"Apache Arrow and the Future of Data Frames" with Wes McKinney
Title: Apache Arrow and the Future of Data Frames
Speaker: Wes McKinney, Director, Ursa Labs
Date: July 8, 2020
ABSTRACT
In this talk I will discuss the background and motivation for the Apache Arrow project, which contains a columnar in-memory data standard and an expanding set of supporting libraries for a variety of programming languages. We will look at the relationship between data frame libraries and database systems and explore the ways in which analytics systems are likely to evolve to be more "Arrow-native" over the coming years.
SPEAKER
Wes McKinney
Director, Ursa Labs
Wes McKinney is an open source software developer focusing on analytical computing. He created the Python pandas project and is a co-creator of Apache Arrow, his current focus. He authored two editions of the reference book Python for Data Analysis. Wes is a Member of The Apache Software Foundation and also a PMC member for Apache Parquet. He is the director of Ursa Labs, a not-for-profit development group focused on data science tools for Python and R powered by Apache Arrow, built in partnership with RStudio. Previously, he worked for Two Sigma, Cloudera, and AQR Capital Management, and he was co-founder and CEO of the startup DataPad.
MODERATOR
Larisa Sawyer
Two Sigma Investments; ACM Practitioner Board
Larisa Sawyer is a software engineering manager and Vice President at Two Sigma Investments. Her educational background is in Computer Science and Applied Mathematics. The opportunity to blendmath and CS drew her to the realm of finance. Her career began at investment banks, building algorithmic trading platforms. Larisa has been at Two Sigma for the past seven years, and has worked on distributed time series analysis and platform technologies to increase research productivity and collaboration. Larisa also serves on the ACM Practitioner Board, as well as the advisory board for Data Clinic, Two Sigma’s data and tech for good program that leverages employees’ data science skills and technological know-how to support charities and non-profits.
Видео "Apache Arrow and the Future of Data Frames" with Wes McKinney канала Association for Computing Machinery (ACM)
Speaker: Wes McKinney, Director, Ursa Labs
Date: July 8, 2020
ABSTRACT
In this talk I will discuss the background and motivation for the Apache Arrow project, which contains a columnar in-memory data standard and an expanding set of supporting libraries for a variety of programming languages. We will look at the relationship between data frame libraries and database systems and explore the ways in which analytics systems are likely to evolve to be more "Arrow-native" over the coming years.
SPEAKER
Wes McKinney
Director, Ursa Labs
Wes McKinney is an open source software developer focusing on analytical computing. He created the Python pandas project and is a co-creator of Apache Arrow, his current focus. He authored two editions of the reference book Python for Data Analysis. Wes is a Member of The Apache Software Foundation and also a PMC member for Apache Parquet. He is the director of Ursa Labs, a not-for-profit development group focused on data science tools for Python and R powered by Apache Arrow, built in partnership with RStudio. Previously, he worked for Two Sigma, Cloudera, and AQR Capital Management, and he was co-founder and CEO of the startup DataPad.
MODERATOR
Larisa Sawyer
Two Sigma Investments; ACM Practitioner Board
Larisa Sawyer is a software engineering manager and Vice President at Two Sigma Investments. Her educational background is in Computer Science and Applied Mathematics. The opportunity to blendmath and CS drew her to the realm of finance. Her career began at investment banks, building algorithmic trading platforms. Larisa has been at Two Sigma for the past seven years, and has worked on distributed time series analysis and platform technologies to increase research productivity and collaboration. Larisa also serves on the ACM Practitioner Board, as well as the advisory board for Data Clinic, Two Sigma’s data and tech for good program that leverages employees’ data science skills and technological know-how to support charities and non-profits.
Видео "Apache Arrow and the Future of Data Frames" with Wes McKinney канала Association for Computing Machinery (ACM)
Показать
Комментарии отсутствуют
Информация о видео
16 июля 2020 г. 0:44:06
01:00:26
Другие видео канала
![Wes McKinney - Apache Arrow: Leveling Up the Data Science Stack](https://i.ytimg.com/vi/uZA55cGDaBQ/default.jpg)
![Scale By The Bay 2019: Jacques Nadeau, Vectorized Query Processing for CPUs using Apache Arrow](https://i.ytimg.com/vi/hLm_duqB3Y4/default.jpg)
![STOC 2020 - Session 9B: Randomness in Computing](https://i.ytimg.com/vi/b6kaLAY7OgE/default.jpg)
![Martin Fowler – Microservices](https://i.ytimg.com/vi/2yko4TbC8cI/default.jpg)
![The Rise Of Open-Source Software](https://i.ytimg.com/vi/SpeDK1TPbew/default.jpg)
![July 17, 2020 SIG Governing Board Zoom Meeting](https://i.ytimg.com/vi/9SJ8pNgr74M/default.jpg)
![Data Engineering and Data Science: Bridging the Gap | DataEDGE 2016](https://i.ytimg.com/vi/-K9SjrWpeys/default.jpg)
![The Future of Data Science - Data Science @ Stanford](https://i.ytimg.com/vi/hxXIJnjC_HI/default.jpg)
![Python Data Wrangling: Preparing for the Future by Wes McKinney | DataEngConf NYC '16](https://i.ytimg.com/vi/BDhjOUPQLUc/default.jpg)
!["Probabilistic scripts for automating common-sense tasks" by Alexander Lew](https://i.ytimg.com/vi/MiiWzJE0fEA/default.jpg)
![Transforming Code into Beautiful, Idiomatic Python](https://i.ytimg.com/vi/OSGv2VnC0go/default.jpg)
![Data Microservices in Apache Spark using Apache Arrow Flight](https://i.ytimg.com/vi/Jm9HC9y3MDA/default.jpg)
![Building Software Systems At Google and Lessons Learned](https://i.ytimg.com/vi/modXC5IWTJI/default.jpg)
![Wes McKinney - Keynote - PyCon Colombia 2020](https://i.ytimg.com/vi/ZTXFQ2sEarQ/default.jpg)
![Using Apache Arrow, Calcite and Parquet to build a Relational Cache | Dremio](https://i.ytimg.com/vi/KMl9Py8o3pk/default.jpg)
![Best Practices for Virtual Conferences Zoom Recording, July 24, 2020](https://i.ytimg.com/vi/37X5GwihQOA/default.jpg)
![Lessons Learned the Hard Way: Hacking the Data Science Interview](https://i.ytimg.com/vi/3BRLGRqj8ps/default.jpg)
![Top 20 Apache Spark Interview Questions and Answers | Hadoop Interview Questions and Answers](https://i.ytimg.com/vi/Y8LKEDyA5iY/default.jpg)
![The columnar roadmap: Apache Parquet and Apache Arrow](https://i.ytimg.com/vi/dPb2ZXnt2_U/default.jpg)
![STOC 2020 - Workshop 6: Derandomizing Space-Bounded Computation](https://i.ytimg.com/vi/NfzE-WfGgq8/default.jpg)