Mastering a data pipeline with Python / Robson Luis Monteiro Junior (Microsoft)
Python Conf++ 2020 Online
Abstract and slides:
https://conf.python.ru/moscow/2020/abstracts/6316
Building data pipelines is a well-established task: a vast number of tools automate the work and help developers create data pipelines with a few clicks in the cloud. Such tools can solve simple or well-defined standard problems. This presentation demystifies the topic, drawing on years of experience and painful mistakes using Python as the core for creating reliable data pipelines and managing an insane amount of valuable data. We'll cover how each piece fits into this puzzle: data acquisition, ingestion, transformation, storage, workflow management, and serving. We'll also walk through best practices and possible issues, compare PySpark with Dask and Pandas, and look at Airflow and at Apache Arrow as a new approach.
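To make the stages named in the abstract concrete, here is a minimal sketch using only the Python standard library. It is not taken from the talk: the sample data, the function names (`acquire`, `ingest`, `transform`, `store`, `serve`, `run_pipeline`) and the SQLite table are all hypothetical, and `run_pipeline` only stands in for the workflow-management role that a scheduler such as Airflow would play in a real deployment.

```python
import csv
import io
import json
import sqlite3

# Hypothetical raw input; a real pipeline would pull this from an API, file, or queue.
RAW_CSV = "name,amount\nana,10.5\nbob,not_a_number\nana,4.5\n"


def acquire():
    """Acquisition: obtain raw data from a source (here, an in-memory string)."""
    return RAW_CSV


def ingest(raw_text):
    """Ingestion: parse the raw text into a list of dict records."""
    return list(csv.DictReader(io.StringIO(raw_text)))


def transform(records):
    """Transformation: skip rows with a non-numeric amount, then sum per name."""
    totals = {}
    for row in records:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # bad row: dropped here, a real pipeline might log or quarantine it
        totals[row["name"]] = totals.get(row["name"], 0.0) + amount
    return totals


def store(totals, conn):
    """Storage: persist the aggregated totals in a SQLite table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS totals (name TEXT PRIMARY KEY, amount REAL)"
    )
    conn.executemany("INSERT OR REPLACE INTO totals VALUES (?, ?)", totals.items())
    conn.commit()


def serve(conn):
    """Serving: read the stored totals back and expose them as JSON."""
    rows = conn.execute("SELECT name, amount FROM totals ORDER BY name").fetchall()
    return json.dumps(dict(rows))


def run_pipeline():
    """Workflow management (stand-in): run the stages in dependency order."""
    conn = sqlite3.connect(":memory:")
    try:
        store(transform(ingest(acquire())), conn)
        return serve(conn)
    finally:
        conn.close()


if __name__ == "__main__":
    print(run_pipeline())
```

Keeping each stage as a separate function with plain inputs and outputs means any one of them could later be swapped for a Pandas, Dask, or PySpark implementation without touching the others.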
--------
Found an error in the video? Write to us at support@ontico.ru
Video "Mastering a data pipeline with Python / Robson Luis Monteiro Junior (Microsoft)" from the MoscowPython channel