Data pipeline at Spotify – from the inception to the production - Rafal Wojdyla
We all use the same tools and frameworks to process data, but the environment and best practices differ from one company to another. In this talk Rafal – an engineer at Spotify, will present the full journey, an idea has to travel from the inception to the full fledged data pipeline at Spotify. We will cover the tools and frameworks we use to ease the processes of bootstrapping, testing, validating and productionizing a new data pipeline. You will hear about some of the open source tools like scio, ratatool, gcs-tools and styx, as well as some internal ones. This talk will give you a sense of how does it feel to be a data engineer at Spotify – including all the struggle – you will see that we still have a long way to go.
Видео Data pipeline at Spotify – from the inception to the production - Rafal Wojdyla канала Criteo Eng
Видео Data pipeline at Spotify – from the inception to the production - Rafal Wojdyla канала Criteo Eng
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Quick Python 6: Our First Data PipelineData Pipeline Frameworks: The Dream and the Reality | BeeswaxData Journey EP-02: Batch Ingestion 📦 - 5 ways to ingest files into Google CloudHow I Passed Google's Professional Data Engineer Exam in 2022Variational Autoencoders for Recommender SystemsIntroduction to Accessing Spotify API in RModelling Microservices at SpotifyHow Spotify Distills Terabytes of Raw Data into Meaningful Music Recommendations | SpotifyWhat is the difference between Database vs. Data lake vs. Warehouse?Machine Learning at Spotify - Gustav Soderstrom | AI Podcast ClipsThe Story of Why We Migrate to gRPC and How We Go About It - Matthias Grüter, SpotifyWhat is Data Pipeline | How to design Data Pipeline ? - ETL vs Data pipelineContent Analytics at SpotifyScalable computation for Bayesian hierarchical modelsMachine Learning & Big Data for Music Discovery presented by SpotifyTop 4 Use Cases For Streaming Data IntegrationModern Data Warehouse in Azure - episode 3 - Data LakeScalable Data Ingestion Architecture Using Airflow and Spark | Komodo HealthHow to Remove Outliers From Data (Spotify Song Popularity Prediction) - Data Every Day #127What is a Data Strategy?