The Highs and Lows of Building an Adtech Data Pipeline | TripleLift
Get the slides: https://www.datacouncil.ai/talks/the-highs-and-lows-of-building-an-adtech-data-pipeline
ABOUT THE TALK:
The talk will give an overview of how the scrappy data engineering team at TripleLift evolved its data pipeline to keep up with its rapid growth which is currently processing tens of billions of events a day. Emphasis will be placed on the major turning points and decisions that required us to tackle the same problems in a different way - both due to new scales of data as well as growing business requirements.
The talk will cover the following data technologies and how they were used and modified over the years: Kafka, Redshift, Secor, Spark, Spark Streaming, VoltDB, and Druid.
ABOUT THE SPEAKER:
Dan is currently the VP of Engineering at TripleLift and was responsible for introducing many of the problems this talk will cover. Before TripleLift he launched a small startup that was acquired by an advertising agency and had a few stints as a quantitative engineer, the precursor to data science. He's currently trying to keep up with the overwhelming amount of open source data engineering tools.
ABOUT DATA COUNCIL:
Data Council (https://www.datacouncil.ai/) is a community and conference series that provides data professionals with the learning and networking opportunities they need to grow their careers. Make sure to subscribe to our channel for more videos, including DC_THURS, our series of live online interviews with leading data professionals from top open source projects and startups.
FOLLOW DATA COUNCIL:
Twitter: https://twitter.com/DataCouncilAI
LinkedIn: https://www.linkedin.com/company/datacouncil-ai
Facebook: https://www.facebook.com/datacouncilai
Eventbrite: https://www.eventbrite.com/o/data-council-30357384520
Видео The Highs and Lows of Building an Adtech Data Pipeline | TripleLift канала Data Council
ABOUT THE TALK:
The talk will give an overview of how the scrappy data engineering team at TripleLift evolved its data pipeline to keep up with its rapid growth which is currently processing tens of billions of events a day. Emphasis will be placed on the major turning points and decisions that required us to tackle the same problems in a different way - both due to new scales of data as well as growing business requirements.
The talk will cover the following data technologies and how they were used and modified over the years: Kafka, Redshift, Secor, Spark, Spark Streaming, VoltDB, and Druid.
ABOUT THE SPEAKER:
Dan is currently the VP of Engineering at TripleLift and was responsible for introducing many of the problems this talk will cover. Before TripleLift he launched a small startup that was acquired by an advertising agency and had a few stints as a quantitative engineer, the precursor to data science. He's currently trying to keep up with the overwhelming amount of open source data engineering tools.
ABOUT DATA COUNCIL:
Data Council (https://www.datacouncil.ai/) is a community and conference series that provides data professionals with the learning and networking opportunities they need to grow their careers. Make sure to subscribe to our channel for more videos, including DC_THURS, our series of live online interviews with leading data professionals from top open source projects and startups.
FOLLOW DATA COUNCIL:
Twitter: https://twitter.com/DataCouncilAI
LinkedIn: https://www.linkedin.com/company/datacouncil-ai
Facebook: https://www.facebook.com/datacouncilai
Eventbrite: https://www.eventbrite.com/o/data-council-30357384520
Видео The Highs and Lows of Building an Adtech Data Pipeline | TripleLift канала Data Council
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
How Superset and Druid Power Real-Time Analytics at Airbnb | DataEngConf SF '17Building highly reliable data pipelines at Datadog | DatadogWhat is a Data Architecture? Modern Data Architectures ExplainedData DAGs with lineage for fun and for profitWhat is the difference between AdTech and MarTech? | How Programmatic Advertising Works | DSP | SSPData Pipeline Frameworks: The Dream and the Reality | BeeswaxSpicy Tech Industry Secrets from a Technical Program ManagerDeveloping Real Time Bidding Solutions with RTBkit - WebinarFletcher Riehl: Using Embedding Layers to Manage High Cardinality Categorical Data | PyData LA 2019Image Labeling using Active Learning reduces Human EffortMachine Learning for AdTech in Action with Cyrille Dubarry and Han Ju Teads (Teads.tv)near real-time streaming OLAP with Kafka and DruidBuilding a High-Performance Networking Protocol for MicroservicesAd Ops Metrics | Reporting Metrics | Data Points to ConsiderCreating a Data Engineering Culture | Big Data InstituteGoogle Technical Program Manager Mock Interview: Data CentersAdvanced Campaign Analysis: Measuring Campaign Effectiveness and Long-Term ImpactWhat is Enterprise Architecture (EA) and why is it important? EA concepts explained in a simple way.Inside Apache Druid's Storage and Query Engine (Gian Merlino)