Get Rid of Traditional ETL, Move to Spark! (Bas Geerdink)
ETL has been around since the 90s, supporting a whole ecosystem of BI tools and practises. While traditional ETL has proven its value, it’s time to move on to modern ways of getting your data from A to B. Since BI moved to big data, data warehousing became data lakes, and applications became microservices, ETL is next our our list of obsolete terms. Spark provides an ideal middleware framework for writing code that gets the job done fast, reliable, readable. In this session I will support this statement with some nice ‘old vs new’ diagrams, code examples and use cases. Please join if you want to know more about the NoETL paradigm, or just want to be convinced of the possibilities of Spark in this area!
Видео Get Rid of Traditional ETL, Move to Spark! (Bas Geerdink) канала Spark Summit
Видео Get Rid of Traditional ETL, Move to Spark! (Bas Geerdink) канала Spark Summit
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Building Robust ETL Pipelines with Apache Spark - Xiao LiETL Is Dead, Long Live Streams: real-time streams w/ Apache KafkaSpark + Parquet In Depth: Spark Summit East talk by: Emily Curtin and Robbie StricklandA Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets - Jules DamjiThree Use Cases for BigQuery and Apache SparkTop 5 Mistakes When Writing Spark ApplicationsTop 5 Mistakes When Writing Spark ApplicationsEverything You Need to Know About Big Data: From Architectural Principles to Best PracticesWhat is Data Pipeline | How to design Data Pipeline ? - ETL vs Data pipelineApache Spark Core—Deep Dive—Proper Optimization Daniel Tomes DatabricksScalable Stream Processing: A Survey of Storm, Samza, Spark and Flink by Felix GessertA Deep Dive into the Catalyst Optimizer (Herman van Hovell)Using Apache Spark 2.0 to Analyze the City of San Francisco's Open DataSparkSQL: A Compiler from Queries to RDDs: Spark Summit East talk by Sameer AgarwalData Architecture 101 for Your BusinessSpark Tutorial For Beginners | Big Data Spark Tutorial | Apache Spark Tutorial | SimplilearnNew Developments in the Open Source Ecosystem: Apache Spark 3 0, Delta Lake, and KoalasAWS re:Invent 2018: Effective Data Lakes: Challenges and Design Patterns (ANT316)Oracle and Kafka Build a Scalable ETL Solution