Overview of Apache Beam
Apache Beam is set of portable SDKs (Java, Python, Go) for constructing streaming and batch data processing pipelines that can be written once and executed on any supported runtime. Tyler will give an overview of the project, with a focus on the current community efforts towards completing the vision laid out in when the project was founded: providing full cross-language portability across supported execution engines.
Tyler Akidau is a staff software engineer at Google Seattle. He leads technical infrastructure’s internal data processing teams (MillWheel & Flume), is a founding member of the Apache Beam PMC, and has spent the last seven years working on massive-scale data processing systems. He is the author of the 2015 Dataflow Model paper and the Streaming 101 and Streaming 102 articles on the O’Reilly website. His preferred mode of transportation is by cargo bike, with his two young daughters in tow.
This a talk by Robert Crowe presented at SF Big Analytics meetup in Feb 2019 at Lyft HQ in San Francisco.
Видео Overview of Apache Beam канала SF Big Analytics
Tyler Akidau is a staff software engineer at Google Seattle. He leads technical infrastructure’s internal data processing teams (MillWheel & Flume), is a founding member of the Apache Beam PMC, and has spent the last seven years working on massive-scale data processing systems. He is the author of the 2015 Dataflow Model paper and the Streaming 101 and Streaming 102 articles on the O’Reilly website. His preferred mode of transportation is by cargo bike, with his two young daughters in tow.
This a talk by Robert Crowe presented at SF Big Analytics meetup in Feb 2019 at Lyft HQ in San Francisco.
Видео Overview of Apache Beam канала SF Big Analytics
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Apache Kafka in 6 minutesA Whirlwind Overview of Apache BeamApache Beam Explained in 12 MinutesWhat is Hadoop?: SQL Comparison7 Chakras | Our Doorway To Knowledge | SadhguruWatermarks: Time and Progress in Apache Beam and BeyondAPI vs. SDK: What's the difference?Introduction to Stateful Stream Processing with Apache Flink • Robert Metzger • GOTO 2019Dataflow: A Unified Model for Batch and Streaming Data ProcessingApache Beam: Portable and Parallel Data Processing (Google Cloud Next '17)Terraform ExplainedStreaming Concepts & Introduction to Flink - Event Time and WatermarksBuild ETL Pipelines using Cloud DataflowRust GenericsApache Beam | How to read a CSV file | Google colab | Map | Pipeline | Fillter | google dataflowApache Kafka Explained (Comprehensive Overview)Word count example - From Google Cloud Storage to BigQuery with Apache BeamStream Processing Fundamentals with Apache Beam - PyCon SG 2019Identity and Access Management: Technical OverviewWhat is Dataflow?