Using Structured Streaming in Apache Spark: Insights Without Tradeoffs
Get started with Structured Streaming on Databricks today. https://databricks.com/try-databricks
Apache Spark 2.0 introduced Structured Streaming which allows users to continually and incrementally update your view of the world as new data arrives while still using the same familiar Spark SQL abstractions. Michael Armbrust from Databricks talks about the progress made since the release of Spark 2.0 on robustness, latency, expressiveness and observability, using examples of production end-to-end continuous applications.
Overview:
Parallelism and Complexity
Developer Productivity and Efficiency
Throughput and Latency
Production Use Cases
- Viacom
- iPass
Streaming at Databricks
Engineer Office Hours
This talk was originally presented at Spark Summit East 2017.
You can view the slides on Slideshare:
http://www.slideshare.net/databricks/spark-summit-east-2017-michael-armbrust-keynote-insights-without-tradeoffs-using-structured-streaming
Related Articles:
Structured Streaming In Apache Spark
https://databricks.com/blog/2016/07/28/structured-streaming-in-apache-spark.html
Real-time Streaming ETL with Structured Streaming in Apache Spark 2.1
https://databricks.com/blog/2017/01/19/real-time-streaming-etl-structured-streaming-apache-spark-2-1.html
About: Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.
Read more here: https://databricks.com/product/unified-data-analytics-platform
Connect with us:
Website: https://databricks.com
Facebook: https://www.facebook.com/databricksinc
Twitter: https://twitter.com/databricks
LinkedIn: https://www.linkedin.com/company/databricks
Instagram: https://www.instagram.com/databricksinc/
Видео Using Structured Streaming in Apache Spark: Insights Without Tradeoffs канала Databricks
Apache Spark 2.0 introduced Structured Streaming which allows users to continually and incrementally update your view of the world as new data arrives while still using the same familiar Spark SQL abstractions. Michael Armbrust from Databricks talks about the progress made since the release of Spark 2.0 on robustness, latency, expressiveness and observability, using examples of production end-to-end continuous applications.
Overview:
Parallelism and Complexity
Developer Productivity and Efficiency
Throughput and Latency
Production Use Cases
- Viacom
- iPass
Streaming at Databricks
Engineer Office Hours
This talk was originally presented at Spark Summit East 2017.
You can view the slides on Slideshare:
http://www.slideshare.net/databricks/spark-summit-east-2017-michael-armbrust-keynote-insights-without-tradeoffs-using-structured-streaming
Related Articles:
Structured Streaming In Apache Spark
https://databricks.com/blog/2016/07/28/structured-streaming-in-apache-spark.html
Real-time Streaming ETL with Structured Streaming in Apache Spark 2.1
https://databricks.com/blog/2017/01/19/real-time-streaming-etl-structured-streaming-apache-spark-2-1.html
About: Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.
Read more here: https://databricks.com/product/unified-data-analytics-platform
Connect with us:
Website: https://databricks.com
Facebook: https://www.facebook.com/databricksinc
Twitter: https://twitter.com/databricks
LinkedIn: https://www.linkedin.com/company/databricks
Instagram: https://www.instagram.com/databricksinc/
Видео Using Structured Streaming in Apache Spark: Insights Without Tradeoffs канала Databricks
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Get Rid of Traditional ETL, Move to Spark! (Bas Geerdink)Deep Dive into Stateful Stream Processing in Structured Streaming - Tathagata DasKafka Tutorial - Core ConceptsTop 5 Mistakes When Writing Spark ApplicationsTuning and Debugging Apache SparkMaking Structured Streaming Ready for Production Updates: Spark Summit East talk by Tathagata DasReal-Time Data Pipelines Made Easy with Structured Streaming in Apache Spark | DataEngConf SF '18RDDs, DataFrames and Datasets in Apache Spark - NE Scala 2016Recipes for Running Spark Streaming Applications in Production - Tathagata Das (Databricks)Spark Summit East 2016 Demo: Databricks Community EditionProductizing Structured Streaming Jobs Burak Yavuz DatabricksUnderstanding Query Plans and Spark UIs - Xiao Li DatabricksETL Is Dead, Long Live Streams: real-time streams w/ Apache KafkaScaling Self Service Analytics with Databricks and Apache Spark - Amelia Chu & Dan MorrisAdvanced Apache Spark Training - Sameer Farooqui (Databricks)Apache Spark 2.0: A Deep Dive Into Structured Streaming - by Tathagata DasApache Spark - ComputerphileBuilding Robust ETL Pipelines with Apache Spark - Xiao LiUse Cases and Design Patterns for Spark Streaming (MeetupVideo.com)Easy, Scalable, Fault Tolerant Stream Processing with Structured Streaming in Apache Spark