Advanced Apache Spark Training - Sameer Farooqui (Databricks)
Live Big Data Training from Spark Summit 2015 in New York City.
"Today I'll cover Spark core in depth and get you prepared to use Spark in your own prototypes. We'll start by learning about the big data ecosystem, then jump into RDDs (Resilient Distributed Datasets). Then we'll talk about integrating Spark with resource managers like YARN and Standalone mode. After a peek into some Spark Internals, we touch base upon Accumulators and Broadcast Variables. Finally, we end with Spark Streaming and a technical explanation of how the 100 TB sort competition was won in 2014." - Sameer
Slides:
https://spark-summit.org/wp-content/uploads/2015/03/SparkSummitEast2015-AdvDevOps-StudentSlides.pdf
Want to learn more about Spark?
Check out my new class, "Exploring Wikipedia with Apache Spark", recorded June 2016:
https://www.youtube.com/watch?v=vlVnSpJ6TDE&t=21m23s
// About the Presenter //
Sameer Farooqui is a Technology Evangelist at Databricks where he helps promote the adoption of Apache Spark. As a founding member of the training team, he created and taught advanced Spark classes at private clients, meetups and conferences globally.
Follow Sameer on -
Twitter: https://twitter.com/blueplastic
LinkedIn: https://www.linkedin.com/in/blueplastic
Видео Advanced Apache Spark Training - Sameer Farooqui (Databricks) канала Spark Summit
"Today I'll cover Spark core in depth and get you prepared to use Spark in your own prototypes. We'll start by learning about the big data ecosystem, then jump into RDDs (Resilient Distributed Datasets). Then we'll talk about integrating Spark with resource managers like YARN and Standalone mode. After a peek into some Spark Internals, we touch base upon Accumulators and Broadcast Variables. Finally, we end with Spark Streaming and a technical explanation of how the 100 TB sort competition was won in 2014." - Sameer
Slides:
https://spark-summit.org/wp-content/uploads/2015/03/SparkSummitEast2015-AdvDevOps-StudentSlides.pdf
Want to learn more about Spark?
Check out my new class, "Exploring Wikipedia with Apache Spark", recorded June 2016:
https://www.youtube.com/watch?v=vlVnSpJ6TDE&t=21m23s
// About the Presenter //
Sameer Farooqui is a Technology Evangelist at Databricks where he helps promote the adoption of Apache Spark. As a founding member of the training team, he created and taught advanced Spark classes at private clients, meetups and conferences globally.
Follow Sameer on -
Twitter: https://twitter.com/blueplastic
LinkedIn: https://www.linkedin.com/in/blueplastic
Видео Advanced Apache Spark Training - Sameer Farooqui (Databricks) канала Spark Summit
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Spark Scala | Spark Tutorial | Scala Tutorial | Spark Scala Full Course | IntellipaatModernizing Apache Hive Metastore for the Next DecadeA Deeper Understanding of Spark Internals - Aaron Davidson (Databricks)Using Apache Spark 2.0 to Analyze the City of San Francisco's Open DataData Engineering Interview | Apache Spark Interview | Live Big Data InterviewWhy Scala? ...by a hilarious Indian guyHadoop Tutorial: Intro to HDFSA Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets - Jules DamjiSpark + Parquet In Depth: Spark Summit East talk by: Emily Curtin and Robbie StricklandTop 5 Mistakes When Writing Spark ApplicationsDataDevOps for the Modern Data Warehouse on Microsoft Azure - Lace LofrancoSparkSQL: A Compiler from Queries to RDDs: Spark Summit East talk by Sameer AgarwalSpark Tutorial | Spark Tutorial for Beginners | Apache Spark Full Course - Learn Apache Spark 2020Optimizing Apache Spark SQL Joins: Spark Summit East talk by Vida HaPartition vs bucketing | Spark and Hive Interview QuestionApache Spark Beyond Shuffling • Holden Karau • GOTO 2017Using Structured Streaming in Apache Spark: Insights Without TradeoffsSpark Out of Memory Issue | Spark Memory Tuning | Spark Memory Management | Part 1Apache Spark Tutorial | Spark Tutorial for Beginners | Apache Spark Training | Edureka