AWS re:Invent 2019: [REPEAT 1] Deep dive into running Apache Spark on Amazon EMR (ANT308-R1)
Amazon EMR enables customers to run ETL, machine learning, real-time processing, data science, and low-latency SQL at petabyte scale. We focus this session on running Apache Spark on Amazon EMR. We introduce design patterns such as using Amazon S3 instead of HDFS, running long- and short-lived clusters, using notebooks, and performance-related enhancements. We discuss lowering cost with auto scaling and Spot Instances, and security with encryption and fine-grained access control with AWS Lake Formation.
Видео AWS re:Invent 2019: [REPEAT 1] Deep dive into running Apache Spark on Amazon EMR (ANT308-R1) канала AWS Events
Видео AWS re:Invent 2019: [REPEAT 1] Deep dive into running Apache Spark on Amazon EMR (ANT308-R1) канала AWS Events
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Amazon EMR Deep Dive and Best Practices - AWS Online Tech TalksRun Spark Application(Scala) on Amazon EMR (Elastic MapReduce) clusterAWS re:Invent 2019: Machine Learning Summit (MLS201)Big Data Hadoop Spark Cluster on AWS EMR Cloud | Big Data on AWS Cloud | Production Big Data ClusterAWS re:Invent 2019: Deep dive and best practices for Amazon Redshift (ANT418)What is Amazon EMR and how can I use it for processing data?Run Apache Spark on Kubernetes with Amazon EMR on Amazon EKS - AWS Online Tech TalksAWS re:Invent 2019: A day in the life of a Netflix engineer (NFX202)AWS re:Invent 2019: [REPEAT 1] Dive deep on how to save with AWS Savings Plans (CMP210-R1)AWS re:Invent 2019: [NEW LAUNCH!] Running Kubernetes Applications on AWS Fargate (CON326-R1)AWS re:Invent 2019: [REPEAT 1] Advanced VPC design and new capabilities for Amazon VPC (NET305-R1)Two examples for each of Amazon’s 14 Leadership PrinciplesWhat is AWS EMR | Introduction to Amazon EMR | Data Processing with AWS EMR | AWS Training | EdurekaAWS re:Invent 2019: [REPEAT 1] AWS Transit Gateway reference architectures for many VPCs (NET406-R1)AWS EMR Big Data Processing with Spark and Hadoop | Python, PySpark, Step by Step InstructionsAWS re:Invent 2019: Running lean architectures: How to be cost-effective on AWS (ARC209-R1)Create a QnA Bot with Amazon Lex and Amazon Kendra to Answer Questions - AWS Online Tech Talks41. Pyspark: How to run Spark Application on Amazon EMR ElasticMapReduce cluster【AWS Black Belt Online Seminar】Amazon EMRAWS re:Invent 2019: [REPEAT 2] Amazon EC2 foundations (CMP211-R2)