Containers and Big Data
As containerization continues to gain momentum and become a de facto standard for application deployment, challenges around containerization of big data workloads are coming to light. Great strides have been made within the open source communities towards running big data workloads in containers, but much is left to be done.
Apache Hadoop YARN is the modern distributed operating system for big data applications. It has morphed the Hadoop compute layer into a common resource-management platform that can host a wide variety of applications. At its core, YARN has a very powerful scheduler which enforces global cluster level invariants and helps sites manage user and operator expectations of elastic sharing, resource usage limits, SLAs, and more. YARN recently increased its support for Docker containerization and added a YARN service framework supporting long-running services.
In this session we will explore the emerging patterns and challenges related to containers and big data workloads, including running applications such as Apache Spark, Apache HBase, and Kubernetes in containers on YARN.
Speakers
BILLIE RINALDI
Principal Software Engineer I
Hortonworks
SHANE KUMPF
Software Engineer
Hortonworks
Видео Containers and Big Data канала DataWorks Summit
Apache Hadoop YARN is the modern distributed operating system for big data applications. It has morphed the Hadoop compute layer into a common resource-management platform that can host a wide variety of applications. At its core, YARN has a very powerful scheduler which enforces global cluster level invariants and helps sites manage user and operator expectations of elastic sharing, resource usage limits, SLAs, and more. YARN recently increased its support for Docker containerization and added a YARN service framework supporting long-running services.
In this session we will explore the emerging patterns and challenges related to containers and big data workloads, including running applications such as Apache Spark, Apache HBase, and Kubernetes in containers on YARN.
Speakers
BILLIE RINALDI
Principal Software Engineer I
Hortonworks
SHANE KUMPF
Software Engineer
Hortonworks
Видео Containers and Big Data канала DataWorks Summit
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Network Reference Architecture for Hadoop- Validated and Tested ApproachDataWorks Summit 2018 San Jose Day 2 KeynoteDay two keynotesHarnessing the Power of Big Data at Freddie MacLeveraging Hadoop to defend against improvised threatsBI on Big Data with instant response times at VerizonBig data processing meets non-volatile memory: opportunities and challengesStream Scaling in PravegaIntegrating Apache Phoenix with Distributed Query EnginesHelp Hadoop survive the 300 million block barrier and then back it up0605 Hadoop REST API Security with the Apache Knox GatewayPractice of large Hadoop cluster in China MobileHow to Ingest 16 Billion Records Per Day into your Hadoop EnvironmentCounting Unique Users in Real-Time: Here's a Challenge for You!Druid and Hive Together : Use Cases and Best PracticesBuilding intelligent applications, experimental ML with Uber’s Data Science WorkbenchObserving Intraday Indicators Using Real-Time Tick Data on Apache Superset and DruidHBase Global Indexing to support large-scale data ingestion at UberAchieving a 360-degree view of manufacturing via open source industrial data managementCan you re platform your Teradata, Oracle, Netezza and SQL Server analytic workloads to Hadoop? The