Systems @Scale 2019 - Apache Hive
Apache Hive: From MapReduce to enterprise-grade big data warehousing
https://code.fb.com/core-data/systems-scale/
Jesus Camacho Rodriguez, Principal Software Engineer, Cloudera
In this talk, Jesus describes the innovations on the journey from batch tool to full-fledged SQL enterprise data warehousing system. In particular, he shows how the community expanded the utility of the system by adding row-level transactional capabilities required for data modifications in star schema databases, introducing optimization techniques that are useful to handle today’s view hierarchies and big data operations, implementing the runtime improvements necessary to bring query latency and concurrency into the realm of interactive operation, and laying the groundwork for using Apache Hive as a relational front end to multiple storage and data systems. All these enhancements were introduced without ever compromising on the original characteristics that made the system popular.
Видео Systems @Scale 2019 - Apache Hive канала Justin Miller
https://code.fb.com/core-data/systems-scale/
Jesus Camacho Rodriguez, Principal Software Engineer, Cloudera
In this talk, Jesus describes the innovations on the journey from batch tool to full-fledged SQL enterprise data warehousing system. In particular, he shows how the community expanded the utility of the system by adding row-level transactional capabilities required for data modifications in star schema databases, introducing optimization techniques that are useful to handle today’s view hierarchies and big data operations, implementing the runtime improvements necessary to bring query latency and concurrency into the realm of interactive operation, and laying the groundwork for using Apache Hive as a relational front end to multiple storage and data systems. All these enhancements were introduced without ever compromising on the original characteristics that made the system popular.
Видео Systems @Scale 2019 - Apache Hive канала Justin Miller
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Serving Machine Learning Models at Scale Using KServe - Animesh Singh, IBM - KubeCon North AmericaAvoiding Common Mistakes In Your Ansible Playbooks - AnsibleFest 2021Spring Kafka Beyond the Basics: Lessons Learned - Kafka Summit 2020Help, My Kafka is Broken! - Kafka Summit 2020OpenDataHub Meet Up - Meeting - Monday July 12th, 2021Red Hat Insights AMA: Compare Systems with DriftData and PicardSystems @Scale 2019 - Welcome KeynoteSystems @Scale 2019 - Delos Storage for the Facebook Control PlaneSystems @Scale 2019 - Observability Infra Uber and FacebookSystems @Scale 2019 - Enabling next generation models for PYMK ScaleSystems @Scale 2019 - Continuous Deployment at Facebook ScaleSystems @Scale 2019 - Observability Infra at AffirmSystems @Scale 2019 - Continuous Deployment at Facebook ScaleSystems @Scale 2019 - Scaling Cluster Management at Facebook with TupperwareSystems @Scale 2019 - Accordion Better Memory Organization for LSM Key Value StoresSystems @Scale 2019 - Preemption in Nomad A Greedy Algorithm That ScalesSystems @Scale 2019 - Disaster Recovery at Facebook ScaleHow to use Impala's query plan and profile to fix Performance - Part 2How to use Impala's query plan and profile to fix Performance - Part 4