Systems @Scale 2019 - Observability Infra Uber and Facebook
Yuri Shkuro, Software Engineer, Uber
Michael Bevilacqua-Linn, Software Engineer, Facebook
https://code.fb.com/core-data/systems-scale/
Distributed tracing systems are a tried-and-true tool for understanding systems at scale, ranging back over a decade to early research systems like X-Trace and Magpie, and popularized in industry with Google’s Dapper. Both Uber and Facebook operate large-scale distributed tracing systems, but each has a different focus. Uber’s Jaeger is used primarily as an observability tool, which gives engineers insight into failures in their microservices architecture, while Facebook has largely used its tracing system, Canopy, to get a detailed view of its web and mobile apps, including the creation of aggregate data sets with a built in trace-processing system. In this talk, Yuri and Michael walk through Canopy’s built-in trace processing, as well as Uber’s use of traces for more automated root cause analyses of distributed failures.
Видео Systems @Scale 2019 - Observability Infra Uber and Facebook канала Justin Miller
Michael Bevilacqua-Linn, Software Engineer, Facebook
https://code.fb.com/core-data/systems-scale/
Distributed tracing systems are a tried-and-true tool for understanding systems at scale, ranging back over a decade to early research systems like X-Trace and Magpie, and popularized in industry with Google’s Dapper. Both Uber and Facebook operate large-scale distributed tracing systems, but each has a different focus. Uber’s Jaeger is used primarily as an observability tool, which gives engineers insight into failures in their microservices architecture, while Facebook has largely used its tracing system, Canopy, to get a detailed view of its web and mobile apps, including the creation of aggregate data sets with a built in trace-processing system. In this talk, Yuri and Michael walk through Canopy’s built-in trace processing, as well as Uber’s use of traces for more automated root cause analyses of distributed failures.
Видео Systems @Scale 2019 - Observability Infra Uber and Facebook канала Justin Miller
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Serving Machine Learning Models at Scale Using KServe - Animesh Singh, IBM - KubeCon North AmericaAvoiding Common Mistakes In Your Ansible Playbooks - AnsibleFest 2021Spring Kafka Beyond the Basics: Lessons Learned - Kafka Summit 2020Help, My Kafka is Broken! - Kafka Summit 2020OpenDataHub Meet Up - Meeting - Monday July 12th, 2021Red Hat Insights AMA: Compare Systems with DriftData and PicardSystems @Scale 2019 - Welcome KeynoteSystems @Scale 2019 - Delos Storage for the Facebook Control PlaneSystems @Scale 2019 - Apache HiveSystems @Scale 2019 - Enabling next generation models for PYMK ScaleSystems @Scale 2019 - Continuous Deployment at Facebook ScaleSystems @Scale 2019 - Observability Infra at AffirmSystems @Scale 2019 - Continuous Deployment at Facebook ScaleSystems @Scale 2019 - Scaling Cluster Management at Facebook with TupperwareSystems @Scale 2019 - Accordion Better Memory Organization for LSM Key Value StoresSystems @Scale 2019 - Preemption in Nomad A Greedy Algorithm That ScalesSystems @Scale 2019 - Disaster Recovery at Facebook ScaleHow to use Impala's query plan and profile to fix Performance - Part 2How to use Impala's query plan and profile to fix Performance - Part 4