SREcon20 Americas - Observing from Incidents
Observing from Incidents
Cory Watson
Despite thousands of squawking alerts and a morass of dashboards, our complex systems remain firmly mysterious. Incidents continue to pop up in places that, frankly, they should not. In this talk, we'll leverage techniques from dozens of companies to learn from successes and failures, spread that hard-earned knowledge via observability and visualizations, and productize the process internally to drive down incident impact, improve customer experience, and reduce stress.
View the full SREcon20 Americas program at https://www.usenix.org/conference/srecon20americas/program
Video: SREcon20 Americas - Observing from Incidents, from the USENIX channel