Keynote: How Spotify Accidentally Deleted All its Kube Clusters with No User Impact - David Xia
Join us for Kubernetes Forums Seoul, Sydney, Bengaluru and Delhi - learn more at kubecon.io
Don't miss KubeCon + CloudNativeCon 2020 events in Amsterdam March 30 - April 2, Shanghai July 28-30 and Boston November 17-20! Learn more at kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects
Keynote: How Spotify Accidentally Deleted All its Kube Clusters with No User Impact - David Xia, Infrastructure Engineer, Spotify
During Spotify's Kubernetes migration, David's team deleted most of their production Kubernetes clusters. Accidentally. Twice. With little to no user impact. David shares how they recovered and learned to operate many clusters automatically and safely.
In 2017, Spotify planned the migration of hundreds of teams, thousands of services, and tens of thousands of hosts to Google Kubernetes Engine (GKE). In the second half of 2018, Spotify migrated 50 teams and hundreds of services, including critical ones, onto multiple production clusters.
David describes what led to the cluster deletions and how they barely affected users. Since the postmortem, Spotify has minimized downtime and human error by declaratively defining clusters in code with Terraform, backing up and restoring clusters with Ark, and increasing scalability and availability by running many more clusters.
https://sched.co/MQbb
Video "Keynote: How Spotify Accidentally Deleted All its Kube Clusters with No User Impact - David Xia" from the CNCF [Cloud Native Computing Foundation] channel
Video information
May 22, 2019, 23:18:20
00:20:23