Improving GPU Utilization using Kubernetes - Maulin Patel & Pradeep Venkatachalam, Google

Don’t miss out! Join us at our upcoming hybrid event: KubeCon + CloudNativeCon North America 2022 from October 24-28 in Detroit (and online!). Learn more at https://kubecon.io. The conference features presentations from developers and end users of Kubernetes, Prometheus, Envoy, and all of the other CNCF-hosted projects.

Kubernetes supports efficient utilization of resources by enabling applications to request precisely the amount of resources they need. Unlike fractional requests for CPUs, fractional requests for GPUs are not allowed in Kubernetes: GPU resources requested in the pod manifest must be an integer. This means one GPU is fully allocated to one container even if the container needs only a fraction of the GPU for its workload. Without support for fractional GPUs, GPU resources are invariably overprovisioned, leading to waste. This is especially true for inference workloads that process a handful of data samples in real time. To address this limitation, we have developed user-friendly solutions that allow a single GPU to be shared by multiple containers, thereby improving GPU utilization and saving cost. In this talk, we will demo our solutions and share performance results.
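
The integer-only constraint described above can be illustrated with a small sketch. This is not Kubernetes's own validation code and not the speakers' solution; it is a simplified Python illustration. The resource name `nvidia.com/gpu` (the name commonly exposed by NVIDIA's device plugin), the pod name, the image, and the helper functions `parse_quantity` and `gpu_request_is_valid` are all assumptions made for the example.

```python
from fractions import Fraction

# Hypothetical pod manifest, written as a Python dict for illustration.
# CPU may be fractional ("500m" = half a core); the GPU count must be a whole number.
pod = {
    "apiVersion": "v1",
    "kind": "Pod",
    "metadata": {"name": "inference-demo"},  # hypothetical name
    "spec": {
        "containers": [
            {
                "name": "model-server",  # hypothetical container
                "image": "example.com/model-server:latest",  # placeholder image
                "resources": {
                    "limits": {
                        "cpu": "500m",
                        "nvidia.com/gpu": 1,  # assumed GPU resource name
                    }
                },
            }
        ]
    },
}


def parse_quantity(value):
    """Parse a tiny subset of Kubernetes quantities: plain numbers and the 'm' (milli) suffix."""
    text = str(value)
    if text.endswith("m"):
        return Fraction(int(text[:-1]), 1000)
    return Fraction(text)


def gpu_request_is_valid(value):
    """Return True only for non-negative whole-number GPU counts."""
    quantity = parse_quantity(value)
    return quantity >= 0 and quantity.denominator == 1


limits = pod["spec"]["containers"][0]["resources"]["limits"]
print("cpu cores requested:", float(parse_quantity(limits["cpu"])))
print("gpu request valid:", gpu_request_is_valid(limits["nvidia.com/gpu"]))
print("half a GPU valid:", gpu_request_is_valid("0.5"))
```

Under these assumptions, a container that needs only a fraction of a GPU still has to ask for a whole one, which is the overprovisioning the talk sets out to address.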

Video: Improving GPU Utilization using Kubernetes - Maulin Patel & Pradeep Venkatachalam, Google, from the CNCF [Cloud Native Computing Foundation] channel
Video information
June 2, 2022, 5:40:58
00:37:53