Serving Machine Learning Models at Scale Using KServe - Yuzhui Liu, Bloomberg

KServe (previously known as KFServing) is a serverless, open source solution for serving machine learning models. As machine learning becomes more widely adopted in organizations, the trend is to deploy larger numbers of models. There is also an increasing need to serve models using GPUs. Because GPUs are expensive, engineers are seeking ways to serve multiple models with one GPU. The KServe community designed a Multi-Model Serving solution to scale the number of models that can be served in a Kubernetes cluster. By sharing a serving container that can host multiple models, Multi-Model Serving addresses four limitations that the current ‘one model, one service’ paradigm encounters: 1) compute resources (including the cost of public cloud), 2) maximum number of pods, 3) maximum number of IP addresses, and 4) maximum number of services. This talk will present the design of Multi-Model Serving, describe how to use it to serve models for different frameworks, and share benchmark stats that demonstrate its scalability.
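
To make the pod and IP address limits concrete, here is a rough back-of-the-envelope sketch in Python. It is not taken from the talk: the function name `pods_needed`, the model count, and the models-per-pod figure are all hypothetical values chosen for illustration. It assumes only that each pod receives its own IP address, so the pod count is also a lower bound on the IP addresses consumed.

```python
import math


def pods_needed(num_models: int, models_per_pod: int = 1, replicas: int = 1) -> int:
    """Estimate how many pods are required to host `num_models`.

    Hypothetical helper for illustration only; it is not part of KServe.
    `models_per_pod=1` corresponds to the 'one model, one service' paradigm,
    while larger values model a shared serving container.
    """
    if num_models < 0 or models_per_pod < 1 or replicas < 1:
        raise ValueError("counts must be non-negative and per-pod values >= 1")
    # Each group of `models_per_pod` models shares one pod; every group is
    # replicated `replicas` times.
    return math.ceil(num_models / models_per_pod) * replicas


# Illustrative numbers only (assumptions, not benchmark results from the talk).
NUM_MODELS = 1000
one_model_one_service = pods_needed(NUM_MODELS, models_per_pod=1)
multi_model_serving = pods_needed(NUM_MODELS, models_per_pod=50)

print(f"one model, one service: {one_model_one_service} pods (and pod IPs)")
print(f"shared serving containers: {multi_model_serving} pods (and pod IPs)")
```

Under these assumed figures, packing models into shared containers reduces the pod and IP address count by the packing factor. The actual achievable density depends on model size and available memory, which this sketch does not model.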

Video: Serving Machine Learning Models at Scale Using KServe - Yuzhui Liu, Bloomberg, from the CNCF [Cloud Native Computing Foundation] channel
Video information
October 30, 2021, 9:49:06
00:31:13