Загрузка страницы

New course with Predibase: Efficiently Serving LLMs

Enroll now: https://bit.ly/3IA1WLs

This course will help you build a ground-up understanding of how to serve large language model applications.

Whether you’re ready to launch your own application or just getting started building it, you will deepen your foundational knowledge of how LLMs work and better understand the performance trade-offs you must consider when building LLM applications that will serve large numbers of users.

You’ll walk through the most important optimizations that allow LLM vendors to efficiently serve models to many customers, including strategies for working with multiple fine-tuned models at once. In this course, you will:

- Learn how auto-regressive LLMs generate text one token at a time.
Implement the foundational elements of a modern LLM inference stack in code, including KV caching, continuous batching, and model quantization, and benchmark their impacts on inference throughput and latency.
- Explore the details of how LoRA adapters work, and learn how batching techniques allow different LoRA adapters to be served to multiple customers simultaneously.
- Get hands-on with Predibase’s LoRAX framework inference server to see these optimization techniques implemented in a real world LLM inference server.
- Enhance your understanding of the options you have to increase the performance and efficiency of your LLM-powered applications.

Learn more: https://bit.ly/3IA1WLs

Видео New course with Predibase: Efficiently Serving LLMs канала DeepLearningAI
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
18 марта 2024 г. 19:53:02
00:02:58
Другие видео канала
New course with Hugging Face: Quantization in Depth 🤗New course with Hugging Face: Quantization in Depth 🤗DeepLearning.AI NLP Learner Community Event ft. Mo RebaieDeepLearning.AI NLP Learner Community Event ft. Mo Rebaie#2 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 1, Lesson 2]#2 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 1, Lesson 2]#9 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 1]#9 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 1]#26 AI for Good Specialization [Course 1, Week 2, Lesson 2]#26 AI for Good Specialization [Course 1, Week 2, Lesson 2]#29 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 5]#29 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 5]#28 Machine Learning Specialization [Course 1, Week 2, Lesson 2]#28 Machine Learning Specialization [Course 1, Week 2, Lesson 2]#30 Machine Learning Specialization [Course 1, Week 2, Lesson 2]#30 Machine Learning Specialization [Course 1, Week 2, Lesson 2]#27 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 3]#27 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 3]Addressing Data Mismatch (C3W2L06)Addressing Data Mismatch (C3W2L06)Augmenting Data (TensorFlow in Practice)Augmenting Data (TensorFlow in Practice)#BeADeepLearner like Matt Struble with DeepLearning.AI#BeADeepLearner like Matt Struble with DeepLearning.AI#5 AI for Good Specialization [Course 1, Week 1, Lesson 2]#5 AI for Good Specialization [Course 1, Week 1, Lesson 2]#26 Machine Learning Specialization [Course 1, Week 2, Lesson 2]#26 Machine Learning Specialization [Course 1, Week 2, Lesson 2]#18 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 10]#18 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 10]#12 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 4]#12 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 4]#25 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 1]#25 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 1]deeplearning.ai Learner Community Event ft. Roger Smithdeeplearning.ai Learner Community Event ft. Roger Smith#22 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 14]#22 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 14]#28 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 4]#28 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 4]#BeADeepLearner like Yudhiesh Ravindran with DeepLearning.AI#BeADeepLearner like Yudhiesh Ravindran with DeepLearning.AI
Яндекс.Метрика