Загрузка страницы

New course with Google Cloud: Reinforcement Learning from Human Feedback (RLHF)

Enroll now: https://bit.ly/48aqPrK

Large language models (LLMs) are trained on human-generated text, but additional methods are needed to align an LLM with human values and preferences. Reinforcement Learning from Human Feedback (RLHF) is currently the main method for aligning LLMs to make them more helpful, honest, and safe.

In this course, you will gain a conceptual understanding of the RLHF training process, and then practice applying RLHF to tune an LLM. You will:

- Explore the two datasets (“preference” and “prompt”) that are used in RLHF training.
- Use the open source Google Cloud Pipeline Components Library to fine-tune the Llama 2 model with RLHF.
- Assess the tuned LLM against the original base model by comparing loss curves and using the “Side-by-Side (SxS)” method.

Join instructor Nikita Namjoshi, Developer Advocate for Generative AI at Google Cloud, for this learning adventure. Prepare to include Reinforcement Learning from Human Feedback in your skillset.

Learn more: https://bit.ly/48aqPrK

Видео New course with Google Cloud: Reinforcement Learning from Human Feedback (RLHF) канала DeepLearningAI
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
13 декабря 2023 г. 19:11:37
00:03:27
Другие видео канала
New course with Hugging Face: Quantization in Depth 🤗New course with Hugging Face: Quantization in Depth 🤗DeepLearning.AI NLP Learner Community Event ft. Mo RebaieDeepLearning.AI NLP Learner Community Event ft. Mo Rebaie#2 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 1, Lesson 2]#2 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 1, Lesson 2]#9 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 1]#9 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 1]#26 AI for Good Specialization [Course 1, Week 2, Lesson 2]#26 AI for Good Specialization [Course 1, Week 2, Lesson 2]#29 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 5]#29 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 5]#28 Machine Learning Specialization [Course 1, Week 2, Lesson 2]#28 Machine Learning Specialization [Course 1, Week 2, Lesson 2]#30 Machine Learning Specialization [Course 1, Week 2, Lesson 2]#30 Machine Learning Specialization [Course 1, Week 2, Lesson 2]#27 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 3]#27 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 3]Addressing Data Mismatch (C3W2L06)Addressing Data Mismatch (C3W2L06)Augmenting Data (TensorFlow in Practice)Augmenting Data (TensorFlow in Practice)#BeADeepLearner like Matt Struble with DeepLearning.AI#BeADeepLearner like Matt Struble with DeepLearning.AI#5 AI for Good Specialization [Course 1, Week 1, Lesson 2]#5 AI for Good Specialization [Course 1, Week 1, Lesson 2]#26 Machine Learning Specialization [Course 1, Week 2, Lesson 2]#26 Machine Learning Specialization [Course 1, Week 2, Lesson 2]#18 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 10]#18 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 10]#12 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 4]#12 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 4]#25 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 1]#25 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 1]deeplearning.ai Learner Community Event ft. Roger Smithdeeplearning.ai Learner Community Event ft. Roger Smith#22 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 14]#22 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 2, Lesson 14]#28 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 4]#28 Machine Learning Engineering for Production (MLOps) Specialization [Course 1, Week 3, Lesson 4]#BeADeepLearner like Yudhiesh Ravindran with DeepLearning.AI#BeADeepLearner like Yudhiesh Ravindran with DeepLearning.AI
Яндекс.Метрика