Moral Self-Correction in Large Language Models | paper explained
We explain why large language models (LLMs) suffer from "multiple personality disorder" and how they can morally self-correct when instructed to. We elucidate technical terms such as RLHF (reinforcement learning from human feedback), instruction following, and Chain-of-Thought (CoT) prompting. "Moral Self-Correction in Large Language Models", paper explained.
► Sponsor: Salad 👉 https://bit.ly/SaladCloud-Letitia
Check out our #MachineLearning Quiz Questions: https://www.youtube.com/c/AICoffeeBreak/community
📜 Ganguli, Deep, Amanda Askell, Nicholas Schiefer, Thomas Liao, Kamilė Lukošiūtė, Anna Chen, Anna Goldie et al. "The capacity for moral self-correction in large language models." arXiv preprint arXiv:2302.07459 (2023). https://arxiv.org/abs/2302.07459
Read more about RLHF: https://huggingface.co/blog/rlhf 🤗
Thanks to our Patrons who support us in Tier 2, 3, 4: 🙏
Dres. Trost GbR, Siltax, Edvard Grødem, Vignesh Valliappan, Mutual Information, Mike Ton,
Kshitij
Outline:
00:00 Large LMs are biased
01:48 Salad (Sponsor)
02:49 Question answering: Q
04:23 Language models have multiple personality disorder
05:27 RLHF explained
08:23 Instruction Following (IF) explained
09:37 CoT: Chain of Thought prompting explained
10:05 Effect of size on moral self-correction
13:07 Effect of RLHF on instruction following
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
🔥 Optionally, pay us a coffee to help with our Coffee Bean production! ☕
Patreon: https://www.patreon.com/AICoffeeBreak
Ko-fi: https://ko-fi.com/aicoffeebreak
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
🔗 Links:
AICoffeeBreakQuiz: https://www.youtube.com/c/AICoffeeBreak/community
Twitter: https://twitter.com/AICoffeeBreak
Reddit: https://www.reddit.com/r/AICoffeeBreak/
YouTube: https://www.youtube.com/AICoffeeBreak
#AICoffeeBreak #MsCoffeeBean #MachineLearning #AI #research
Music 🎵 : Illusions - Anno Domini Beats
Video editing: Nils Trost
Video "Moral Self-Correction in Large Language Models | paper explained" from the channel AI Coffee Break with Letitia
Video info: published April 25, 2023, 17:11:00 · Duration: 00:14:50