
Moral Self-Correction in Large Language Models | paper explained

We explain why large language models (LLMs) suffer from multiple personality disorder and how they can morally self-correct when instructed to. We also explain technical terms such as RLHF (reinforcement learning from human feedback), "instruction following", and "Chain of Thought" (CoT) prompting.
► Sponsor: Salad 👉 https://bit.ly/SaladCloud-Letitia

Check out our #MachineLearning Quiz Questions: https://www.youtube.com/c/AICoffeeBreak/community

📜 Ganguli, Deep, Amanda Askell, Nicholas Schiefer, Thomas Liao, Kamilė Lukošiūtė, Anna Chen, Anna Goldie et al. "The capacity for moral self-correction in large language models." arXiv preprint arXiv:2302.07459 (2023). https://arxiv.org/abs/2302.07459

Read more about RLHF: https://huggingface.co/blog/rlhf 🤗

Thanks to our Patrons who support us in Tier 2, 3, 4: 🙏
Dres. Trost GbR, Siltax, Edvard Grødem, Vignesh Valliappan, Mutual Information, Mike Ton, Kshitij

Outline:
00:00 Large LMs are biased
01:48 Salad (Sponsor)
02:49 Question answering: Q
04:23 Language models have multiple personality disorder
05:27 RLHF explained
08:23 Instruction Following (IF) explained
09:37 CoT: Chain of Thought prompting explained
10:05 Effect of size on moral self-correction
13:07 Effect of RLHF on instruction following
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
🔥 Optionally, buy us a coffee to help with our Coffee Bean production! ☕
Patreon: https://www.patreon.com/AICoffeeBreak
Ko-fi: https://ko-fi.com/aicoffeebreak
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀

🔗 Links:
AICoffeeBreakQuiz: https://www.youtube.com/c/AICoffeeBreak/community
Twitter: https://twitter.com/AICoffeeBreak
Reddit: https://www.reddit.com/r/AICoffeeBreak/
YouTube: https://www.youtube.com/AICoffeeBreak

#AICoffeeBreak #MsCoffeeBean #MachineLearning #AI #research

Music 🎵 : Illusions - Anno Domini Beats
Video editing: Nils Trost

Video "Moral Self-Correction in Large Language Models | paper explained" from the channel AI Coffee Break with Letitia
Video information
Published: April 25, 2023, 17:11:00
Duration: 00:14:50