MuZero - Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model | RL Paper explained
❤️ Become The AI Epiphany Patreon ❤️ ► https://www.patreon.com/theaiepiphany
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
MuZero - the latest agent in the lineage of AlphaGo agents. Zero human knowledge, zero rules, and cracks not only Go, Chess and Shogi but additionally it achieves SOTA on the Atari benchmark.
You'll learn about:
✔️ How can MuZero learn to play without the rules
✔️ How does it learn the dynamics/model
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
✅ Paper: https://arxiv.org/abs/1911.08265
✅ Blog: https://deepmind.com/blog/article/muzero-mastering-go-chess-shogi-and-atari-without-rules
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
⌚️ Timetable:
00:00 Overview of the AlphaGo lineage
03:00 MuZero actors explained
11:10 How can MuZero work without any rules?
14:50 MuZero learner explained
21:15 Results
25:15 Update to the search algorithm
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💰 BECOME A PATREON OF THE AI EPIPHANY ❤️
If these videos, GitHub projects, and blogs help you,
consider helping me out by supporting me on Patreon!
The AI Epiphany ► https://www.patreon.com/theaiepiphany
One-time donation:
https://www.paypal.com/paypalme/theaiepiphany
Much love! ❤️
Huge thank you to these AI Epiphany patreons:
Petar Veličković
Zvonimir Sabljic
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💡 The AI Epiphany is a channel dedicated to simplifying the field of AI using creative visualizations and in general, a stronger focus on geometrical and visual intuition, rather than the algebraic and numerical "intuition".
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
👋 CONNECT WITH ME ON SOCIAL
LinkedIn ► https://www.linkedin.com/in/aleksagordic/
Twitter ► https://twitter.com/gordic_aleksa
Instagram ► https://www.instagram.com/aiepiphany/
Facebook ► https://www.facebook.com/aiepiphany/
👨👩👧👦 JOIN OUR DISCORD COMMUNITY:
Discord ► https://discord.gg/peBrCpheKE
📢 SUBSCRIBE TO MY MONTHLY AI NEWSLETTER:
Substack ► https://aiepiphany.substack.com/
💻 FOLLOW ME ON GITHUB FOR COOL PROJECTS:
GitHub ► https://github.com/gordicaleksa
📚 FOLLOW ME ON MEDIUM:
Medium ► https://gordicaleksa.medium.com/
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
#muzero #atari #reinforcementlearning
Видео MuZero - Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model | RL Paper explained канала Aleksa Gordić - The AI Epiphany
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
MuZero - the latest agent in the lineage of AlphaGo agents. Zero human knowledge, zero rules, and cracks not only Go, Chess and Shogi but additionally it achieves SOTA on the Atari benchmark.
You'll learn about:
✔️ How can MuZero learn to play without the rules
✔️ How does it learn the dynamics/model
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
✅ Paper: https://arxiv.org/abs/1911.08265
✅ Blog: https://deepmind.com/blog/article/muzero-mastering-go-chess-shogi-and-atari-without-rules
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
⌚️ Timetable:
00:00 Overview of the AlphaGo lineage
03:00 MuZero actors explained
11:10 How can MuZero work without any rules?
14:50 MuZero learner explained
21:15 Results
25:15 Update to the search algorithm
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💰 BECOME A PATREON OF THE AI EPIPHANY ❤️
If these videos, GitHub projects, and blogs help you,
consider helping me out by supporting me on Patreon!
The AI Epiphany ► https://www.patreon.com/theaiepiphany
One-time donation:
https://www.paypal.com/paypalme/theaiepiphany
Much love! ❤️
Huge thank you to these AI Epiphany patreons:
Petar Veličković
Zvonimir Sabljic
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💡 The AI Epiphany is a channel dedicated to simplifying the field of AI using creative visualizations and in general, a stronger focus on geometrical and visual intuition, rather than the algebraic and numerical "intuition".
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
👋 CONNECT WITH ME ON SOCIAL
LinkedIn ► https://www.linkedin.com/in/aleksagordic/
Twitter ► https://twitter.com/gordic_aleksa
Instagram ► https://www.instagram.com/aiepiphany/
Facebook ► https://www.facebook.com/aiepiphany/
👨👩👧👦 JOIN OUR DISCORD COMMUNITY:
Discord ► https://discord.gg/peBrCpheKE
📢 SUBSCRIBE TO MY MONTHLY AI NEWSLETTER:
Substack ► https://aiepiphany.substack.com/
💻 FOLLOW ME ON GITHUB FOR COOL PROJECTS:
GitHub ► https://github.com/gordicaleksa
📚 FOLLOW ME ON MEDIUM:
Medium ► https://gordicaleksa.medium.com/
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
#muzero #atari #reinforcementlearning
Видео MuZero - Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model | RL Paper explained канала Aleksa Gordić - The AI Epiphany
Показать
Комментарии отсутствуют
Информация о видео
28 марта 2021 г. 20:54:51
00:28:05
Другие видео канала
Feed-forward method | Neural Style Transfer #510k subscribers | joining Google DeepMind, updates, AMADay 25: Open NLLB - filtering HBS (Pt 2)Fastformer: Additive Attention Can Be All You Need | Paper ExplainedDay 24: Open NLLB - back from China, analyzing spikes, preparing HBS run (Pt 2)Day 6: Meta NLLB - data filtering (Pt. 4)Day 24: Open NLLB - back from China, filtering HBS data (Pt 3)Day 14: Open NLLB - Eval of our first run (English, Turkish, Hindi) (Pt 2.)The Vesuvius challenge breakthrough with Luke FarritorDay 6: Meta NLLB - data filtering (Pt. 3)DeepMind's Android RL Environment - AndroidEnvDay 20: Open NLLB - analyzing HBS data (Bosnian, Croatian) (Pt 2)Day 11: Open NLLB - FSDP, on-boarding as a new-joiner (Pt 3.)Hamel Husain - Building LLM Apps in ProductionDay 28: Open NLLB - debugging fuzzy dedup, training fasttext LID (Pt 3)Day 10: Open NLLB - evaluation data, filtering (Pt 3.)Day 7: Meta NLLB - analyzing the training script, Jais paper (Pt. 4)EleutherAI Pythia w/ Hailey SchoelkopfDay 18: Open NLLB - data loading document, GitHub tasks (Pt 1 cont.)DeepMind's TacticAI: an AI assistant for football tactics | Petar VeličkovićJarvis for Images! (demo) - run locally, no external APIs