Replay Memory Explained - Experience for Deep Q-Network Training
Welcome back to this series on reinforcement learning! In this video, we'll continue our discussion of deep Q-networks. Before we can move on to discussing exactly how a DQN is trained, we're first going to explain the concepts of experience replay and replay memory, which are utilized during the training process. So, let's get to it!
Sources:
Reinforcement Learning: An Introduction, Second Edition by Richard S. Sutton and Andrew G. Bartow
http://incompleteideas.net/book/RLbook2020.pdf
Playing Atari with Deep Reinforcement Learning by Deep Mind Technologies
https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf
Jürgen Schmidhuber interview: https://youtu.be/zK_x3Ba2l5Q
🕒🦎 VIDEO SECTIONS 🦎🕒
00:00 Welcome to DEEPLIZARD - Go to deeplizard.com for learning resources
00:30 Help deeplizard add video timestamps - See example in the description
05:51 Collective Intelligence and the DEEPLIZARD HIVEMIND
💥🦎 DEEPLIZARD COMMUNITY RESOURCES 🦎💥
👋 Hey, we're Chris and Mandy, the creators of deeplizard!
👀 CHECK OUT OUR VLOG:
🔗 https://youtube.com/deeplizardvlog
👉 Check out the blog post and other resources for this video:
🔗 https://deeplizard.com/learn/video/Bcuj2fTH4_4
💻 DOWNLOAD ACCESS TO CODE FILES
🤖 Available for members of the deeplizard hivemind:
🔗 https://deeplizard.com/resources
🧠 Support collective intelligence, join the deeplizard hivemind:
🔗 https://deeplizard.com/hivemind
🤜 Support collective intelligence, create a quiz question for this video:
🔗 https://deeplizard.com/create-quiz-question
🚀 Boost collective intelligence by sharing this video on social media!
❤️🦎 Special thanks to the following polymaths of the deeplizard hivemind:
Tammy
Prash
Zach Wimpee
👀 Follow deeplizard:
Our vlog: https://youtube.com/deeplizardvlog
Facebook: https://facebook.com/deeplizard
Instagram: https://instagram.com/deeplizard
Twitter: https://twitter.com/deeplizard
Patreon: https://patreon.com/deeplizard
YouTube: https://youtube.com/deeplizard
🎓 Deep Learning with deeplizard:
Fundamental Concepts - https://deeplizard.com/learn/video/gZmobeGL0Yg
Beginner Code - https://deeplizard.com/learn/video/RznKVRTFkBY
Intermediate Code - https://deeplizard.com/learn/video/v5cngxo4mIg
Advanced Deep RL - https://deeplizard.com/learn/video/nyjbcRQ-uQ8
🎓 Other Courses:
Data Science - https://deeplizard.com/learn/video/d11chG7Z-xk
Trading - https://deeplizard.com/learn/video/ZpfCK_uHL9Y
🛒 Check out products deeplizard recommends on Amazon:
🔗 https://amazon.com/shop/deeplizard
📕 Get a FREE 30-day Audible trial and 2 FREE audio books using deeplizard's link:
🔗 https://amzn.to/2yoqWRn
🎵 deeplizard uses music by Kevin MacLeod
🔗 https://youtube.com/channel/UCSZXFhRIx6b0dFX3xS8L1yQ
🔗 http://incompetech.com/
❤️ Please use the knowledge gained from deeplizard content for good, not evil.
Видео Replay Memory Explained - Experience for Deep Q-Network Training канала deeplizard
Sources:
Reinforcement Learning: An Introduction, Second Edition by Richard S. Sutton and Andrew G. Bartow
http://incompleteideas.net/book/RLbook2020.pdf
Playing Atari with Deep Reinforcement Learning by Deep Mind Technologies
https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf
Jürgen Schmidhuber interview: https://youtu.be/zK_x3Ba2l5Q
🕒🦎 VIDEO SECTIONS 🦎🕒
00:00 Welcome to DEEPLIZARD - Go to deeplizard.com for learning resources
00:30 Help deeplizard add video timestamps - See example in the description
05:51 Collective Intelligence and the DEEPLIZARD HIVEMIND
💥🦎 DEEPLIZARD COMMUNITY RESOURCES 🦎💥
👋 Hey, we're Chris and Mandy, the creators of deeplizard!
👀 CHECK OUT OUR VLOG:
🔗 https://youtube.com/deeplizardvlog
👉 Check out the blog post and other resources for this video:
🔗 https://deeplizard.com/learn/video/Bcuj2fTH4_4
💻 DOWNLOAD ACCESS TO CODE FILES
🤖 Available for members of the deeplizard hivemind:
🔗 https://deeplizard.com/resources
🧠 Support collective intelligence, join the deeplizard hivemind:
🔗 https://deeplizard.com/hivemind
🤜 Support collective intelligence, create a quiz question for this video:
🔗 https://deeplizard.com/create-quiz-question
🚀 Boost collective intelligence by sharing this video on social media!
❤️🦎 Special thanks to the following polymaths of the deeplizard hivemind:
Tammy
Prash
Zach Wimpee
👀 Follow deeplizard:
Our vlog: https://youtube.com/deeplizardvlog
Facebook: https://facebook.com/deeplizard
Instagram: https://instagram.com/deeplizard
Twitter: https://twitter.com/deeplizard
Patreon: https://patreon.com/deeplizard
YouTube: https://youtube.com/deeplizard
🎓 Deep Learning with deeplizard:
Fundamental Concepts - https://deeplizard.com/learn/video/gZmobeGL0Yg
Beginner Code - https://deeplizard.com/learn/video/RznKVRTFkBY
Intermediate Code - https://deeplizard.com/learn/video/v5cngxo4mIg
Advanced Deep RL - https://deeplizard.com/learn/video/nyjbcRQ-uQ8
🎓 Other Courses:
Data Science - https://deeplizard.com/learn/video/d11chG7Z-xk
Trading - https://deeplizard.com/learn/video/ZpfCK_uHL9Y
🛒 Check out products deeplizard recommends on Amazon:
🔗 https://amazon.com/shop/deeplizard
📕 Get a FREE 30-day Audible trial and 2 FREE audio books using deeplizard's link:
🔗 https://amzn.to/2yoqWRn
🎵 deeplizard uses music by Kevin MacLeod
🔗 https://youtube.com/channel/UCSZXFhRIx6b0dFX3xS8L1yQ
🔗 http://incompetech.com/
❤️ Please use the knowledge gained from deeplizard content for good, not evil.
Видео Replay Memory Explained - Experience for Deep Q-Network Training канала deeplizard
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![Training a Deep Q-Network - Reinforcement Learning](https://i.ytimg.com/vi/0bt0SjbS3xc/default.jpg)
![Faster QLearning with Experience Replay](https://i.ytimg.com/vi/lQB08wcuCeM/default.jpg)
![How to triple your memory by using this trick | Ricardo Lieuw On | TEDxHaarlem](https://i.ytimg.com/vi/JsC9ZHi79jo/default.jpg)
![Layers in a Neural Network explained](https://i.ytimg.com/vi/FK77zZxaBoI/default.jpg)
![Why Are 96,000,000 Black Balls on This Reservoir?](https://i.ytimg.com/vi/uxPdPpi5W4o/default.jpg)
![](https://i.ytimg.com/vi/P0EyvELTQy0/default.jpg)
![Hindsight Experience Replay | Two Minute Papers #192](https://i.ytimg.com/vi/Dvd1jQe3pq0/default.jpg)
![Backpropagation explained | Part 1 - The intuition](https://i.ytimg.com/vi/XE3krf3CQls/default.jpg)
![Deep Reinforcement Learning: Neural Networks for Learning Control Laws](https://i.ytimg.com/vi/IUiKAD6cuTA/default.jpg)
![Hindsight Experience Replay in Reinforcement Learning - The Core Idea in 17 min.](https://i.ytimg.com/vi/7CxdWEpe6NU/default.jpg)
![Neural Network Learns to Play Snake using Deep Reinforcement Learning](https://i.ytimg.com/vi/i0Pkgtbh1xw/default.jpg)
![Eight-time World Memory Champion Dominic O'Brien: Learn how to learn](https://i.ytimg.com/vi/ACw5YVgg4lc/default.jpg)
![Policy Gradient Theorem Explained - Reinforcement Learning](https://i.ytimg.com/vi/cQfOQcpYRzE/default.jpg)
![Reinforcement Learning: Machine Learning Meets Control Theory](https://i.ytimg.com/vi/0MNVhXEX9to/default.jpg)
![Build Deep Q-Network - Reinforcement Learning Code Project](https://i.ytimg.com/vi/PyQNfsGUnQA/default.jpg)
![How Machines Learn](https://i.ytimg.com/vi/R9OHn5ZF4Uo/default.jpg)
![CUDA Explained - Why Deep Learning uses GPUs](https://i.ytimg.com/vi/6stDhEA0wFQ/default.jpg)
![Deep Q-Learning - Combining Neural Networks and Reinforcement Learning](https://i.ytimg.com/vi/wrBUkpiRvCA/default.jpg)
![Q-Learning Explained - A Reinforcement Learning Technique](https://i.ytimg.com/vi/qhRNvCVVJaA/default.jpg)
![Multi-Agent Deep Deterministic Policy Gradient (MADDPG)](https://i.ytimg.com/vi/Ku5h_FBL6Lg/default.jpg)