Why Deep Q Learning Needs A Target Network and Replay Memory | Course Excerpt For Cyber Monday
The two biggest innovations in deep Q learning were the introduction of the target network and the replay memory. One would think that simply bolting a deep neural network to the Q learning algorithm would be enough for a robust deep Q learning agent, but that isn't the case. In this video I'll show you how this naive implementation of the deep q learning agent fails, and spectacularly at that.
#DeepQLearning #PyTorch #ReinforcementLearning
This is an excerpt from my new course, Actor Critic Methods from Paper to Code
Learn how to turn deep reinforcement learning papers into code:
Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gives you instant access to 42 hours of instructional content plus access to future updates, added monthly.
Discounts available for Udemy students (enrolled longer than 30 days). Just send an email to sales@neuralnet.ai
https://www.neuralnet.ai/courses
Or, pickup my Udemy courses here:
Deep Q Learning:
https://www.udemy.com/course/deep-q-learning-from-paper-to-code/?couponCode=DQN-JUNE-22
Actor Critic Methods:
https://www.udemy.com/course/actor-critic-methods-from-paper-to-code-with-pytorch/?couponCode=AC-JUNE-22
Curiosity Driven Deep Reinforcement Learning
https://www.udemy.com/course/curiosity-driven-deep-reinforcement-learning/?couponCode=ICM-JUNE-22
Natural Language Processing from First Principles:
https://www.udemy.com/course/natural-language-processing-from-first-principles/?couponCode=NLP-JUNE-22
Reinforcement Learning Fundamentals
https://www.manning.com/livevideo/reinforcement-learning-in-motion
Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion: https://bit.ly/3fXHy8W
Grokking Deep Learning: https://bit.ly/3yJ14gT
Grokking Deep Reinforcement Learning: https://bit.ly/2VNAXql
Come hang out on Discord here:
https://discord.gg/Zr4VCdv
Need personalized tutoring? Help on a programming project? Shoot me an email! phil@neuralnet.ai
Website: https://www.neuralnet.ai
Github: https://github.com/philtabor
Twitter: https://twitter.com/MLWithPhil
Видео Why Deep Q Learning Needs A Target Network and Replay Memory | Course Excerpt For Cyber Monday канала Machine Learning with Phil
#DeepQLearning #PyTorch #ReinforcementLearning
This is an excerpt from my new course, Actor Critic Methods from Paper to Code
Learn how to turn deep reinforcement learning papers into code:
Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gives you instant access to 42 hours of instructional content plus access to future updates, added monthly.
Discounts available for Udemy students (enrolled longer than 30 days). Just send an email to sales@neuralnet.ai
https://www.neuralnet.ai/courses
Or, pickup my Udemy courses here:
Deep Q Learning:
https://www.udemy.com/course/deep-q-learning-from-paper-to-code/?couponCode=DQN-JUNE-22
Actor Critic Methods:
https://www.udemy.com/course/actor-critic-methods-from-paper-to-code-with-pytorch/?couponCode=AC-JUNE-22
Curiosity Driven Deep Reinforcement Learning
https://www.udemy.com/course/curiosity-driven-deep-reinforcement-learning/?couponCode=ICM-JUNE-22
Natural Language Processing from First Principles:
https://www.udemy.com/course/natural-language-processing-from-first-principles/?couponCode=NLP-JUNE-22
Reinforcement Learning Fundamentals
https://www.manning.com/livevideo/reinforcement-learning-in-motion
Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion: https://bit.ly/3fXHy8W
Grokking Deep Learning: https://bit.ly/3yJ14gT
Grokking Deep Reinforcement Learning: https://bit.ly/2VNAXql
Come hang out on Discord here:
https://discord.gg/Zr4VCdv
Need personalized tutoring? Help on a programming project? Shoot me an email! phil@neuralnet.ai
Website: https://www.neuralnet.ai
Github: https://github.com/philtabor
Twitter: https://twitter.com/MLWithPhil
Видео Why Deep Q Learning Needs A Target Network and Replay Memory | Course Excerpt For Cyber Monday канала Machine Learning with Phil
Показать
Комментарии отсутствуют
Информация о видео
30 ноября 2019 г. 22:13:33
00:32:46
Другие видео канала
![I Didn't Know You Can Do This With the Type Keyword](https://i.ytimg.com/vi/rUGnC0z4Q1E/default.jpg)
![Machine Learning Freelancer Part 5 - Hourly or Fixed Price Gigs?](https://i.ytimg.com/vi/qmc3y0-QTWc/default.jpg)
![A Crash Course In Machine Learning Part 1](https://i.ytimg.com/vi/GOepw7AxLGQ/default.jpg)
![How to Code A Neural Network From Scratch Part 2 - Processing the MNIST Data](https://i.ytimg.com/vi/DYm0w_f9Z9o/default.jpg)
![How to Code A Deep Neural Network From Scratch | PyTorch Tutorial](https://i.ytimg.com/vi/47UH4JijIk4/default.jpg)
![How to Get Your First Freelance Machine Learning Gig on Upwork](https://i.ytimg.com/vi/RqiTTGRu8zY/default.jpg)
![How Large of A Replay Buffer Do You Need? A Deeper Look at Experience Replay | Paper Analysis & Code](https://i.ytimg.com/vi/HDi0-7D9-7g/default.jpg)
![5 Artificial Intelligence Channels You Should Subscribe To](https://i.ytimg.com/vi/p3nD8oHup-g/default.jpg)
![How To Debug Deep Learning Programs | A Simple Process Anybody Can Use](https://i.ytimg.com/vi/apcQ49BpDD8/default.jpg)
![DeepMind Makes Prototyping Papers Easy with ACME](https://i.ytimg.com/vi/pFMuQWpHI5k/default.jpg)
![We already live with ASI](https://i.ytimg.com/vi/SCV1vAx-QOw/default.jpg)
![Here's How Deep Mind Coded N Step Deep Q Learning](https://i.ytimg.com/vi/XD0g2iBC80k/default.jpg)
![Basic Hyperparameter Tuning in DeepMinds ACME Framework](https://i.ytimg.com/vi/PBzUJ9SmhHg/default.jpg)
![How to code a neural network from scratch - part 1](https://i.ytimg.com/vi/jmQwYVeCUVI/default.jpg)
![Machine Learning Freelancer Part 3 - How To Find Good Machine Learning Jobs](https://i.ytimg.com/vi/6gOwZQYSfXM/default.jpg)
![Reinforcement Learning Still A Viable Path To AGI](https://i.ytimg.com/vi/SzAuSn964v8/default.jpg)
![Deep Q Learning for Malware: Black Hat Reinforcement Learning](https://i.ytimg.com/vi/yUpRvTuYJvU/default.jpg)
![Watch GTC 2023 and Win a Free RTX4280](https://i.ytimg.com/vi/r1Eo4Ld4fOQ/default.jpg)
![Thoughts on the M2 Max](https://i.ytimg.com/vi/m_yySwbnKtA/default.jpg)
![This Language Model Can Read Papers](https://i.ytimg.com/vi/BEUL2T1jdk8/default.jpg)
![Mastering Continuous Robotic Control with TD3 | Twin Delayed Deep Deterministic Policy Gradients](https://i.ytimg.com/vi/ZhFO8EWADmY/default.jpg)