Actor Critic Algorithms
Reinforcement learning is hot right now! Policy gradients and deep q learning can only get us so far, but what if we used two networks to help train and AI instead of one? Thats the idea behind actor critic algorithms. I'll explain how they work in this video using the 'Doom" shooting game as an example.
Code for this video:
https://github.com/llSourcell/actor_critic
i-Nickk's winning code:
https://github.com/I-NicKK/Tic-Tac-Toe
Vignesh's runner up code:
https://github.com/tj27-vkr/Q-learning-conv-net
Taryn's Twitter:
https://twitter.com/tarynsouthern
More learning resources:
https://papers.nips.cc/paper/1786-actor-critic-algorithms.pdf
http://rll.berkeley.edu/deeprlcourse/f17docs/lecture_5_actor_critic_pdf.pdf
http://web.mit.edu/jnt/www/Papers/J094-03-kon-actors.pdf
http://mlg.eng.cam.ac.uk/rowan/files/rl/06_actorcritic.pdf
http://mi.eng.cam.ac.uk/~mg436/LectureSlides/MLSALT7/L5.pdf
Please Subscribe! And like. And comment. That's what keeps me going.
Want more inspiration & education? Connect with me:
Twitter: https://twitter.com/sirajraval
Facebook: https://www.facebook.com/sirajology
Join us in the Wizards Slack channel:
http://wizards.herokuapp.com/
And please support me on Patreon:
https://www.patreon.com/user?u=3191693 Instagram: https://www.instagram.com/sirajraval/ Instagram: https://www.instagram.com/sirajraval/
Signup for my newsletter for exciting updates in the field of AI:
https://goo.gl/FZzJ5w
Hit the Join button above to sign up to become a member of my channel for access to exclusive content!
Видео Actor Critic Algorithms канала Siraj Raval
Code for this video:
https://github.com/llSourcell/actor_critic
i-Nickk's winning code:
https://github.com/I-NicKK/Tic-Tac-Toe
Vignesh's runner up code:
https://github.com/tj27-vkr/Q-learning-conv-net
Taryn's Twitter:
https://twitter.com/tarynsouthern
More learning resources:
https://papers.nips.cc/paper/1786-actor-critic-algorithms.pdf
http://rll.berkeley.edu/deeprlcourse/f17docs/lecture_5_actor_critic_pdf.pdf
http://web.mit.edu/jnt/www/Papers/J094-03-kon-actors.pdf
http://mlg.eng.cam.ac.uk/rowan/files/rl/06_actorcritic.pdf
http://mi.eng.cam.ac.uk/~mg436/LectureSlides/MLSALT7/L5.pdf
Please Subscribe! And like. And comment. That's what keeps me going.
Want more inspiration & education? Connect with me:
Twitter: https://twitter.com/sirajraval
Facebook: https://www.facebook.com/sirajology
Join us in the Wizards Slack channel:
http://wizards.herokuapp.com/
And please support me on Patreon:
https://www.patreon.com/user?u=3191693 Instagram: https://www.instagram.com/sirajraval/ Instagram: https://www.instagram.com/sirajraval/
Signup for my newsletter for exciting updates in the field of AI:
https://goo.gl/FZzJ5w
Hit the Join button above to sign up to become a member of my channel for access to exclusive content!
Видео Actor Critic Algorithms канала Siraj Raval
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Everything You Need To Master Actor Critic Methods | Tensorflow 2 TutorialMonte Carlo PredictionReinforcement Learning - "DDPG" explainedSong GeneratorAI Gridworlds - ComputerphileAn introduction to Policy Gradient methods - Deep Reinforcement LearningDivide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning (Paper Explained)Deep Learning Frameworks 2019CS885 Lecture 7b: Actor CriticQ-Learning Explained - A Reinforcement Learning TechniqueAdvantage Actor Critic Tutorial: minA2CQ Learning Explained (tutorial)Meta LearningAn Introduction to Actor-Critic Deep RL AlgorithmsCheapest Deep Learning PC in 2020Build an AI Startup with PyTorchReinforcement Learning for Stock PredictionReinforcement Learning 6: Policy Gradients and Actor CriticsDeep Q Learning for Video Games - The Math of Intelligence #9