Monte Carlo Prediction
We're going to program a virtual robot to do some house cleaning for us using a technique called monte carlo prediction. i'm going to explain what it is, how it works and how we can use it for reinforcement learning.
Code for this video:
https://github.com/llSourcell/navigating_a_virtual_world_with_dynamic_programming
Justin's Winning code:
https://github.com/wagonhelm/Value-Iteration
Sakcham's runner up code:
https://github.com/sakchhams/pacman_ai
Please Subscribe! And like. And comment. That's what keeps me going.
Want more inspiration & education? Connect with me:
Twitter: https://twitter.com/sirajraval
Facebook: https://www.facebook.com/sirajology
More learning resources:
https://ocw.mit.edu/courses/aeronautics-and-astronautics/16-410-principles-of-autonomy-and-decision-making-fall-2010/lecture-notes/MIT16_410F10_lec23.pdf
http://uhaweb.hartford.edu/compsci/ccli/projects/QLearning.pdf
https://medium.com/@m.alzantot/deep-reinforcement-learning-demysitifed-episode-2-policy-iteration-value-iteration-and-q-978f9e89ddaa
https://www.cs.cmu.edu/afs/cs/project/jair/pub/volume4/kaelbling96a-html/node19.html
http://cs.stanford.edu/people/karpathy/reinforcejs/gridworld_dp.html
https://www.quora.com/How-is-policy-iteration-different-from-value-iteration
http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching_files/DP.pdf
Join us in the Wizards Slack channel:
http://wizards.herokuapp.com/
And please support me on Patreon:
https://www.patreon.com/user?u=3191693 Instagram: https://www.instagram.com/sirajraval/ Instagram: https://www.instagram.com/sirajraval/
Signup for my newsletter for exciting updates in the field of AI:
https://goo.gl/FZzJ5w
Hit the Join button above to sign up to become a member of my channel for access to exclusive content!
Видео Monte Carlo Prediction канала Siraj Raval
Code for this video:
https://github.com/llSourcell/navigating_a_virtual_world_with_dynamic_programming
Justin's Winning code:
https://github.com/wagonhelm/Value-Iteration
Sakcham's runner up code:
https://github.com/sakchhams/pacman_ai
Please Subscribe! And like. And comment. That's what keeps me going.
Want more inspiration & education? Connect with me:
Twitter: https://twitter.com/sirajraval
Facebook: https://www.facebook.com/sirajology
More learning resources:
https://ocw.mit.edu/courses/aeronautics-and-astronautics/16-410-principles-of-autonomy-and-decision-making-fall-2010/lecture-notes/MIT16_410F10_lec23.pdf
http://uhaweb.hartford.edu/compsci/ccli/projects/QLearning.pdf
https://medium.com/@m.alzantot/deep-reinforcement-learning-demysitifed-episode-2-policy-iteration-value-iteration-and-q-978f9e89ddaa
https://www.cs.cmu.edu/afs/cs/project/jair/pub/volume4/kaelbling96a-html/node19.html
http://cs.stanford.edu/people/karpathy/reinforcejs/gridworld_dp.html
https://www.quora.com/How-is-policy-iteration-different-from-value-iteration
http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching_files/DP.pdf
Join us in the Wizards Slack channel:
http://wizards.herokuapp.com/
And please support me on Patreon:
https://www.patreon.com/user?u=3191693 Instagram: https://www.instagram.com/sirajraval/ Instagram: https://www.instagram.com/sirajraval/
Signup for my newsletter for exciting updates in the field of AI:
https://goo.gl/FZzJ5w
Hit the Join button above to sign up to become a member of my channel for access to exclusive content!
Видео Monte Carlo Prediction канала Siraj Raval
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![A Random Walk & Monte Carlo Simulation || Python Tutorial || Learn Python Programming](https://i.ytimg.com/vi/BfS2H1y6tzQ/default.jpg)
![MarIQ -- Q-Learning Neural Network for Mario Kart -- 2M Sub Special](https://i.ytimg.com/vi/Tnu4O_xEmVk/default.jpg)
![6. Monte Carlo Simulation](https://i.ytimg.com/vi/OgO1gpXSUzU/default.jpg)
![](https://i.ytimg.com/vi/d7ldapHJlhA/default.jpg)
![Learn English With Brad Pitt on FRIENDS](https://i.ytimg.com/vi/0UKA1H2xpWY/default.jpg)
![Reinforcement Learning for Stock Prediction](https://i.ytimg.com/vi/05NqKJ0v7EE/default.jpg)
![Generating Pokemon with a Generative Adversarial Network](https://i.ytimg.com/vi/yz6dNf7X7SA/default.jpg)
![Time Series Prediction](https://i.ytimg.com/vi/d4Sn6ny_5LI/default.jpg)
![Algorithmic Trading on YouTube is Fake | Trading Strategies that Actually Work](https://i.ytimg.com/vi/5K8-liOIWFU/default.jpg)
![Monte Carlo Simulation](https://i.ytimg.com/vi/7ESK5SaP-bc/default.jpg)
![Kaggle Earthquake Prediction Challenge](https://i.ytimg.com/vi/TffGdSsWKlA/default.jpg)
![Bootstrapping Main Ideas!!!](https://i.ytimg.com/vi/Xz0x-8-cgaQ/default.jpg)
![Everything You Need to Know About Deep Deterministic Policy Gradients (DDPG) | Tensorflow 2 Tutorial](https://i.ytimg.com/vi/4jh32CvwKYw/default.jpg)
![Dynamic Programming](https://i.ytimg.com/vi/DiAtV7SneRE/default.jpg)
![RL 7: Monte-Carlo Method | Reinforcement Learning](https://i.ytimg.com/vi/HLnGZe-R0Xg/default.jpg)
![Watch Me Build an AI Startup](https://i.ytimg.com/vi/NzmoPqte4V4/default.jpg)
![Algorithme Monte Carlo pour faire un moteur IA de jeu](https://i.ytimg.com/vi/UFqaCkTZ65w/default.jpg)
![Yoshua Bengio Extra Footage 1: Brainstorm with students 🔴](https://i.ytimg.com/vi/g9V-MHxSCcs/default.jpg)
![Monte Carlo Integration In Python For Noobs](https://i.ytimg.com/vi/WAf0rqwAvgg/default.jpg)
![Simulation du risque de portefeuille par la méthode de Monte Carlo](https://i.ytimg.com/vi/PguchLJwwU4/default.jpg)