Monte Carlo Prediction

We're going to program a virtual robot to do some house cleaning for us using a technique called Monte Carlo prediction. I'm going to explain what it is, how it works, and how we can use it for reinforcement learning.

Code for this video:
https://github.com/llSourcell/navigating_a_virtual_world_with_dynamic_programming

Justin's Winning code:
https://github.com/wagonhelm/Value-Iteration

Sakcham's runner-up code:
https://github.com/sakchhams/pacman_ai

Please Subscribe! And like. And comment. That's what keeps me going.

Want more inspiration & education? Connect with me:
Twitter: https://twitter.com/sirajraval
Facebook: https://www.facebook.com/sirajology
More learning resources:
https://ocw.mit.edu/courses/aeronautics-and-astronautics/16-410-principles-of-autonomy-and-decision-making-fall-2010/lecture-notes/MIT16_410F10_lec23.pdf
http://uhaweb.hartford.edu/compsci/ccli/projects/QLearning.pdf
https://medium.com/@m.alzantot/deep-reinforcement-learning-demysitifed-episode-2-policy-iteration-value-iteration-and-q-978f9e89ddaa
https://www.cs.cmu.edu/afs/cs/project/jair/pub/volume4/kaelbling96a-html/node19.html
http://cs.stanford.edu/people/karpathy/reinforcejs/gridworld_dp.html
https://www.quora.com/How-is-policy-iteration-different-from-value-iteration
http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching_files/DP.pdf

Join us in the Wizards Slack channel:
http://wizards.herokuapp.com/

And please support me on Patreon:
https://www.patreon.com/user?u=3191693
Instagram: https://www.instagram.com/sirajraval/
Sign up for my newsletter for exciting updates in the field of AI:
https://goo.gl/FZzJ5w
Hit the Join button above to sign up to become a member of my channel for access to exclusive content!

Video "Monte Carlo Prediction" from the Siraj Raval channel
Video information
November 25, 2017, 1:38:11
00:10:38