Introduction (Move 37)
Welcome to my new reinforcement learning course titled Move 37! in this 10 week course we'll go over the basics of reinforcement learning up to modern day techniques that involve neural networks called 'deep' reinforcement learning. In this first video, i'll introduce the idea of a Markov Decision Process. This is the basic mathematical framework for framing the reinforcement learning problem. We'll also briefly mention the ideas of a 'policy' and the agent-environment loop. Get hype!
Code for this video:
https://github.com/llSourcell/Introduction_Move37
Please Subscribe! And like. And comment. That's what keeps me going.
The School of AI:
https://www.theschool.ai
Want more education? Connect with me here:
Twitter: https://twitter.com/sirajraval
Facebook: https://www.facebook.com/sirajology
instagram: https://www.instagram.com/sirajraval
Github Syllabus:
https://github.com/llSourcell/Move_37_Syllabus
More learning resources:
https://towardsdatascience.com/reinforcement-learning-demystified-markov-decision-processes-part-1-bf00dda41690
https://www.cs.rice.edu/~vardi/dag01/givan1.pdf
http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching_files/MDP.pdf
https://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-825-techniques-in-artificial-intelligence-sma-5504-fall-2002/lecture-notes/Lecture20FinalPart1.pdf
https://artint.info/html/ArtInt_224.html
Join us in the Wizards Slack channel:
http://wizards.herokuapp.com/
And please support me on Patreon:
https://www.patreon.com/user?u=3191693
Signup for my newsletter for exciting updates in the field of AI:
https://goo.gl/FZzJ5w
Hit the Join button above to sign up to become a member of my channel for access to exclusive content! Join my AI community: http://chatgptschool.io/ Sign up for my AI Sports betting Bot, WagerGPT! (500 spots available):
https://www.wagergpt.co
Видео Introduction (Move 37) канала Siraj Raval
Code for this video:
https://github.com/llSourcell/Introduction_Move37
Please Subscribe! And like. And comment. That's what keeps me going.
The School of AI:
https://www.theschool.ai
Want more education? Connect with me here:
Twitter: https://twitter.com/sirajraval
Facebook: https://www.facebook.com/sirajology
instagram: https://www.instagram.com/sirajraval
Github Syllabus:
https://github.com/llSourcell/Move_37_Syllabus
More learning resources:
https://towardsdatascience.com/reinforcement-learning-demystified-markov-decision-processes-part-1-bf00dda41690
https://www.cs.rice.edu/~vardi/dag01/givan1.pdf
http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching_files/MDP.pdf
https://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-825-techniques-in-artificial-intelligence-sma-5504-fall-2002/lecture-notes/Lecture20FinalPart1.pdf
https://artint.info/html/ArtInt_224.html
Join us in the Wizards Slack channel:
http://wizards.herokuapp.com/
And please support me on Patreon:
https://www.patreon.com/user?u=3191693
Signup for my newsletter for exciting updates in the field of AI:
https://goo.gl/FZzJ5w
Hit the Join button above to sign up to become a member of my channel for access to exclusive content! Join my AI community: http://chatgptschool.io/ Sign up for my AI Sports betting Bot, WagerGPT! (500 spots available):
https://www.wagergpt.co
Видео Introduction (Move 37) канала Siraj Raval
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Really Quick Questions with Sebastian ThrunSchool of AI Research GrantsBeautiful Mathematics in the Natural World5 Ways to Increase ProductivityMove 37 Official Trailer10 Books to Learn Machine LearningHow to Find Meaningful Work In LifeHow to Learn Advanced Concepts Fast10 Ways to Learn Faster10 People that Inspire MeLearn Natural Language ProcessingSocial Media for DevelopersNeural Arithmetic Logic UnitsInternet of Things OptimizationThe Trust That BindsComputer Science CurriculumLearn Data Science in 3 MonthsArtificial CuriosityAI that Dresses ItselfSolving the Basic Game of Pong