Lecture 14 | Deep Reinforcement Learning
In Lecture 14 we move from supervised learning to reinforcement learning (RL), in which an agent must learn to interact with an environment in order to maximize its reward. We formalize reinforcement learning using the language of Markov Decision Processes (MDPs), policies, value functions, and Q-value functions. We discuss several reinforcement learning algorithms, including Q-Learning, policy gradients, and Actor-Critic. We show how deep reinforcement learning has been used to play Atari games and, in AlphaGo, to achieve superhuman performance at Go.
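As a rough illustration of the agent/environment loop and the Q-Learning update mentioned above, here is a minimal tabular sketch. It is not taken from the lecture: the five-state chain environment, the `step` and `q_learning` names, and all hyperparameter values are assumptions chosen only to keep the example small. The lecture itself covers the deep (neural network) variant.

```python
import random

N_STATES = 5          # assumed toy MDP: states 0..4 on a line, state 4 is terminal
ACTIONS = (0, 1)      # 0 = move left, 1 = move right


def step(state, action):
    """Hypothetical chain environment: reward 1.0 only on reaching the last state."""
    next_state = max(state - 1, 0) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-Learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(100):  # cap episode length so every episode ends
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)  # explore
            else:
                best = max(q[state])          # exploit, breaking ties at random
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Bellman target: no bootstrapping from a terminal state
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


q_table = q_learning()
greedy_policy = [max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(N_STATES - 1)]
print(greedy_policy)
```

Under these assumptions the learned greedy policy should prefer moving right in every non-terminal state, since that is the only way to reach the rewarding state.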
Keywords: Reinforcement learning, RL, Markov decision process, MDP, Q-Learning, policy gradients, REINFORCE, actor-critic, Atari games, AlphaGo
Slides: http://cs231n.stanford.edu/slides/2017/cs231n_2017_lecture14.pdf
--------------------------------------------------------------------------------------
Convolutional Neural Networks for Visual Recognition
Instructors:
Fei-Fei Li: http://vision.stanford.edu/feifeili/
Justin Johnson: http://cs.stanford.edu/people/jcjohns/
Serena Yeung: http://ai.stanford.edu/~syyeung/
Computer Vision has become ubiquitous in our society, with applications in search, image understanding, apps, mapping, medicine, drones, and self-driving cars. Core to many of these applications are visual recognition tasks such as image classification, localization and detection. Recent developments in neural network (aka “deep learning”) approaches have greatly advanced the performance of these state-of-the-art visual recognition systems. This lecture collection is a deep dive into details of the deep learning architectures with a focus on learning end-to-end models for these tasks, particularly image classification. From this lecture collection, students will learn to implement, train and debug their own neural networks and gain a detailed understanding of cutting-edge research in computer vision.
Website:
http://cs231n.stanford.edu/
For additional learning opportunities please visit:
http://online.stanford.edu/
Video: Lecture 14 | Deep Reinforcement Learning, from the Stanford University School of Engineering channel
Video information
Date: August 11, 2017, 22:04:21
Duration: 01:04:01
Other videos from the channel
Lecture 15 | Efficient Methods and Hardware for Deep Learning
Lecture 17 - MDPs & Value/Policy Iteration | Stanford CS229: Machine Learning (Autumn 2018)
Lecture 13 | Generative Models
MIT 6.S091: Introduction to Deep Reinforcement Learning (Deep RL)
Deep Reinforcement Learning in the Enterprise: Bridging the Gap from Games to Industry
Lecture 10: Reinforcement Learning
An introduction to Reinforcement Learning
Lecture 10 | Recurrent Neural Networks
Lecture 7: Markov Decision Processes - Value Iteration | Stanford CS221: AI (Autumn 2019)
Reinforcement Learning: Machine Learning Meets Control Theory
Deep Q-Learning - Combining Neural Networks and Reinforcement Learning
Deep Reinforcement Learning Tutorial for Python in 20 Minutes
MIT 6.S191 (2019): Deep Reinforcement Learning
Lecture 16 | Adversarial Examples and Adversarial Training
Deep Q Learning Networks
RL Course by David Silver - Lecture 2: Markov Decision Process
Deep Reinforcement Learning: Neural Networks for Learning Control Laws
Foundations of Q-Learning
Stanford CS234: Reinforcement Learning | Winter 2019 | Lecture 2 - Given a Model of the World
Embeddings for Everything: Search in the Neural Network Era