Загрузка страницы

Stanford CS234: Reinforcement Learning | Winter 2019 | Lecture 5 - Value Function Approximation

For more information about Stanford’s Artificial Intelligence professional and graduate programs, visit: https://stanford.io/ai

Professor Emma Brunskill, Stanford University
http://onlinehub.stanford.edu/

Professor Emma Brunskill
Assistant Professor, Computer Science
Stanford AI for Human Impact Lab
Stanford Artificial Intelligence Lab
Statistical Machine Learning Group

To follow along with the course schedule and syllabus, visit: http://web.stanford.edu/class/cs234/index.html

0:00 Introduction
1:19 Class Structure
3:18 Value Function Approximation (VFA)
4:26 Motivation for VFA
5:01 Benefits of Generalization
10:03 Function Approximators
11:16 Review: Gradient Descent
13:47 Value Function Approximation for Policy Evaluation with an Oracle
15:11 Stochastic Gradient Descent
18:02 Model Free VFA Policy Evaluation
18:22 Model Free VFA Prediction / Policy Evaluation
19:06 Feature Vectors
30:06 MC Linear Value Function Approacimation for Policy Evaluation
35:48 Baird (1995)-Like Example with MC Policy Evaluation
43:55 Convergence Guarantees for Linear Value Function Approximation for Policy Evaluation: Preliminaries
50:43 Batch Monte Carlo Value Function Approximation
53:48 Recall: Temporal Difference Learning w/ Lookup Table
54:42 Temporal Difference (TD(0)) Learning with Value Function Approximation
57:40 TD(0) Linear Value Function Approximation for Policy Evaluation
58:10 Baird Example with TD(0) On Policy Evaluation

Видео Stanford CS234: Reinforcement Learning | Winter 2019 | Lecture 5 - Value Function Approximation канала Stanford Online
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
30 марта 2019 г. 10:16:44
01:22:27
Другие видео канала
Stanford Seminar - Fight over the Law of Software APIs & stories from Electronic Frontier FoundationStanford Seminar - Fight over the Law of Software APIs & stories from Electronic Frontier FoundationStanford CS224W: Machine Learning with Graphs | 2021 | Lecture 17.3 - Cluster GCN: Scaling up GNNsStanford CS224W: Machine Learning with Graphs | 2021 | Lecture 17.3 - Cluster GCN: Scaling up GNNsStanford Seminar - Intelligence Augmentation through the Lens of Interactive Data VisualizationStanford Seminar - Intelligence Augmentation through the Lens of Interactive Data VisualizationStanford Seminar - Get in touch: Tactile perception for human-robot systemsStanford Seminar - Get in touch: Tactile perception for human-robot systemsLearner Spotlight: Andrew PelosiLearner Spotlight: Andrew PelosiStanford Seminar - What Asia Means for the Next Generation of Silicon Valley EntrepreneursStanford Seminar - What Asia Means for the Next Generation of Silicon Valley EntrepreneursStanford Seminar - Strategies for Student EntrepreneursStanford Seminar - Strategies for Student EntrepreneursStanford Seminar - Entrepreneurial Lessons from Self-Driving CarsStanford Seminar - Entrepreneurial Lessons from Self-Driving CarsProduct Strategy Example: Product Management Interview PrepProduct Strategy Example: Product Management Interview PrepStanford Seminar - Enntrepreneurial Thought Leaders: Minnie Ingersoll of ShiftStanford Seminar - Enntrepreneurial Thought Leaders: Minnie Ingersoll of ShiftStanford CS229M - Lecture 16: Implicit regularization in classification problemsStanford CS229M - Lecture 16: Implicit regularization in classification problemsStanford Seminar - Human-AI Interaction Under Societal DisagreementStanford Seminar - Human-AI Interaction Under Societal DisagreementStanford Seminar - Entrepreneurial Thought Leaders: Stewart Butterfield of SlackStanford Seminar - Entrepreneurial Thought Leaders: Stewart Butterfield of SlackStudent Spotlight: Daphne Wallbridge talks about the Creativity and Design Thinking ProgramStudent Spotlight: Daphne Wallbridge talks about the Creativity and Design Thinking ProgramStanford Seminar - Leslie Field of Stanford UniversityStanford Seminar - Leslie Field of Stanford UniversityWebinar - Big Breaches: What We Learned From the World’s Most Disruptive Cybersecurity AttacksWebinar - Big Breaches: What We Learned From the World’s Most Disruptive Cybersecurity AttacksStanford Seminar - Interview with Amy Chang of AccompanyStanford Seminar - Interview with Amy Chang of AccompanyStanford Webinar - Natural Language Understanding Student Project Showcase - Plus AI Program Q&AStanford Webinar - Natural Language Understanding Student Project Showcase - Plus AI Program Q&ABuilding Energy Efficiency: Technology, Policy & Policy (bee.stanford.edu)Building Energy Efficiency: Technology, Policy & Policy (bee.stanford.edu)Stanford CS224W: Machine Learning with Graphs | 2021 | Lecture 10.2 - Knowledge Graph CompletionStanford CS224W: Machine Learning with Graphs | 2021 | Lecture 10.2 - Knowledge Graph CompletionStanford Workshop - Innovation at Work Prototyping PreviewStanford Workshop - Innovation at Work Prototyping Preview
Яндекс.Метрика