DeepMind x UCL RL Lecture Series - Policy-Gradient and Actor-Critic methods [9/13]
Research Scientist Hado van Hasselt covers policy-gradient algorithms, which learn policies directly, and actor-critic algorithms, which combine policy learning with value predictions for more efficient learning.
Slides: https://dpmd.ai/policygradient
Full video lecture series: https://dpmd.ai/DeepMindxUCL21
Video from the Google DeepMind channel.