Reinforcement Learning with sparse rewards
In this video I dive into three advanced papers that addres the problem of the sparse reward setting in Deep Reinforcement Learning and pose interesting research directions for mastering unsupervised learning in autonomous agents.
Papers discussed:
Reinforcement Learning with Unsupervised Auxiliary Tasks - DeepMind:
https://arxiv.org/abs/1611.05397
Curiosity Driven Exploration - UC Berkeley:
https://arxiv.org/abs/1705.05363
Hindsight Experience Replay - OpenAI:
https://arxiv.org/abs/1707.01495
If you want to support this channel, here is my patreon link:
https://patreon.com/ArxivInsights --- You are amazing!! ;)
If you have questions you would like to discuss with me personally, you can book a 1-on-1 video call through Pensight: https://pensight.com/x/xander-steenbrugge
Видео Reinforcement Learning with sparse rewards канала Arxiv Insights
Papers discussed:
Reinforcement Learning with Unsupervised Auxiliary Tasks - DeepMind:
https://arxiv.org/abs/1611.05397
Curiosity Driven Exploration - UC Berkeley:
https://arxiv.org/abs/1705.05363
Hindsight Experience Replay - OpenAI:
https://arxiv.org/abs/1707.01495
If you want to support this channel, here is my patreon link:
https://patreon.com/ArxivInsights --- You are amazing!! ;)
If you have questions you would like to discuss with me personally, you can book a 1-on-1 video call through Pensight: https://pensight.com/x/xander-steenbrugge
Видео Reinforcement Learning with sparse rewards канала Arxiv Insights
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
An introduction to Reinforcement LearningVariational AutoencodersOpenAI Five: When AI beats professional gamersAn introduction to Policy Gradient methods - Deep Reinforcement LearningWhat AI have MIT been creating? - BBC ClickHindsight Experience Replay | Two Minute Papers #192'How neural networks learn' - Part I: Feature VisualizationAlphaGo - How AI mastered the hardest boardgame in historyThe Real Reason to be Afraid of Artificial Intelligence | Peter Haas | TEDxDirigoCart-Pole Reinforcement LearningLecture 14 | Deep Reinforcement LearningAlphaFold and the Grand Challenge to solve protein foldingTraining AI Without Writing A Reward Function, with Reward ModellingWhy humans learn so much faster than AIThe Molecular Basis of LifeA Short Introduction to Entropy, Cross-Entropy and KL-DivergenceBut what is a neural network? | Chapter 1, Deep learningWhy Most People FAIL to Learn Programming'How neural networks learn' - Part III: Generalization and OverfittingDeep Q Learning for Video Games - The Math of Intelligence #9