Offline Reinforcement Learning
Offline Reinforcement Learning describes training an agent without interacting with the environment. The agent learns from previously collected experiences such as from another RL policy trained online or from a human demonstrator. This video explores two recent advancements in Offline RL!
Thanks for watching! Please Subscribe!
Paper Links:
An Optimistic Perspective on Offline Reinforcement Learning: https://arxiv.org/pdf/1907.04543.pdf
Datasets for Data-Driven Reinforcement Learning: https://arxiv.org/pdf/2004.07219.pdf
Q-Learning (Wikipedia): https://en.wikipedia.org/wiki/Q-learning
AVID (Robot in intro animation): https://bair.berkeley.edu/blog/2019/12/13/humans-cyclegan/
Nature-inspired robotics (Robot in intro animation): https://ai.googleblog.com/2020/04/exploring-nature-inspired-robot-agility.html
Видео Offline Reinforcement Learning канала Henry AI Labs
Thanks for watching! Please Subscribe!
Paper Links:
An Optimistic Perspective on Offline Reinforcement Learning: https://arxiv.org/pdf/1907.04543.pdf
Datasets for Data-Driven Reinforcement Learning: https://arxiv.org/pdf/2004.07219.pdf
Q-Learning (Wikipedia): https://en.wikipedia.org/wiki/Q-learning
AVID (Robot in intro animation): https://bair.berkeley.edu/blog/2019/12/13/humans-cyclegan/
Nature-inspired robotics (Robot in intro animation): https://ai.googleblog.com/2020/04/exploring-nature-inspired-robot-agility.html
Видео Offline Reinforcement Learning канала Henry AI Labs
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![](https://i.ytimg.com/vi/hwnXbnm9Ucw/default.jpg)
![What is Artificial Intelligence? In 5 minutes.](https://i.ytimg.com/vi/2ePf9rue1Ao/default.jpg)
![A crawling robot: a Q-learning example](https://i.ytimg.com/vi/2iNrJx6IDEo/default.jpg)
![MarI/O - Machine Learning for Video Games](https://i.ytimg.com/vi/qv6UVOQ0F44/default.jpg)
![Datasets for Data-Driven Reinforcement Learning](https://i.ytimg.com/vi/-h1KB8ps11A/default.jpg)
![AI learns to play Google Chrome Dinosaur Game || Can you beat it??](https://i.ytimg.com/vi/sB_IGstiWlc/default.jpg)
![An introduction to Reinforcement Learning](https://i.ytimg.com/vi/JgvyzIkgxF0/default.jpg)
![Reinforcement Learning Basics](https://i.ytimg.com/vi/2xATEwcRpy8/default.jpg)
![[WACV2019] Deep-Hurricane-Tracker: Tracking and forecasting extreme climate events using ConvLSTM](https://i.ytimg.com/vi/ar4C-g7F7bU/default.jpg)
![Introduction to Reinforcement Learning: Chapter 1](https://i.ytimg.com/vi/4SLGEq_HZxk/default.jpg)
![BEST SCIENCE GADGETS/TOYS THAT WILL MAKE YOU SAY WOW!](https://i.ytimg.com/vi/MPJkTyw77VA/default.jpg)
![CURL: Contrastive Unsupervised Representations for Reinforcement Learning](https://i.ytimg.com/vi/-Drowt9r4zY/default.jpg)
![Control of a Quadrotor with Reinforcement Learning](https://i.ytimg.com/vi/T0A9voXzhng/default.jpg)
![Artificial Intelligence: it will kill us | Jay Tuck | TEDxHamburgSalon](https://i.ytimg.com/vi/BrNs0M77Pd4/default.jpg)
![AI Learns to Park - Deep Reinforcement Learning](https://i.ytimg.com/vi/VMp6pq6_QjI/default.jpg)
![Off-the-Shelf Models versus Fine-Tuning!](https://i.ytimg.com/vi/imtQlCbFdCM/default.jpg)
![Full Stack Neural Search](https://i.ytimg.com/vi/4Huw6rQQDC4/default.jpg)
![OpenAI Embeddings API - (Interview Recap and Background)](https://i.ytimg.com/vi/vyf2sqhAm4Y/default.jpg)
![Reinforcement Learning in TensorFlow with TF-Agents (TF Dev Summit '19)](https://i.ytimg.com/vi/-TTziY7EmUA/default.jpg)
![AI Weekly Update - January 24th, 2022](https://i.ytimg.com/vi/G5VWnAX9vlE/default.jpg)