Foundations of Real-World Reinforcement Learning
Reinforcement learning (RL) is an approach to sequential decision making under uncertainty which formalizes the principles for designing an autonomous learning agent. The broad goal of a reinforcement learning agent is to find an optimal policy which maximizes its long-term rewards over time. Its list of applications is growing as the technology advances and continues to be further integrated into many areas, such as education, health, advertising, autonomous systems, and gaming.
By starting from the perspective of an agent which interacts with and affects its environment, RL provides an improvement upon supervised learning in situations requiring decisions, and not just predictions. In particular, it motivates exploratory actions to discover novel rewarding behavior in the environment, a hallmark of intelligent agents.
In this webinar—led by Microsoft Researchers John Langford, Partner Research Manager with over a decade of experience in reinforcement learning-related research, and Alekh Agarwal, Principal Research Manager and leader of the Reinforcement Learning group in Redmond—learn how RL works to impact real-world problems across a variety of domains.
Together, you'll explore:
■ The definition and uses of RL, from a general paradigm to its broad range of applications
■ The various benefits of using RL as well as its current challenges
■ The specific types of RL—contextual bandits, imitation learning, and strategic exploration
■ Where these cutting-edge methods might take the future of RL.
𝗥𝗲𝘀𝗼𝘂𝗿𝗰𝗲 𝗹𝗶𝘀𝘁:
■ Reinforcement learning for the real world with Dr. John Langford and Rafah Hosn (Podcast) - https://www.microsoft.com/en-us/research/podcast/reinforcement-learning-for-the-real-world-with-dr-john-langford-and-rafah-hosn/
■ Real World Reinforcement Learning (Project page) - https://www.microsoft.com/en-us/research/project/real-world-reinforcement-learning/
■ Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning (Publication) - https://www.microsoft.com/en-us/research/publication/kinematic-state-abstraction-and-provably-efficient-rich-observation-reinforcement-learning/
■ Provably efficient reinforcement learning with rich observations (Blog) - https://www.microsoft.com/en-us/research/blog/provably-efficient-reinforcement-learning-with-rich-observations/
■ ICML 2017 Tutorial on Real World Interactive Learning (Tutorial) - https://hunch.net/~rwil/
■ Machine Learning (Theory) (John Langford’s blog) - https://hunch.net/
■ Vowpal Wabbit (open source project) - https://vowpalwabbit.org/
■ Reinforcement Learning (Career opportunities) - https://www.microsoft.com/en-us/research/theme/reinforcement-learning-group/#!opportunities
■ Alekh Agarwal (researcher profile) - https://www.microsoft.com/en-us/research/people/alekha/
■ John Langford (researcher profile) - https://www.microsoft.com/en-us/research/people/jcl/
*This on-demand webinar features a previously recorded Q&A session and open captioning.
This webinar originally aired on December 5, 2019
Explore more Microsoft Research webinars: https://aka.ms/msrwebinars
Видео Foundations of Real-World Reinforcement Learning канала Microsoft Research
By starting from the perspective of an agent which interacts with and affects its environment, RL provides an improvement upon supervised learning in situations requiring decisions, and not just predictions. In particular, it motivates exploratory actions to discover novel rewarding behavior in the environment, a hallmark of intelligent agents.
In this webinar—led by Microsoft Researchers John Langford, Partner Research Manager with over a decade of experience in reinforcement learning-related research, and Alekh Agarwal, Principal Research Manager and leader of the Reinforcement Learning group in Redmond—learn how RL works to impact real-world problems across a variety of domains.
Together, you'll explore:
■ The definition and uses of RL, from a general paradigm to its broad range of applications
■ The various benefits of using RL as well as its current challenges
■ The specific types of RL—contextual bandits, imitation learning, and strategic exploration
■ Where these cutting-edge methods might take the future of RL.
𝗥𝗲𝘀𝗼𝘂𝗿𝗰𝗲 𝗹𝗶𝘀𝘁:
■ Reinforcement learning for the real world with Dr. John Langford and Rafah Hosn (Podcast) - https://www.microsoft.com/en-us/research/podcast/reinforcement-learning-for-the-real-world-with-dr-john-langford-and-rafah-hosn/
■ Real World Reinforcement Learning (Project page) - https://www.microsoft.com/en-us/research/project/real-world-reinforcement-learning/
■ Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning (Publication) - https://www.microsoft.com/en-us/research/publication/kinematic-state-abstraction-and-provably-efficient-rich-observation-reinforcement-learning/
■ Provably efficient reinforcement learning with rich observations (Blog) - https://www.microsoft.com/en-us/research/blog/provably-efficient-reinforcement-learning-with-rich-observations/
■ ICML 2017 Tutorial on Real World Interactive Learning (Tutorial) - https://hunch.net/~rwil/
■ Machine Learning (Theory) (John Langford’s blog) - https://hunch.net/
■ Vowpal Wabbit (open source project) - https://vowpalwabbit.org/
■ Reinforcement Learning (Career opportunities) - https://www.microsoft.com/en-us/research/theme/reinforcement-learning-group/#!opportunities
■ Alekh Agarwal (researcher profile) - https://www.microsoft.com/en-us/research/people/alekha/
■ John Langford (researcher profile) - https://www.microsoft.com/en-us/research/people/jcl/
*This on-demand webinar features a previously recorded Q&A session and open captioning.
This webinar originally aired on December 5, 2019
Explore more Microsoft Research webinars: https://aka.ms/msrwebinars
Видео Foundations of Real-World Reinforcement Learning канала Microsoft Research
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
The Intern Experience at Microsoft Research CambridgeBattling Tuberculosis Using Microsoft TechnologyFast and Flexible Multi-Task Classification Using Conditional Neural Adaptive ProcessesData Visualization Reaches New Heights with LayerscapeFaculty Summit 2018 IntroductionMicrosoft Pix - Take Better Photos of People, AutomaticallyImprovements on Higher Order Ambisonics ReproductionCambridge lab overview with Chris BishopGet free cloud computing time and storage on Microsoft AzureHow interns at our New England Lab impact research at MicrosoftAI Institute "Geometry of Deep Learning" 2019 [Workshop] Day 1 | Session 2De-Identifying Healthcare Data for ResearchIntelligent cloud computing lifts villages out of water povertyHow Microsoft and Novartis created Assess MS (short version)How OSIsoft and Deschutes Brewery used Microsoft Security Risk DetectionChronoZoom curriculum and technologyRecent Advances in Image Captioning, Image-Text Retrieval and…Using machine learning and AI to reduce hospital readmissionsIROS 2020 - Mixed Reality and Robotics Tutorial - Demo 1: InteractionCamera-based non-contact health sensingKinect and sign language translation (2:43)