Загрузка страницы

Deep Multiagent Reinforcement Learning for Partially Observable Parameterized Environments

As software and hardware agents begin to perform tasks of genuine interest, they will be faced with environments too complex for humans to predetermine the correct actions to take. Three characteristics shared by many complex domains are 1) high-dimensional state and action spaces, 2) partial observability, and 3) multiple learning agents. To tackle such problems I will describe algorithms that combine deep neural network function approximation with reinforcement learning. First I will describe using recurrent neural networks to handle partial observability in Atari games. Next, I will describe a multiagent soccer domain: Half-Field-Offense and approaches for learning effective policies in this parameterized-continuous action space. I will conclude with ongoing work on multiagent learning in HFO.

Видео Deep Multiagent Reinforcement Learning for Partially Observable Parameterized Environments канала Microsoft Research
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
12 июля 2016 г. 23:17:33
01:17:05
Яндекс.Метрика