Загрузка страницы

Marcin Andrychowicz - Solving Rubik’s Cube with a Robot Hand

Recorded at the ML in PL 2019 Conference, the University of Warsaw, 22-24 November 2019.

Marcin Andrychowicz (OpenAI/Google Brain Zurich)

The second part of the talk is available at https://youtu.be/Va5dIxejqx0.

Slides available at http://docs.mlinpl.org/conference/2019/slides/marcin_andrychowicz_mlinpl2019.pdf

Abstract:
I will describe how we can use Reinforcement Learning (RL) to train control policies for physical robots. I'll discuss the issue of transferring control policies from a simulator to the real world and present the technique of Automatic Domain Randomization, which relies on randomizing the appearance as well as the dynamics of the simulated environment and automatically generates a distribution over randomized environments of ever-increasing difficulty. In particular, I'll focus on the problem of dexterous in-hand manipulation with a humanoid hand (https://openai.com/blog/solving-rubiks-cube/).

Видео Marcin Andrychowicz - Solving Rubik’s Cube with a Robot Hand канала ML in PL
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
18 сентября 2020 г. 0:16:18
00:59:47
Другие видео канала
Rafał Pilarczyk: Is Artificial Intelligence a threat to musicians? – Music generation techniquesRafał Pilarczyk: Is Artificial Intelligence a threat to musicians? – Music generation techniquesPhilippe Preux - Bits of Reinforcement Learning | ML in PL 23Philippe Preux - Bits of Reinforcement Learning | ML in PL 23B. Ludwiczuk, K. Jasinska-Kobus (Allegro) - Batch construction strategies in deep metric learningB. Ludwiczuk, K. Jasinska-Kobus (Allegro) - Batch construction strategies in deep metric learningAdam Paszke: PyTorch 1.0: now and in the futureAdam Paszke: PyTorch 1.0: now and in the futureAdam Podraza: Applied time series forecasting using machine learningAdam Podraza: Applied time series forecasting using machine learningGül Varol - Learning human body representations from visual dataGül Varol - Learning human body representations from visual dataDavid Haber - Opportunities and Challenges when Building AI for Autonomous FlightDavid Haber - Opportunities and Challenges when Building AI for Autonomous FlightAdam Gonczarek (Alphamoon) – Intelligent Document ProcessingAdam Gonczarek (Alphamoon) – Intelligent Document ProcessingJonasz Pamuła (RTB House) – ML Challenges in cookieless worldJonasz Pamuła (RTB House) – ML Challenges in cookieless worldJoão Henriques - Mapping environments with deep networks and spatial memoriesJoão Henriques - Mapping environments with deep networks and spatial memoriesKrzysztof Geras (NYU): "Towards Solving Breast Cancer Screening Diagnosis with Deep Learning"Krzysztof Geras (NYU): "Towards Solving Breast Cancer Screening Diagnosis with Deep Learning"Stanisław Jastrzębski - Deep Learning in the Light of the Simplicity Bias | MLSS Kraków 2023Stanisław Jastrzębski - Deep Learning in the Light of the Simplicity Bias | MLSS Kraków 2023How to learn classifier chains using positive-unlabelled multi-label data? | ML in PL 22How to learn classifier chains using positive-unlabelled multi-label data? | ML in PL 22Yoshua Bengio – Cognitively-inspired inductive biases for higher-level cognitionYoshua Bengio – Cognitively-inspired inductive biases for higher-level cognitionTomasz Grel (Nvidia): Faster Deep Learning with mixed precision and multiple GPUsTomasz Grel (Nvidia): Faster Deep Learning with mixed precision and multiple GPUsPanel Discussion – Women in MLPanel Discussion – Women in MLMichał Jamroż - Class fitting in residual convolutional networks | ML in PL 23Michał Jamroż - Class fitting in residual convolutional networks | ML in PL 23Sebastian Cygert - Toward continually learning models | ML in PL 23Sebastian Cygert - Toward continually learning models | ML in PL 23Barbara Rychalska - Neural Machine Translation: achievements, challenges and the way forwardBarbara Rychalska - Neural Machine Translation: achievements, challenges and the way forwardStanisław Jastrzębski - Gradient Alignment: When Deep Networks Work, and When They Don'tStanisław Jastrzębski - Gradient Alignment: When Deep Networks Work, and When They Don't
Яндекс.Метрика