Marcin Andrychowicz - Solving Rubik’s Cube with a Robot Hand
Recorded at the ML in PL 2019 Conference, the University of Warsaw, 22-24 November 2019.
Marcin Andrychowicz (OpenAI/Google Brain Zurich)
The second part of the talk is available at https://youtu.be/Va5dIxejqx0.
Slides available at http://docs.mlinpl.org/conference/2019/slides/marcin_andrychowicz_mlinpl2019.pdf
Abstract:
I will describe how we can use Reinforcement Learning (RL) to train control policies for physical robots. I'll discuss the issue of transferring control policies from a simulator to the real world and present the technique of Automatic Domain Randomization, which relies on randomizing the appearance as well as the dynamics of the simulated environment and automatically generates a distribution over randomized environments of ever-increasing difficulty. In particular, I'll focus on the problem of dexterous in-hand manipulation with a humanoid hand (https://openai.com/blog/solving-rubiks-cube/).
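The idea of a distribution over environments that widens automatically can be illustrated with a minimal sketch. This is not the implementation from the talk. It assumes each simulator parameter is sampled uniformly from a range, that the policy is occasionally evaluated with one parameter pinned to a range boundary, and that the boundary moves outward when recent scores there are high and inward when they are low. The names `ParamRange`, `sample_environment` and `update_range`, the thresholds, the buffer size and the step size are all hypothetical.

```python
import random
from dataclasses import dataclass, field


@dataclass
class ParamRange:
    """Uniform sampling range [low, high] for one simulator parameter."""
    low: float
    high: float
    step: float  # how far a boundary moves per update (assumed value)
    low_buffer: list = field(default_factory=list)
    high_buffer: list = field(default_factory=list)


def sample_environment(ranges, rng, boundary_prob=0.5):
    """Sample one parameter set; sometimes pin one parameter to a boundary.

    Returns (params, probe), where probe is None or (name, "low" | "high").
    """
    params = {name: rng.uniform(r.low, r.high) for name, r in ranges.items()}
    probe = None
    if rng.random() < boundary_prob:
        name = rng.choice(sorted(ranges))
        side = rng.choice(["low", "high"])
        params[name] = getattr(ranges[name], side)
        probe = (name, side)
    return params, probe


def update_range(r, side, score, buffer_size=5, t_low=0.2, t_high=0.8):
    """Record a boundary-probe score; move the boundary once the buffer fills."""
    buf = r.low_buffer if side == "low" else r.high_buffer
    buf.append(score)
    if len(buf) < buffer_size:
        return
    mean = sum(buf) / len(buf)
    buf.clear()
    outward = -1.0 if side == "low" else 1.0
    if mean >= t_high:
        delta = outward * r.step    # policy copes at the boundary: widen
    elif mean <= t_low:
        delta = -outward * r.step   # policy struggles at the boundary: narrow
    else:
        return
    if side == "low":
        r.low = min(r.low + delta, r.high)
    else:
        r.high = max(r.high + delta, r.low)
```

In a training loop, each episode would call `sample_environment`, run the policy in a simulator configured with the returned `params`, and pass the episode score to `update_range` whenever `probe` is not `None`. Starting every range at a single point (`low == high`) means training begins in one fixed environment and the difficulty grows only as the policy improves.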
Video: Marcin Andrychowicz - Solving Rubik’s Cube with a Robot Hand, from the ML in PL channel