Embodied Multimodal Intelligence with Foundation Models - Dr. Oier Mees (UC Berkeley)

Despite considerable progress in robot learning, most real-world robots remain confined to a narrow set of preprogrammed behaviors, falling short of public expectations. As robots become more ubiquitous in human-centered environments, the need for “generalist” robots grows: how can we scale robot learning systems to generalize and adapt, enabling them to perform a wide range of everyday tasks in unstructured settings based on arbitrary user instructions? In this talk, I will discuss the challenges and opportunities in building robot foundation models and outline the key ingredients for developing generalist robot policies—including cross-embodied learning, multimodal alignment, and scalable learning and evaluation procedures. I will present the first instantiation of such a model, capable of performing bimanual manipulation, visual navigation, quadruped locomotion, single-arm manipulation, and even aviation. I will then discuss how this model serves as a pre-trained backbone for downstream tasks, including humanoid control. Finally, I will show how incorporating intelligent reasoning not only enables robots to use common sense to think before acting, but also significantly enhances their generalization, interpretability, and ability to interact effectively with humans.

Видео Embodied Multimodal Intelligence with Foundation Models - Dr. Oier Mees (UC Berkeley) канала Soft Robotics Lab [SRL] / ETH Zurich

Комментарии отсутствуют

Информация о видео

15 марта 2025 г. 19:51:19

01:08:57

Soft Robotics Lab [SRL] / ETH Zurich

Правообладателям

Жалоба на материал Недопустимый материал Нарушение авторских прав

Комментарии

Другие видео канала

Embodied Multimodal Intelligence with Foundation Models - Dr. Oier Mees (UC Berkeley)

Advances in self-supervised multimodal learning - Prof. Dr. Hilde Kuehne (Tuebingen AI Center)

Meta-Learning via Classifier(-free) Diffusion Guidance

Building Generalist Robots with Agility via Learning and Control - Prof. Dr. Guanya Shi (CMU)

Towards Next-Gen 3D Reconstruction and Generation - Prof. Dr. Lingjie Liu (UPenn)

RAPTOR: Rapid Aerial Pickup and Transport of Objects by Robots

Advancing Robust Controllers with Reinforcement Learning | ETH Zürich Real World Robotics Tutorial 6

SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning with Normalizing Flow Policies

Planar Modeling and Sim-to-Real of a Tethered Multimaterial Soft Swimmer Driven by Peano-HASELs

Multidirectional Filamented Light Biofabrication Creates Aligned and Contractile Cardiac Tissues

Thomas Buchner - PhD Defense

DeSKO: Stability-Assured Robust Control with a Deep Stochastic Koopman Operator

Simple Models, Real Swimming: Digital Twins for Tendon-Driven Underwater Robots

Identifying Kinematics and Dynamics of Robots | ETH Zürich Real World Robotics Tutorial 3

SMORS: A soft multirotor UAV for multimodal locomotion and robust interaction

Flying Hydraulically Amplified Electrostatic Gripper System for Aerial Object Manipulation

[manipulation RL tutorial] faive_gym: train your own robot hand with IsaacGym in the GPU

Bringing Robots to the World | ETH Zürich Real World Robotics Tutorial 7

Pathway to Robotic Intelligence - Prof. Dr. Pulkit Agrawal (MIT)

ICLR 2022 - DeSKO: Stability-Assured Robust Control with a Deep Stochastic Koopman Operator

Soft and Perceptive Robots - Prof. Dr. Perla Maiolino (University of Oxford)