AgiBot GO-1: The Evolution of Generalist Embodied Foundation Model from VLA to ViLLA
Today, AgiBot launches Genie Operator-1 (GO-1), a generalist embodied foundation model. GO-1 introduces the novel Vision-Language-Latent-Action (ViLLA) framework, which combines a Vision-Language Model (VLM) with a Mixture of Experts (MoE). The VLM uses internet-scale heterogeneous data to establish a solid foundation for scene and object understanding. The MoE consists of two key components: the Latent Planner, which learns from cross-embodiment and human operation data to develop general action understanding, and the Action Expert, which uses over a million real robot demonstrations to achieve high-frequency, dexterous manipulation.
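To make the data flow in the description concrete, here is a toy sketch of how an observation might pass through a VLM, a Latent Planner, and an Action Expert in turn. It is only an illustration under stated assumptions: every name (`Observation`, `vlm_encode`, `latent_planner`, `action_expert`, `villa_step`), the token vocabulary size, and the arithmetic inside each stage are hypothetical placeholders, not AgiBot's code or API, and the real components are learned neural networks.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Observation:
    image: List[float]   # stand-in for camera pixels
    instruction: str     # natural-language task description


def vlm_encode(obs: Observation) -> List[float]:
    """Hypothetical VLM stand-in: fuse image and text into one feature vector."""
    text_feature = float(len(obs.instruction.split()))
    return obs.image + [text_feature]


def latent_planner(features: List[float], num_tokens: int = 4) -> List[int]:
    """Hypothetical Latent Planner stand-in: emit discrete latent action tokens.

    The vocabulary size of 8 is an arbitrary assumption for this toy example.
    """
    total = sum(features)
    return [int(total + i) % 8 for i in range(num_tokens)]


def action_expert(features: List[float], latent_tokens: List[int],
                  action_dim: int = 3) -> List[List[float]]:
    """Hypothetical Action Expert stand-in: one low-level action per latent token."""
    scale = sum(features) / len(features)
    return [[scale * (tok + 1) / 8.0] * action_dim for tok in latent_tokens]


def villa_step(obs: Observation) -> List[List[float]]:
    """Chain the three stages: perception, latent plan, low-level actions."""
    features = vlm_encode(obs)
    latent_tokens = latent_planner(features)
    return action_expert(features, latent_tokens)


if __name__ == "__main__":
    obs = Observation(image=[0.1, 0.2, 0.3], instruction="pick up the cup")
    for action in villa_step(obs):
        print(action)
```

The point of the sketch is the ordering: the latent plan sits between perception and motor output, which is the step that the name ViLLA adds to a plain vision-language-action pipeline.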
Video "AgiBot GO-1: The Evolution of Generalist Embodied Foundation Model from VLA to ViLLA" from the AgiBot channel
Video information
March 10, 2025, 7:08:39
00:03:47