Загрузка...

Embodied Chain-of-Thought Reasoning vision-language-action model to generate reasoning steps

Embodied Chain-of-Thought Reasoning (ECoT): is a vision-language-action model to generate reasoning steps in response to instructions and images before choosing a robot action, enabling better performance, interpretability, and generalization

Github ECoT: https://github.com/MichalZawalski/embodied-CoT

Model: https://huggingface.co/Embodied-CoT/ecot-openvla-7b-bridge/tree/main

Видео Embodied Chain-of-Thought Reasoning vision-language-action model to generate reasoning steps канала Mani rajan
Страницу в закладки Мои закладки
Все заметки Новая заметка Страницу в заметки