- Популярные видео
- Авто
- Видео-блоги
- ДТП, аварии
- Для маленьких
- Еда, напитки
- Животные
- Закон и право
- Знаменитости
- Игры
- Искусство
- Комедии
- Красота, мода
- Кулинария, рецепты
- Люди
- Мото
- Музыка
- Мультфильмы
- Наука, технологии
- Новости
- Образование
- Политика
- Праздники
- Приколы
- Природа
- Происшествия
- Путешествия
- Развлечения
- Ржач
- Семья
- Сериалы
- Спорт
- Стиль жизни
- ТВ передачи
- Танцы
- Технологии
- Товары
- Ужасы
- Фильмы
- Шоу-бизнес
- Юмор
HRM-Text: Efficient Pretraining Beyond Scaling
Paper: HRM-Text: Efficient Pretraining Beyond Scaling (2605.20613)
Published: 20 May 2026.
Learn more on Emergent Mind: https://www.emergentmind.com/papers/2605.20613
arXiv: https://arxiv.org/abs/2605.20613
Sign up for our free trending papers email digest: https://www.emergentmind.com/subscribe
Follow us on X: https://x.com/EmergentMind
Join our Discord: https://discord.gg/BhfTC4mTXq
This presentation explores HRM-Text, a groundbreaking approach to language model pretraining that achieves competitive performance with models 2 to 7 times its size while using up to 432 times less compute and 900 times fewer training tokens. Through a dual-timescale recurrent architecture inspired by biological multi-timescale processing, combined with instruction-response training objectives and novel stabilization techniques, HRM-Text demonstrates that brute-force scaling is not the only path to capable language models. We examine the architectural innovations, training methodology, empirical results, and implications for democratizing large language model research.
Видео HRM-Text: Efficient Pretraining Beyond Scaling канала Emergent Mind
Published: 20 May 2026.
Learn more on Emergent Mind: https://www.emergentmind.com/papers/2605.20613
arXiv: https://arxiv.org/abs/2605.20613
Sign up for our free trending papers email digest: https://www.emergentmind.com/subscribe
Follow us on X: https://x.com/EmergentMind
Join our Discord: https://discord.gg/BhfTC4mTXq
This presentation explores HRM-Text, a groundbreaking approach to language model pretraining that achieves competitive performance with models 2 to 7 times its size while using up to 432 times less compute and 900 times fewer training tokens. Through a dual-timescale recurrent architecture inspired by biological multi-timescale processing, combined with instruction-response training objectives and novel stabilization techniques, HRM-Text demonstrates that brute-force scaling is not the only path to capable language models. We examine the architectural innovations, training methodology, empirical results, and implications for democratizing large language model research.
Видео HRM-Text: Efficient Pretraining Beyond Scaling канала Emergent Mind
Комментарии отсутствуют
Информация о видео
23 мая 2026 г. 9:41:47
00:02:12
Другие видео канала




![[DEV] Clawed and Dangerous: Can We Trust Open Agentic Systems?](https://i.ytimg.com/vi/SaEg8CBKF9E/default.jpg)
















