- Популярные видео
- Авто
- Видео-блоги
- ДТП, аварии
- Для маленьких
- Еда, напитки
- Животные
- Закон и право
- Знаменитости
- Игры
- Искусство
- Комедии
- Красота, мода
- Кулинария, рецепты
- Люди
- Мото
- Музыка
- Мультфильмы
- Наука, технологии
- Новости
- Образование
- Политика
- Праздники
- Приколы
- Природа
- Происшествия
- Путешествия
- Развлечения
- Ржач
- Семья
- Сериалы
- Спорт
- Стиль жизни
- ТВ передачи
- Танцы
- Технологии
- Товары
- Ужасы
- Фильмы
- Шоу-бизнес
- Юмор
How a 1B model hit 62% on text-to-SQL #llm #aiforanalysts #dataengineering #ai #aiagents #jepa
Production questions worth thinking about:
1. The dual-timescale recurrence gives the model variable compute depth per input. How would you decide how many H and L cycles a query actually needs, and what does adding cycles cost you at inference?
2. Execution accuracy scores whether the SQL returns the right rows, not whether the SQL is written the way you would write it. Where does that metric mislead you when you put a text-to-SQL model in front of real analysts?
3. The fine-tune uses 3 in-context schema demos per example so the model learns to ground SQL in whatever schema you hand it. How does that generalize to a 400-table warehouse with cryptic column names?
4. 8 percent to 62 percent came from 15 minutes on one GPU. What part of that jump is the architecture versus simply seeing Spider-shaped examples, and how would you tell them apart?
5. If you served this in-house instead of calling a frontier model, where does the hidden cost move: GPU memory, eval and guardrails, or schema drift over time?
#shorts #texttoSQL #LLM #dataengineering #aiforanalysts
Видео How a 1B model hit 62% on text-to-SQL #llm #aiforanalysts #dataengineering #ai #aiagents #jepa канала JH-Analytics | 2.0
1. The dual-timescale recurrence gives the model variable compute depth per input. How would you decide how many H and L cycles a query actually needs, and what does adding cycles cost you at inference?
2. Execution accuracy scores whether the SQL returns the right rows, not whether the SQL is written the way you would write it. Where does that metric mislead you when you put a text-to-SQL model in front of real analysts?
3. The fine-tune uses 3 in-context schema demos per example so the model learns to ground SQL in whatever schema you hand it. How does that generalize to a 400-table warehouse with cryptic column names?
4. 8 percent to 62 percent came from 15 minutes on one GPU. What part of that jump is the architecture versus simply seeing Spider-shaped examples, and how would you tell them apart?
5. If you served this in-house instead of calling a frontier model, where does the hidden cost move: GPU memory, eval and guardrails, or schema drift over time?
#shorts #texttoSQL #LLM #dataengineering #aiforanalysts
Видео How a 1B model hit 62% on text-to-SQL #llm #aiforanalysts #dataengineering #ai #aiagents #jepa канала JH-Analytics | 2.0
hrm text hierarchical reasoning model text to sql text2sql spider benchmark sql llm fine tuning llm single gpu fine tune sapient intelligence small language model dual timescale execution accuracy llm for data ai for analysts ai for data data engineering warehouse schema sql generation h100 fine tuning open source llm ai shorts llm 2026 ai news reasoning model schema grounding
Комментарии отсутствуют
Информация о видео
21 мая 2026 г. 14:30:25
00:01:04
Другие видео канала





















