Загрузка...

Test-time compute — the new AI scaling law

What if the same model got smarter — just by thinking longer? That's exactly what o1 did.

For years, smarter AI meant one knob — bigger model. More parameters, more data, more GPUs. Then OpenAI shipped o1: same brain, but allowed to think longer before answering. And hard-math scores doubled — with no new training.

What's actually happening? A normal LLM (large language model) answers in one shot — left to right, no pause. A reasoning model writes a chain of thought first — a scratchpad where the model talks to itself. It tries one path, hits a dead end, tries another. The longer it scratches, the more paths it explores. That's test-time compute — thinking at answer time, not at training time. In plain English: you pay for thought, not a bigger brain.

On AIME — a hard math benchmark — the gap is wild. GPT-4o, answering fast, scored about 13%. o1, allowed to think, hit around 83%. Same family of models — just more compute at answer time. The catch: every thinking token costs money. Longer answers, bigger bill.

The new rule: make the model think, not just bigger.

Music: Markvard - Time [NCS Release] (NoCopyrightSounds)
https://ncs.io

#ai #reasoning #o1 #scaling #llm #shorts #programming

Видео Test-time compute — the new AI scaling law канала ProCode

ai ai scaling chain of thought llm machine learning o1 openai reasoning models shorts test-time compute

Комментарии отсутствуют

Информация о видео

30 мая 2026 г. 17:30:53

00:01:20

Правообладателям

Жалоба на материал Недопустимый материал Нарушение авторских прав

Комментарии

Поделиться

Другие видео канала

How does RAG read images in your PDFs? (Multimodal RAG)

Build a multi-agent system in 90 seconds

Why does var log undefined instead of throwing?

LangGraph vs LangChain — why teams switch for real agents

Binary Tree Maximum Path Sum | Blind 75 LeetCode Sheet Solved | Code Explanation in hindi

Arrays in Javascript Tutorial (Hindi/Urdu) | Javascript for beginners ( Hindi/Urdu) | push unshift

Create a Captivating Triangle Loading Animation | HTML & CSS Tutorial For beginner| In Hindi/Urdu

Why does a 2GB upload crash your Node server?

Top 5 Most Common Databases in 2022 as a Beginner #shorts #shortvideo #shortsfeed #shortsvideos

🌡️ Temperature Converter with JavaScript | HTML, CSS & JS Tutorial | Step-by-Step Guide 🚀

Tricky Javascript Interview Questions 37 #shorts #shortvideo #shortsfeed #shortsvideo #coding

Why does forEach + await silently skip?

Why does [] == false return true in JavaScript?

H100 vs B200 vs MI300X — which GPU should you train LLMs on?

Claude Opus 4.8 — 4x fewer bugs, 3x cheaper

Self-RAG: when the model decides to retrieve

🌊 Button Ripple Effect Animation with CSS and JavaScript | Create Dynamic Animation 🚀| Step by Step

🔡 Vowel Counter with JavaScript | HTML, CSS & JS Tutorial | Step-by-Step Guide 🚀

K8s rolling vs blue-green — which is safer?

When should you actually add "use client"?

Why your JS bundle has unused code (tree shaking)

Все заметки Новая заметка Страницу в заметки

Страницу в закладки Мои закладки

На информационно-развлекательном портале SALDA.WS применяются cookie-файлы. Нажимая кнопку Принять, вы подтверждаете свое согласие на их использование.

О Cookies Напомнить позже Принять