Sleep Time Compute - AI That "Thinks" 24/7 (Breakthrough)
👉 Access all top AIs for $10 on https://mammouth.ai/
Join My Newsletter for Regular AI Updates 👇🏼
https://forwardfuture.ai
My Links 🔗
👉🏻 Subscribe: https://www.youtube.com/@matthew_berman
👉🏻 Twitter: https://twitter.com/matthewberman
👉🏻 Discord: https://discord.gg/xxysSXBxFW
👉🏻 Patreon: https://patreon.com/MatthewBerman
👉🏻 Instagram: https://www.instagram.com/matthewberman_ai
👉🏻 Threads: https://www.threads.net/@matthewberman_ai
👉🏻 LinkedIn: https://www.linkedin.com/company/forward-future-ai
Media/Sponsorship Inquiries ✅
https://bit.ly/44TC45V
0:00 Intro: AI That Thinks BEFORE You Ask?
0:13 Introducing Sleep-Time Compute
0:59 The Problem with Standard Test-Time Compute (Cost & Latency)
2:58 Stateful LLM Applications (Code, Docs, Chat)
3:33 Sleep Time vs. Test Time (Diagram Explained)
4:51 Why Sleep-Time is More Cost-Effective
6:00 Defining Sleep-Time Compute
6:26 Sponsor: Mammouth (Generative AI Platform)
7:18 Paper Details: How They Tested Non-Reasoning Models
9:24 Benchmarking Sleep-Time (The Juggle Example)
10:05 Models Used (GPT-4o, Claude, DeepSeek, etc.)
10:25 Results: Non-Reasoning Models (Graphs)
12:18 Results: Reasoning Models (Graphs)
13:39 Sleep Time vs. Parallel Sampling (A Big Issue)
14:41 Scaling Sleep-Time Compute
15:45 Amortizing Cost Across Queries (Why it's Cheaper!)
16:48 Predictable Queries Benefit Most
18:04 Paper Summary & Future Directions
18:40 Outro & Newsletter
Video: Sleep Time Compute - AI That "Thinks" 24/7 (Breakthrough), from the Matthew Berman channel
Video information
April 25, 2025, 21:22:26
00:19:07