Загрузка...

How Did DeepSeek V4 Make V4 So Cheap?

Need to fine-tune a model without the hassle? Try out Crusoe's serverless fine-tuning today! https://www.crusoe.ai/contact-sales/serverless-preview?utm_source=bycloud&utm_medium=influencer&utm_campaign=serverlessfinetuning

After a month of delay, here is my part 1 breakdown of the DeepSeek-V4 paper. In this video, I'll be covering all the key developments they've made that you should know if you want to keep up with the frontier of AI.

I will have a part 2 that is a deep dive into their infrastructure side of developments that is a lot more advanced so stay tuned!

*thumbnail: this is the price per million tokens when the input is a cache hit. The normal price for input (cache miss) is $0.435.
Learn AI intuitively, best intro into LLMs!
https://intuitiveai.academy/
limited time code "SUMMER" for 25% off yearly plan
We just wrote a new piece on RL & RLHF!

My Newsletter
https://mail.bycloud.ai/

My Patreon
https://www.patreon.com/c/bycloud

DeepSeek-V4
[Paper] https://www.alphaxiv.org/abs/deepseek-v4
Try out my new fav place to learn how to code https://scrimba.com/?via=bycloudAI

This video is supported by the kind Patrons & YouTube Members:
🙏Spam Maj, Alex, Chris LeDoux, DX Research Group, Poof N' Inu, Deagan, Robert Zawiasa, Ryszard Warzocha, Tobe2d, Louis Muk, Akkusativ, Kevin Tai, Mark Buckler, NO U, Tony Jimenez, Ângelo Fonseca, jiye, Anushka, Asad Dhamani, Binnie Yiu, Calvin Yan, Clayton Ford, Diego Silva, Etrotta, Gonzalo Fidalgo, Handenon, Hector, Jake Disco very, Michael Brenner, Nilly K, OlegWock, Daddy Wen, Shuhong Chen, Sid_Cipher, Stefan Lorenz, Sup, tantan assawade, Thipok Tham, Thomas Di Martino, Thomas Lin, Richárd Nagyfi, Paperboy, mika, Leo, Berhane-Meskel, Kadhai Pesalam, mayssam, Bill Mangrum, nyaa, Toru Mon, Lame Plane, Matej Macak, Len Mo, saylikhapekar, ZyanSheep, THEVIERAOS, Ricardo Raphael Corona-Moreno
[Discord] https://discord.gg/NhJZGtH
[Twitter] https://twitter.com/bycloudai
[Patreon] https://www.patreon.com/bycloud
[Business Inquiries] bycloud@smoothmedia.co
[Other Inquiries] bycloudai@gmail.com
[Profile & Banner Art] https://twitter.com/pygm7
[Video Editor] @Booga04
Manim Animations created with Manimate https://www.manimate.ai/
[Ko-fi] https://ko-fi.com/bycloudai

Видео How Did DeepSeek V4 Make V4 So Cheap? канала bycloud
Яндекс.Метрика
Все заметки Новая заметка Страницу в заметки
Страницу в закладки Мои закладки
На информационно-развлекательном портале SALDA.WS применяются cookie-файлы. Нажимая кнопку Принять, вы подтверждаете свое согласие на их использование.
О CookiesНапомнить позжеПринять