But what is a GPT? Visual intro to transformers | Chapter 5, Deep Learning
Unpacking how large language models work under the hood
Early view of the next chapter for patrons: https://3b1b.co/early-attention
Special thanks to these supporters: https://3b1b.co/lessons/gpt#thanks
To contribute edits to the subtitles, visit https://translate.3blue1brown.com/
Other recommended resources on the topic:
Richard Turner's introduction is one of the best starting places:
https://arxiv.org/pdf/2304.10557.pdf
Coding a GPT with Andrej Karpathy:
https://youtu.be/kCc8FmEb1nY
Introduction to self-attention by John Hewitt:
https://web.stanford.edu/class/cs224n/readings/cs224n-self-attention-transformers-2023_draft.pdf
History of language models by Brit Cruise:
https://youtu.be/OFS90-FX6pg
Paper about examples like the “woman - man” one presented here:
https://arxiv.org/pdf/1301.3781.pdf
------------------
Timestamps
0:00 - Predict, sample, repeat
3:03 - Inside a transformer
6:36 - Chapter layout
7:20 - The premise of Deep Learning
12:27 - Word embeddings
18:25 - Embeddings beyond words
20:22 - Unembedding
22:22 - Softmax with temperature
26:03 - Up next
------------------
These animations are largely made using a custom Python library, manim. See the FAQ comments here:
https://3b1b.co/faq#manim
https://github.com/3b1b/manim
https://github.com/ManimCommunity/manim/
All code for specific videos is visible here:
https://github.com/3b1b/videos/
The music is by Vincent Rubinetti.
https://www.vincentrubinetti.com
https://vincerubinetti.bandcamp.com/album/the-music-of-3blue1brown
https://open.spotify.com/album/1dVyjwS8FBqXhRunaG5W5u
------------------
3blue1brown is a channel about animating math, in all senses of the word animate. If you're reading the bottom of a video description, I'm guessing you're more interested than the average viewer in lessons here. It would mean a lot to me if you chose to stay up to date on new ones, either by subscribing here on YouTube or otherwise following on whichever platform below you check most regularly.
Mailing list: https://3blue1brown.substack.com
Twitter: https://twitter.com/3blue1brown
Instagram: https://www.instagram.com/3blue1brown
Reddit: https://www.reddit.com/r/3blue1brown
Facebook: https://www.facebook.com/3blue1brown
Patreon: https://patreon.com/3blue1brown
Website: https://www.3blue1brown.com
Video "But what is a GPT? Visual intro to transformers | Chapter 5, Deep Learning" from the 3Blue1Brown channel