Multimodal Few-Shot Learning with Frozen Language Models | Paper Explained
❤️ Become The AI Epiphany Patreon ❤️ ► https://www.patreon.com/theaiepiphany
In this video I cover "Multimodal Few-Shot Learning with Frozen Language Models" from DeepMind. They introduce Frozen - which is able to handle both visual and textual inputs and shows good generalization capabilities to novel visual question answering datasets combined with fast binding mechanisms even though it was only trained on image captioning.
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
✅ Paper: https://arxiv.org/abs/2106.13884
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
⌚️ Timetable:
00:00 Intro
02:20 GPT-3 and emerging few-shot properties
04:20 Training procedure for Frozen
07:45 Inference
10:15 Strong generalization?
11:55 Prompting mechanisms and the hardest task
13:25 Quantitative results
19:50 Outro
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💰 BECOME A PATREON OF THE AI EPIPHANY ❤️
If these videos, GitHub projects, and blogs help you,
consider helping me out by supporting me on Patreon!
The AI Epiphany ► https://www.patreon.com/theaiepiphany
One-time donation:
https://www.paypal.com/paypalme/theaiepiphany
Much love! ❤️
Huge thank you to these AI Epiphany patreons:
Eli Mahler
Petar Veličković
Zvonimir Sabljic
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💡 The AI Epiphany is a channel dedicated to simplifying the field of AI using creative visualizations and in general, a stronger focus on geometrical and visual intuition, rather than the algebraic and numerical "intuition".
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
👋 CONNECT WITH ME ON SOCIAL
LinkedIn ► https://www.linkedin.com/in/aleksagordic/
Twitter ► https://twitter.com/gordic_aleksa
Instagram ► https://www.instagram.com/aiepiphany/
Facebook ► https://www.facebook.com/aiepiphany/
👨👩👧👦 JOIN OUR DISCORD COMMUNITY:
Discord ► https://discord.gg/peBrCpheKE
📢 SUBSCRIBE TO MY MONTHLY AI NEWSLETTER:
Substack ► https://aiepiphany.substack.com/
💻 FOLLOW ME ON GITHUB FOR COOL PROJECTS:
GitHub ► https://github.com/gordicaleksa
📚 FOLLOW ME ON MEDIUM:
Medium ► https://gordicaleksa.medium.com/
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
#frozen #deepmind #multimodal
Видео Multimodal Few-Shot Learning with Frozen Language Models | Paper Explained канала Aleksa Gordić - The AI Epiphany
In this video I cover "Multimodal Few-Shot Learning with Frozen Language Models" from DeepMind. They introduce Frozen - which is able to handle both visual and textual inputs and shows good generalization capabilities to novel visual question answering datasets combined with fast binding mechanisms even though it was only trained on image captioning.
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
✅ Paper: https://arxiv.org/abs/2106.13884
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
⌚️ Timetable:
00:00 Intro
02:20 GPT-3 and emerging few-shot properties
04:20 Training procedure for Frozen
07:45 Inference
10:15 Strong generalization?
11:55 Prompting mechanisms and the hardest task
13:25 Quantitative results
19:50 Outro
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💰 BECOME A PATREON OF THE AI EPIPHANY ❤️
If these videos, GitHub projects, and blogs help you,
consider helping me out by supporting me on Patreon!
The AI Epiphany ► https://www.patreon.com/theaiepiphany
One-time donation:
https://www.paypal.com/paypalme/theaiepiphany
Much love! ❤️
Huge thank you to these AI Epiphany patreons:
Eli Mahler
Petar Veličković
Zvonimir Sabljic
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💡 The AI Epiphany is a channel dedicated to simplifying the field of AI using creative visualizations and in general, a stronger focus on geometrical and visual intuition, rather than the algebraic and numerical "intuition".
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
👋 CONNECT WITH ME ON SOCIAL
LinkedIn ► https://www.linkedin.com/in/aleksagordic/
Twitter ► https://twitter.com/gordic_aleksa
Instagram ► https://www.instagram.com/aiepiphany/
Facebook ► https://www.facebook.com/aiepiphany/
👨👩👧👦 JOIN OUR DISCORD COMMUNITY:
Discord ► https://discord.gg/peBrCpheKE
📢 SUBSCRIBE TO MY MONTHLY AI NEWSLETTER:
Substack ► https://aiepiphany.substack.com/
💻 FOLLOW ME ON GITHUB FOR COOL PROJECTS:
GitHub ► https://github.com/gordicaleksa
📚 FOLLOW ME ON MEDIUM:
Medium ► https://gordicaleksa.medium.com/
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
#frozen #deepmind #multimodal
Видео Multimodal Few-Shot Learning with Frozen Language Models | Paper Explained канала Aleksa Gordić - The AI Epiphany
Показать
Комментарии отсутствуют
Информация о видео
2 июля 2021 г. 23:09:08
00:21:01
Другие видео канала
Feed-forward method | Neural Style Transfer #510k subscribers | joining Google DeepMind, updates, AMADay 25: Open NLLB - filtering HBS (Pt 2)Fastformer: Additive Attention Can Be All You Need | Paper ExplainedDay 24: Open NLLB - back from China, analyzing spikes, preparing HBS run (Pt 2)Day 6: Meta NLLB - data filtering (Pt. 4)Day 24: Open NLLB - back from China, filtering HBS data (Pt 3)Day 14: Open NLLB - Eval of our first run (English, Turkish, Hindi) (Pt 2.)The Vesuvius challenge breakthrough with Luke FarritorDay 4: Training 600M NLLB - data preps (Pt. 2)DeepMind's Android RL Environment - AndroidEnvHamel Husain - Building LLM Apps in ProductionDay 28: Open NLLB - debugging fuzzy dedup, training fasttext LID (Pt 3)Day 10: Open NLLB - evaluation data, filtering (Pt 3.)Day 18: Open NLLB - data loading document, GitHub tasks (Pt 1 cont.)DeepMind's TacticAI: an AI assistant for football tactics | Petar VeličkovićJarvis for Images! (demo) - run locally, no external APIsDay 15: Open NLLB - ALTI+, detecting hallucinations (Pt 2)Day 29: Open NLLB - training fasttext LID (Pt 1)LLaMA 2 w/ Thomas Scialom (LLaMA 2 lead)Day 19: Open NLLB - compute grant, downloading HBS parallel corpora (Pt 1)