CLIP: Connecting Text and Images
This video explains how CLIP from OpenAI transforms Image Classification into a Text-Image similarity matching task. This is done with Contrastive Training and Zero-Shot Pattern-Exploiting Training. Thanks for watching!
Paper Links:
Clip (Blog Post): https://openai.com/blog/clip/
VirTex: https://arxiv.org/pdf/2006.06666.pdf
ConVIRT: https://arxiv.org/pdf/2010.00747.pdf
Pattern-Exploiting Training: https://arxiv.org/pdf/2001.07676.pdf
Vision Transformer (Blog Post, Nice Animation): https://ai.googleblog.com/2020/12/transformers-for-image-recognition-at.html
Thanks for watching! Please Subscribe!
Видео CLIP: Connecting Text and Images канала Henry AI Labs
Paper Links:
Clip (Blog Post): https://openai.com/blog/clip/
VirTex: https://arxiv.org/pdf/2006.06666.pdf
ConVIRT: https://arxiv.org/pdf/2010.00747.pdf
Pattern-Exploiting Training: https://arxiv.org/pdf/2001.07676.pdf
Vision Transformer (Blog Post, Nice Animation): https://ai.googleblog.com/2020/12/transformers-for-image-recognition-at.html
Thanks for watching! Please Subscribe!
Видео CLIP: Connecting Text and Images канала Henry AI Labs
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
OpenAI CLIP: ConnectingText and Images (Paper Explained)ELECTRA: Pre-Training Text Encoders as Discriminators Rather than GeneratorsEfficientNet Paper WalkthroughVariational AutoencodersThe Most Important Computer Vision Model of 2021OpenAI CLIP - Connecting Text and Images | Paper ExplainedBART: Denoising Sequence-to-Sequence Pre-training for NLG & Translation (Explained)What It's Like To be a Computer: An Interview with GPT-3RustConf 2021 - Identifying Pokémon Cards by Hugo PeixotoOpenAI’s CLIP explained! | Examples, links to code and pretrained modelEmbeddings for Everything: Search in the Neural Network EraBig Self-Supervised Models are Strong Semi-Supervised Learners (Paper Explained)Few-Shot Learning with Reptile - Keras Code ExamplesEasy Manga Screentones in Photoshop for Beginners, simple and automaticMIT Bootcamps: How to identify business opportunities with AIGPT3: An Even Bigger Language Model - ComputerphileBrief Introduction to Image DenoisingFeature Visualization & The OpenAI microscopeSinGAN Explained! (ICCV '19 Best Paper)