Загрузка страницы

Image Captioning, VQA and Image or Text Embedding Extraction using BLIP |BLIP | Karndeep Singh

BLIP is a new VLP framework that transfers flexibly to vision-language understanding and generation tasks. BLIP effectively utilizes noisy web data by bootstrapping the captions, where a captioner generates synthetic captions and a filter removes the noisy ones. BLIP achieves state-of-the-art results on a wide range of vision-language tasks, such as image-text retrieval (+2.7% in average recall@1), image captioning (+2.8% in CIDEr), and VQA (+1.6% in VQA score). BLIP also demonstrates strong generalization ability when directly transferred to video language tasks in a zero-shot manner.

Github : https://github.com/salesforce/BLIP Notebook
Link : https://github.com/karndeepsingh/self-supervised-learning

Connect with me on :
1. LinkedIn: https://www.linkedin.com/in/karndeepsingh/
2. Telegram Group: https://telegram.me/datascienceclubachievers
3. Github: https://www.github.com/karndeepsingh

Видео Image Captioning, VQA and Image or Text Embedding Extraction using BLIP |BLIP | Karndeep Singh канала Karndeep Singh
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
27 ноября 2022 г. 15:00:08
00:11:41
Другие видео канала
Finetune LLAMA2 on custom dataset efficiently with QLoRA | Detailed Explanation| LLM| Karndeep SinghFinetune LLAMA2 on custom dataset efficiently with QLoRA | Detailed Explanation| LLM| Karndeep SinghLLAMA2 with Langchain Integration and its Use-cases Tutorial | LLMs | Karndeep SinghLLAMA2 with Langchain Integration and its Use-cases Tutorial | LLMs | Karndeep SinghBuild Production-Level ML & Data Pipeline with Kedro: Step-by-Step Guide | MLOps | Karndeep SinghBuild Production-Level ML & Data Pipeline with Kedro: Step-by-Step Guide | MLOps | Karndeep SinghExploring New HuggingFace's Transformers Agents with Usecases | Colab Demo | Karndeep SinghExploring New HuggingFace's Transformers Agents with Usecases | Colab Demo | Karndeep SinghAutomate Image & PDF document annotation using GPT for LayoutLM or LiLT Models|UBIAI| Karndeep SinghAutomate Image & PDF document annotation using GPT for LayoutLM or LiLT Models|UBIAI| Karndeep SinghAutomating NER Data Labeling with Few-shot Learning and UBIAI | Text Annotation | Karndeep SinghAutomating NER Data Labeling with Few-shot Learning and UBIAI | Text Annotation | Karndeep SinghSegment Anything Model ( SAM ) by META AI | Image Segmentation | Object Detection | Karndeep SinghSegment Anything Model ( SAM ) by META AI | Image Segmentation | Object Detection | Karndeep SinghZero-Shot NER using ChatGPT | Name Entity Recognition| Prompt Engineering | Karndeep SinghZero-Shot NER using ChatGPT | Name Entity Recognition| Prompt Engineering | Karndeep SinghFine-Tune Stanford Alpaca 7B Model | LLAMA Alpaca | Step-by-Step Guide | Karndeep SinghFine-Tune Stanford Alpaca 7B Model | LLAMA Alpaca | Step-by-Step Guide | Karndeep SinghRun Stanford Alpaca 7B LLAMA Instruction Finetuned Model on Local System | Dalai | Karndeep SinghRun Stanford Alpaca 7B LLAMA Instruction Finetuned Model on Local System | Dalai | Karndeep SinghBuild Website Using Text Prompt With AI | Automate Website Development | Karndeep SinghBuild Website Using Text Prompt With AI | Automate Website Development | Karndeep SinghOpenAI ChatGPT Python API | ChatGPT API | Karndeep SinghOpenAI ChatGPT Python API | ChatGPT API | Karndeep SinghHow to Load Large Hugging Face Models on Low-End Hardware | CoLab | HF | Karndeep SinghHow to Load Large Hugging Face Models on Low-End Hardware | CoLab | HF | Karndeep SinghV7's Revolutionary Auto Annotation Tool for Images and Videos | V7 Annotation Tool | Karndeep SinghV7's Revolutionary Auto Annotation Tool for Images and Videos | V7 Annotation Tool | Karndeep Singh5 High Paying Jobs In Data Science | Highest paying JOBS | Karndeep Singh5 High Paying Jobs In Data Science | Highest paying JOBS | Karndeep SinghGenerate 3D Images using Text Prompts | Text -To-3D Images | Karndeep SinghGenerate 3D Images using Text Prompts | Text -To-3D Images | Karndeep Singh3 Best Productive ChatGPT Chrome Extensions | ChatGPT |3 Best Productive ChatGPT Chrome Extensions | ChatGPT |Setup ChatGPT in VSCODE using CodeGPT Extension | ChatGPT | Karndeep SinghSetup ChatGPT in VSCODE using CodeGPT Extension | ChatGPT | Karndeep SinghMLFlow Tutorial Part-5: Setup MLFlow server on GCP - A Step-By-Step Guide | MLFlow | Karndeep SinghMLFlow Tutorial Part-5: Setup MLFlow server on GCP - A Step-By-Step Guide | MLFlow | Karndeep SinghMLFlow Tutorial Part-4 : Model Versioning and Model Registry using MLFlow | MLFlow | Karndeep SinghMLFlow Tutorial Part-4 : Model Versioning and Model Registry using MLFlow | MLFlow | Karndeep SinghTranslate text and detect the language of text using GoogleTrans | Google Translate | Karndeep SinghTranslate text and detect the language of text using GoogleTrans | Google Translate | Karndeep Singh
Яндекс.Метрика