How to build custom Datasets for Text in Pytorch
In this video we go more in depth into custom datasets and implement more advanced functions for dealing with text. Specifically, we look at an image captioning dataset (the Flickr8k dataset), where each image comes with a corresponding caption that describes what's going on in the image. I think the general principles from this video can be applied to any project that deals with text data, be it translation, question answering, sentiment analysis, etc. I also recommend taking a look at my Torchtext videos, which can also be quite helpful and simplify the data loading process.
❤️ Support the channel ❤️
https://www.youtube.com/channel/UCkzW5JSFwvKRjXABI-UTAkQ/join
Paid Courses I recommend for learning (affiliate links, no extra cost for you):
⭐ Machine Learning Specialization https://bit.ly/3hjTBBt
⭐ Deep Learning Specialization https://bit.ly/3YcUkoI
📘 MLOps Specialization http://bit.ly/3wibaWy
📘 GAN Specialization https://bit.ly/3FmnZDl
📘 NLP Specialization http://bit.ly/3GXoQuP
✨ Free Resources that are great:
NLP: https://web.stanford.edu/class/cs224n/
CV: http://cs231n.stanford.edu/
Deployment: https://fullstackdeeplearning.com/
FastAI: https://www.fast.ai/
💻 My Deep Learning Setup and Recording Setup:
https://www.amazon.com/shop/aladdinpersson
GitHub Repository:
https://github.com/aladdinpersson/Machine-Learning-Collection
✅ One-Time Donations:
Paypal: https://bit.ly/3buoRYH
▶️ You Can Connect with me on:
Twitter - https://twitter.com/aladdinpersson
LinkedIn - https://www.linkedin.com/in/aladdin-persson-a95384153/
Github - https://github.com/aladdinpersson
OUTLINE:
0:00 - Introduction
2:05 - Overview of what we're going to do
4:05 - Imports
5:20 - Setup of Pytorch Dataset for loading Flickr
11:50 - Setup of Vocabulary and Numericalization
22:19 - Creating Collate for Padding of Batch
25:20 - Function for getting data loader
29:15 - Running code & fixing a couple of errors
33:09 - Ending
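To give a feel for the vocabulary, numericalization and padding steps listed in the outline, here is a minimal sketch in plain Python. It is not the code from the video: the class name `Vocabulary`, the special tokens, the whitespace tokenizer and the helper `pad_batch` are all assumptions made for illustration. In an actual PyTorch pipeline the padding would typically live in a `collate_fn` passed to `DataLoader`, for example using `torch.nn.utils.rnn.pad_sequence`.

```python
from collections import Counter

# Special tokens; the names and their order are assumptions for this sketch.
SPECIALS = ["<PAD>", "<SOS>", "<EOS>", "<UNK>"]


class Vocabulary:
    """Maps tokens to integer ids and back (hypothetical helper)."""

    def __init__(self, freq_threshold=1):
        # Words seen fewer than freq_threshold times are mapped to <UNK>.
        self.freq_threshold = freq_threshold
        self.itos = dict(enumerate(SPECIALS))
        self.stoi = {tok: idx for idx, tok in self.itos.items()}

    def __len__(self):
        return len(self.itos)

    @staticmethod
    def tokenize(text):
        # Naive lowercase whitespace tokenizer, chosen to keep the sketch
        # dependency-free; a real tokenizer would handle punctuation.
        return text.lower().split()

    def build(self, sentences):
        # Count every token, then add the frequent ones in first-seen order.
        counts = Counter(tok for s in sentences for tok in self.tokenize(s))
        for tok, n in counts.items():
            if n >= self.freq_threshold and tok not in self.stoi:
                idx = len(self.itos)
                self.stoi[tok] = idx
                self.itos[idx] = tok

    def numericalize(self, text):
        # Convert text to ids, wrapped in start and end tokens.
        unk = self.stoi["<UNK>"]
        ids = [self.stoi.get(tok, unk) for tok in self.tokenize(text)]
        return [self.stoi["<SOS>"]] + ids + [self.stoi["<EOS>"]]


def pad_batch(sequences, pad_idx):
    # Right-pad every sequence to the length of the longest one in the batch.
    max_len = max(len(seq) for seq in sequences)
    return [seq + [pad_idx] * (max_len - len(seq)) for seq in sequences]


captions = ["A dog runs", "A dog jumps over a log"]
vocab = Vocabulary(freq_threshold=1)
vocab.build(captions)
batch = pad_batch([vocab.numericalize(c) for c in captions], vocab.stoi["<PAD>"])
```

Padding per batch rather than to a global maximum length keeps short batches small, which is the usual motivation for doing it in the collate step instead of inside the dataset.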
Video: How to build custom Datasets for Text in Pytorch, from the channel Aladdin Persson