Day 27: Open NLLB - speeding up fuzzy dedup, tqdm multi process (Pt 1)
👨👩👧👦 Join our Discord community 👨👩👧👦
https://discord.gg/peBrCpheKE
Continuing on with the data work (Serbian, Croatian, Bosnian) getting ready to train the first baseline English2HBS.
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💰 BECOME A PATREON OF THE AI EPIPHANY ❤️
If these videos, GitHub projects, and blogs help you,
consider helping me out by supporting me on Patreon!
The AI Epiphany - https://www.patreon.com/theaiepiphany
One-time donation - https://www.paypal.com/paypalme/theaiepiphany
Huge thank you to these AI Epiphany patreons:
Eli Mahler
Petar Veličković
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💼 LinkedIn - https://www.linkedin.com/in/aleksagordic/
🐦 Twitter - https://twitter.com/gordic_aleksa
👨👩👧👦 Discord - https://discord.gg/peBrCpheKE
📺 YouTube - https://www.youtube.com/c/TheAIEpiphany/
📚 Medium - https://gordicaleksa.medium.com/
💻 GitHub - https://github.com/gordicaleksa
📢 AI Newsletter - https://aiepiphany.substack.com/
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
#opensource #meta #nolanguageleftbehind
Видео Day 27: Open NLLB - speeding up fuzzy dedup, tqdm multi process (Pt 1) канала Aleksa Gordić - The AI Epiphany
https://discord.gg/peBrCpheKE
Continuing on with the data work (Serbian, Croatian, Bosnian) getting ready to train the first baseline English2HBS.
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💰 BECOME A PATREON OF THE AI EPIPHANY ❤️
If these videos, GitHub projects, and blogs help you,
consider helping me out by supporting me on Patreon!
The AI Epiphany - https://www.patreon.com/theaiepiphany
One-time donation - https://www.paypal.com/paypalme/theaiepiphany
Huge thank you to these AI Epiphany patreons:
Eli Mahler
Petar Veličković
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💼 LinkedIn - https://www.linkedin.com/in/aleksagordic/
🐦 Twitter - https://twitter.com/gordic_aleksa
👨👩👧👦 Discord - https://discord.gg/peBrCpheKE
📺 YouTube - https://www.youtube.com/c/TheAIEpiphany/
📚 Medium - https://gordicaleksa.medium.com/
💻 GitHub - https://github.com/gordicaleksa
📢 AI Newsletter - https://aiepiphany.substack.com/
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
#opensource #meta #nolanguageleftbehind
Видео Day 27: Open NLLB - speeding up fuzzy dedup, tqdm multi process (Pt 1) канала Aleksa Gordić - The AI Epiphany
Показать
Комментарии отсутствуют
Информация о видео
1 октября 2023 г. 18:24:58
03:02:17
Другие видео канала
Thomas Wolf (HuggingFace) - the case for open-source!Jeremy Howard - answer.ai, what is wrong with the academia and industryLLaMA 2 w/ Thomas Scialom (LLaMA 2 lead)Lucas Beyer (Google DeepMind) - Convergence of Vision & LanguageJarvis for Images! (demo) - run locally, no external APIsOpenAI DALL-E 3 with James Betker (1st author)The Vesuvius challenge breakthrough with Luke FarritorDay 29: Open NLLB - testing & improving fasttext HBS LID (Pt 3)Day 29: Open NLLB - handling German data, training fasttext HBS LID (Pt 2)Day 29: Open NLLB - training fasttext LID (Pt 1)Day 28: Open NLLB - debugging fuzzy dedup, training fasttext LID (Pt 3)Day 28: Open NLLB - debugging fuzzy dedup, training fasttext LID (Pt 2)Day 27: Open NLLB - filtering stage hyperparams, HBS LID detector (Pt 3 cont.)Day 27: Open NLLB - filtering stage hyperparams, HBS LID detector (Pt 3)Day 27: Open NLLB - profiling parallel fuzzy dedup, filtering stage hyperparams (Pt 2)Day 26: Open NLLB - filtering HBS, union find, paper reading (Pt 2)Day 26: Open NLLB - filtering HBS, refactoring, wrapping up MinHash LSH (Pt 1)Day 25: Open NLLB - filtering HBS (Pt 3)Day 25: Open NLLB - filtering HBS (Pt 2)Day 25: Open NLLB - filtering HBS (fuzzy dedup, toxicity, LID) (Pt 1)