Mixtral - Mixture of Experts (MoE) Free LLM that Rivals ChatGPT (3.5) by Mistral | Overview & Demo

Mixtral 8x7B is a cutting-edge Large Language Model (LLM) by Mistral AI, licensed under Apache 2.0. It uses a sparse Mixture of Experts (MoE) architecture, so it runs at roughly the speed of a 12B parameter model while surpassing the performance of Llama 2 70B and rivaling GPT-3.5 on most benchmarks. It understands English, French, German, Spanish, and Italian.

We'll look at how a Mixture of Experts works and how it is implemented in the Transformers library. The model is already integrated in Hugging Face Chat, and we'll try it out with a couple of prompts.
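
For a rough idea of what "Mixture of Experts" means in practice, here is a minimal sketch of top-2 routing for a single token. It is not the Transformers implementation: the names `top_k_route` and `moe_layer`, the toy experts, and the router logits are all made up for illustration. The only details taken from Mixtral are the 8 experts and the 2 experts used per token.

```python
import math


def top_k_route(router_logits, k=2):
    """Return [(expert_index, weight), ...] for the k highest-scoring experts."""
    ranked = sorted(range(len(router_logits)),
                    key=lambda i: router_logits[i], reverse=True)
    top = ranked[:k]
    # Softmax over only the selected logits, so the k weights sum to 1.
    peak = max(router_logits[i] for i in top)
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


def moe_layer(token, experts, router_logits, k=2):
    """Weighted sum of the outputs of the k selected experts for one token."""
    output = [0.0] * len(token)
    for idx, weight in top_k_route(router_logits, k):
        expert_out = experts[idx](token)  # only k experts are ever evaluated
        output = [o + weight * e for o, e in zip(output, expert_out)]
    return output


# Hypothetical toy experts: expert i just scales the token by (i + 1).
# In a real model each expert is a feed-forward network.
experts = [lambda x, s=i + 1: [s * v for v in x] for i in range(8)]

# Hypothetical router scores for one token; a real router is a learned linear layer.
logits = [0.1, 2.0, -1.0, 0.3, 2.0, 0.0, -0.5, 1.0]

routes = top_k_route(logits)
result = moe_layer([1.0, -2.0], experts, logits)
print(routes, result)
```

Because only 2 of the 8 experts run for each token, the compute per token is much lower than the total parameter count suggests, which is where the "speed of a 12B parameter model" comes from.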

Blog Post: https://mistral.ai/news/mixtral-of-experts/
HF Chat: https://huggingface.co/chat/
MoE Explained: https://huggingface.co/blog/moe

AI Bootcamp (preview drops on Christmas): https://www.mlexpert.io/membership
Discord: https://discord.gg/UaNPxVD6tv
Subscribe: http://bit.ly/venelin-subscribe
GitHub repository: https://github.com/curiousily/Get-Things-Done-with-Prompt-Engineering-and-LangChain

Join this channel to get access to the perks and support my work:
https://www.youtube.com/channel/UCoW_WzQNJVAjxo4osNAxd_g/join

00:00 - Intro
00:16 - What is Mixtral?
03:00 - Performance
04:44 - Instruct/Chat Model
05:44 - Mixtral on HF Hub
06:20 - What is a Mixture of Experts (MoE)?
10:26 - MoE Implementation in Transformers
12:40 - Demo in HF Chat
18:16 - Conclusion

#llm #artificialintelligence #chatbot #promptengineering #python #chatgpt #llama2

Video "Mixtral - Mixture of Experts (MoE) Free LLM that Rivals ChatGPT (3.5) by Mistral | Overview & Demo" from the Venelin Valkov channel