Загрузка страницы

A much better LLM Leaderboard!!!

🏆 This leaderboard is based on the following three benchmarks.

Chatbot Arena - a crowdsourced, randomized battle platform. We use 100K+ user votes to compute Elo ratings.
MT-Bench - a set of challenging multi-turn questions. We use GPT-4 to grade the model responses.
MMLU (5-shot) - a test to measure a model's multitask accuracy on 57 tasks.

🔗 Links 🔗

ChatBOT Arena Leaderboard from Lmsys - https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard

Arena Leaderboard Elo Ranking Method - https://colab.research.google.com/drive/1RAWb22-PFNI-X1gPVzc927SGUdfr6nsR?usp=sharing

Play at the Arena - https://chat.lmsys.org/?arena
Intro Sound from Honest Trailers- https://youtu.be/lZMzf-SDWP8

❤️ If you want to support the channel ❤️
Support here:
Patreon - https://www.patreon.com/1littlecoder/
Ko-Fi - https://ko-fi.com/1littlecoder

🧭 Follow me on 🧭
Twitter - https://twitter.com/1littlecoder
Linkedin - https://www.linkedin.com/in/amrrs/

Видео A much better LLM Leaderboard!!! канала 1littlecoder
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
29 ноября 2023 г. 1:54:32
00:11:24
Яндекс.Метрика