Загрузка страницы

Developing for Indic languages | Gemma and Navarasa

While many early large language models were predominantly trained on English language data, the field is rapidly evolving. Newer models are increasingly being trained on multilingual datasets, and there's a growing focus on developing models specifically for the world’s languages. However, challenges remain in ensuring equitable representation and performance across diverse languages, particularly those with less available data and computational resources.

Gemma, Google's family of open models, is designed to address these challenges by enabling the development of projects in non-Germanic languages. Its tokenizer and large token vocabulary make it particularly well-suited for handling diverse languages. Watch how developers in India used Gemma to create Navarasa — a fine-tuned Gemma model for Indic languages.

Watch the full keynote: https://www.youtube.com/live/XEzRZ35urlk?si=Yo3HTskWQMJwdf3G

To watch this keynote with American Sign Language (ASL) interpretation, please click here: https://www.youtube.com/live/6rP2rEWsfpM?si=RuVQFeK4ESnOsUnp

#GoogleIO #GoogleIO2024

Subscribe to our Channel: https://www.youtube.com/google
Find us on X: https://twitter.com/google
Watch us on TikTok: https://tiktok.com/@google
Follow us on Instagram: https://www.instagram.com/google
Join us on Facebook: https://www.facebook.com/Google

Видео Developing for Indic languages | Gemma and Navarasa канала Google
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
14 мая 2024 г. 22:57:21
00:01:47
Яндекс.Метрика