Vision Transformers and Automatic Speech Recognition in Spark NLP - Maziyar Panahi - NLP Summit 2022
Install NLP Libraries https://www.johnsnowlabs.com/install/
Register for Healthcare NLP Summit 2023: https://www.nlpsummit.org/#register
Watch all NLP Summit 2022 sessions: https://www.nlpsummit.org/nlp-summit-2022-watch-now/
Multi-modal learning, in which a model provides answers or predictions by combining data from multiple modalities, such as images, audio, and text, is necessary in a growing number of practical use cases. This session presents new capabilities in the open-source Spark NLP library for building highly scalable pipelines that unify computer vision, speech-to-text, and text-mining models. Both training and inference will be covered, as well as the currently available pre-trained, state-of-the-art models. Finally, benchmarks that compare these capabilities to other open-source libraries will highlight the order-of-magnitude speedups that Spark NLP provides for these tasks, on both CPUs and GPUs.
Video "Vision Transformers and Automatic Speech Recognition in Spark NLP - Maziyar Panahi - NLP Summit 2022" from the John Snow Labs channel
Video information
November 4, 2022, 20:05:50
00:08:26