Загрузка...

Vision Transformers and Automatic Speech Recognition in Spark NLP - Maziyar Panahi - NLP Summit 2022

Install NLP Libraries https://www.johnsnowlabs.com/install/

Register for Healthcare NLP Summit 2023: https://www.nlpsummit.org/#register

Watch all NLP Summit 2022 sessions: https://www.nlpsummit.org/nlp-summit-2022-watch-now/

Multi-modal learning, in which a model provides answers or predictions by combining data from multiple modalities – such as images, audio, and text – is necessary in a growing number of practical use cases. This session presents new capabilities in the open-source Spark NLP library to build highly scalable pipelines that unify computer vision, speech-to-text, and text-mining models. Both training and inference will be covered, as well as the currently available pre-trained, state-of-the-art models. Finally, benchmarks that compare these capabilities to other open-source libraries will highly the order-of-magnitude speedups that Spark NLP provides for these tasks, on both CPU’s and GPU’s.

Видео Vision Transformers and Automatic Speech Recognition in Spark NLP - Maziyar Panahi - NLP Summit 2022 канала John Snow Labs
Яндекс.Метрика
Все заметки Новая заметка Страницу в заметки
Страницу в закладки Мои закладки
На информационно-развлекательном портале SALDA.WS применяются cookie-файлы. Нажимая кнопку Принять, вы подтверждаете свое согласие на их использование.
О CookiesНапомнить позжеПринять