Загрузка страницы

Anatoly Potapov: Pre-training Transformers with Catalyst

Data fest Online 2020
Catalyst Workshop track https://ods.ai/tracks/catalyst-df2020

Since the NLP's Imagenet moment (emergence of ELMO, BERT), language models are widely used as a backbone for a variety of supervised tasks (intent classification, named entity recognition, question answering), etc

At Tinkoff, we have tens of millions of unlabelled customer conversation samples. In such a scenario, it is highly beneficial to pre-train on in-domain data and having a custom vocabulary. We transferred our pre-training pipeline to the Catalyst framework and reduced our codebase while keeping features like distributed training and fp16 training.

In my presentation, I will tell you how to pre-train transformers at scale with the Catalyst framework without writing lots of "infrastructure" code.

Register and get access to the tracks: https://ods.ai/events/datafest2020
Join the community: https://ods.ai/

Видео Anatoly Potapov: Pre-training Transformers with Catalyst канала ODS AI Global
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
5 ноября 2020 г. 20:30:04
00:15:11
Другие видео канала
ML puzzler Boosting — Данила СавенковML puzzler Boosting — Данила СавенковEghbal Rahimkina, Ser-Huang Poon: ML for Realised Volatility ForecastingEghbal Rahimkina, Ser-Huang Poon: ML for Realised Volatility ForecastingRoman Schutski: Graphical models for tensor networks and machine learningRoman Schutski: Graphical models for tensor networks and machine learningEmbeddings for Graph Classification — Евгений БурнаевEmbeddings for Graph Classification — Евгений БурнаевDmitry Petrov: DataOps & ML automation with DVCDmitry Petrov: DataOps & ML automation with DVCRL for the Adaptive Speed Regulation on a Metallurgical Picking LineRL for the Adaptive Speed Regulation on a Metallurgical Picking LineПицца аля-semi-supervised — Артур КузинПицца аля-semi-supervised — Артур КузинУправление наличностью в банкоматах и офисах — Александр УльяновУправление наличностью в банкоматах и офисах — Александр УльяновVadim Safronov: Business Transformation as Graph ProblemsVadim Safronov: Business Transformation as Graph ProblemsДавай останемся друзьями - как устроен раздел Возможно Вы Знакомы в Одноклассниках – Евгений МалютинДавай останемся друзьями - как устроен раздел Возможно Вы Знакомы в Одноклассниках – Евгений МалютинGene Kogan | Machine learning for creativityGene Kogan | Machine learning for creativityIntroducing Smelter: движок для инференса – Андрей ВолодинIntroducing Smelter: движок для инференса – Андрей ВолодинСуровая действительность товарных рекомендаций бытовой техники – Владимир ЛитвинюкСуровая действительность товарных рекомендаций бытовой техники – Владимир ЛитвинюкAlex Farseev | Under the Boot of Google and Facebook and How to Crack it for better PerformanceAlex Farseev | Under the Boot of Google and Facebook and How to Crack it for better PerformanceAlexander Grishin: Controlling the overestimation biasAlexander Grishin: Controlling the overestimation biasNikolay Nikitin: Structural Learning for Composite Models Using Evolutionary OptimizationNikolay Nikitin: Structural Learning for Composite Models Using Evolutionary OptimizationML puzzlerML puzzlerCatalyst integration with WandbCatalyst integration with WandbML in Healthcare Track PremiereML in Healthcare Track PremiereStrategic keynote – Максим ФедоровStrategic keynote – Максим ФедоровMichael Kachala, Tati Gabru, Pavel Pleskov: WFH WFO BATTLEMichael Kachala, Tati Gabru, Pavel Pleskov: WFH WFO BATTLE
Яндекс.Метрика