Arvind Neelakantan: Text and Code Embeddings

Text embeddings are useful features in many applications, such as semantic search and computing text similarity. Previous work typically trains models customized for different use cases, varying in dataset choice, training objective, and model architecture. In this work, Arvind Neelakantan (OpenAI) shows that contrastive pre-training on unsupervised data at scale leads to high-quality vector representations of text and code. The same unsupervised text embeddings that achieve new state-of-the-art results in linear-probe classification also display impressive semantic search capabilities and sometimes even perform competitively with fine-tuned models. As with text embeddings, he trains code embedding models on (text, code) pairs, obtaining a 20.8% relative improvement over prior best work on code search.
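
As a rough illustration of the contrastive objective and the similarity search mentioned in the abstract, the NumPy sketch below computes a symmetric cross-entropy loss over a batch of paired embeddings and ranks documents by cosine similarity. Treating the other pairs in the batch as negatives, the temperature value, and all function names are assumptions made for illustration; the abstract does not specify these details, and the input arrays stand in for the outputs of an encoder.

```python
import numpy as np


def l2_normalize(x: np.ndarray) -> np.ndarray:
    """Scale each row to unit length so dot products are cosine similarities."""
    return x / np.linalg.norm(x, axis=1, keepdims=True)


def log_softmax(logits: np.ndarray) -> np.ndarray:
    """Row-wise log-softmax, shifted by the row max for numerical stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))


def contrastive_loss(x_emb: np.ndarray, y_emb: np.ndarray,
                     temperature: float = 0.07) -> float:
    """Symmetric contrastive loss with in-batch negatives (assumed setup).

    Row i of x_emb and row i of y_emb form a positive pair, e.g. a text
    and its paired code; every other row in the batch acts as a negative.
    """
    x = l2_normalize(x_emb)
    y = l2_normalize(y_emb)
    logits = x @ y.T / temperature  # (batch, batch) similarity matrix
    idx = np.arange(len(x))
    loss_xy = -log_softmax(logits)[idx, idx].mean()    # match each x to its y
    loss_yx = -log_softmax(logits.T)[idx, idx].mean()  # match each y to its x
    return float((loss_xy + loss_yx) / 2)


def search(query_emb: np.ndarray, doc_embs: np.ndarray,
           top_k: int = 3) -> np.ndarray:
    """Return indices of the top_k documents by cosine similarity to the query."""
    scores = l2_normalize(doc_embs) @ l2_normalize(query_emb[None, :])[0]
    return np.argsort(-scores)[:top_k]
```

With correctly paired rows the loss is small, and it grows when the pairing is shuffled, which is the signal such an objective would use to pull matching pairs together. At query time, `search` only needs the stored embeddings and a dot product.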

Video: Arvind Neelakantan: Text and Code Embeddings, from the channel Oxford ML and Physics Seminars
Video information
June 13, 2022, 14:00:45
00:36:36
Other videos from the channel
Mike Walmsley: Galaxy Zoo(m): Probabilistic Galaxy Morphology via Bayesian CNNs and Active Learning
Rachel Prudden: Probabilistic modelling for atmospheric science: beyond the noise
Ricardo Vinuesa: Artificial Intelligence, Computational Fluid Dynamics, and Sustainability
Atılım Güneş Baydin: Probabilistic Programming for Inverse Problems in the Physical Sciences
Ard Louis: Deep neural networks have an inbuilt Occam’s razor
Guillaume Lample: Deep Learning for Symbolic Mathematics
Brian Spears: Cognitive Simulation: combining simulation and experiment with artificial intelligence
Eliu Huerta: AI for Science: Let’s talk business
Ben Nachman: Extracting the most from collider data with deep learning
Laure Zanna: Climate Modeling in the Age of Machine Learning
Tim Green: Highly accurate protein structure prediction with AlphaFold
Peter Dueben: Machine learning for weather predictions
David Spergel: Determining the Universe’s Initial Conditions
Adrien Gaidon: Self-supervised 3D vision
Maurizio Pierini: Doing more with less: Deep Learning for Physics at the Large Hadron Collider
Michael Kagan: Generative Model Based Design Optimization and Unfolding
Séamus Davis: Machine learning in electronic-quantum-matter imaging experiments
Stéphane Mallat: Hamiltonian Estimations by Conditional Renormalisation Group and Convolution Nets
Phiala Shanahan: Provably exact sampling for first-principles theoretical physics
Jonas Buchli & Federico Felici: Magnetic control of tokamak plasmas with deep reinforcement learning