EI Seminar - Jacob Steinhardt - Large Language Models as Statisticians

Given their complex behavior, diverse skills, and wide range of deployment scenarios, it is important to understand large language models, and especially their failure modes. New models are released every few months, often with brand-new capabilities, so how can we achieve understanding that keeps pace with modern practice? In this talk, I will present an approach that leverages the skills of language models themselves and therefore scales up as models get better. Specifically, we leverage the skill of language models as statisticians. At inference time, language models can read and process significant amounts of information thanks to their large context windows, and use it to generate useful statistical hypotheses. We will showcase several systems built on this principle, which allow us to audit other models for failures, identify spurious cues in datasets, label the internal representations of models, and factorize corpora into human-interpretable concepts. This is joint work with many collaborators and students, including Ruiqi Zhong, Erik Jones, and Yossi Gandelsman.

Video "EI Seminar - Jacob Steinhardt - Large Language Models as Statisticians" from the MIT Embodied Intelligence channel
Video information
October 18, 2023, 12:10:32
01:02:45