EI Seminar - Jacob Steinhardt - Large Language Models as Statisticians
EI Seminar - Jacob Steinhardt - Large Language Models as Statisticians
Given their complex behavior, diverse skills, and wide range of deployment scenarios, understanding large language models---and especially their failure modes---is important. Given that new models are released every few months, often with brand new capabilities, how can we achieve understanding that keeps pace with modern practice? In this talk, I will present an approach to this that leverages the skills of language models themselves, and so scales up as models get better. Specifically, we leverage the skill of language models as statisticians. At inference time, language models can read and process significant amounts of information due to their large context windows, and use this to generate useful statistical hypotheses. We will showcase several systems built on this principle, which allow us to audit other models for failures, identify spurious cues in datasets, label the internal representations of models, and factorize corpora into human-interpetable concepts. This is joint work with many collaborators and students, including Ruiqi Zhong, Erik Jones, and Yossi Gandelsman.
Видео EI Seminar - Jacob Steinhardt - Large Language Models as Statisticians канала MIT Embodied Intelligence
Given their complex behavior, diverse skills, and wide range of deployment scenarios, understanding large language models---and especially their failure modes---is important. Given that new models are released every few months, often with brand new capabilities, how can we achieve understanding that keeps pace with modern practice? In this talk, I will present an approach to this that leverages the skills of language models themselves, and so scales up as models get better. Specifically, we leverage the skill of language models as statisticians. At inference time, language models can read and process significant amounts of information due to their large context windows, and use this to generate useful statistical hypotheses. We will showcase several systems built on this principle, which allow us to audit other models for failures, identify spurious cues in datasets, label the internal representations of models, and factorize corpora into human-interpetable concepts. This is joint work with many collaborators and students, including Ruiqi Zhong, Erik Jones, and Yossi Gandelsman.
Видео EI Seminar - Jacob Steinhardt - Large Language Models as Statisticians канала MIT Embodied Intelligence
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![MIT EI Seminar - Laura Schulz - Curiouser and curiouser: why we make problems for ourselves](https://i.ytimg.com/vi/1l0u5gctDP4/default.jpg)
![EI Seminar - Graham Neubig - Learning to Explain and Explaining to Learn](https://i.ytimg.com/vi/CtcP5bvODzY/default.jpg)
![EI Seminar - Martin Riedmiller - Learning Controllers - From Engineering to AGI](https://i.ytimg.com/vi/Pno8xsrgWA4/default.jpg)
![EI Seminar Livestream - Max Tegmark](https://i.ytimg.com/vi/aDaOuBP-jN4/default.jpg)
![EI Seminar - Recent papers in Embodied Intelligence](https://i.ytimg.com/vi/wcVejqmb1mQ/default.jpg)
![EI Seminar - Beomjoon Kim - Making Robots See and Manipulate](https://i.ytimg.com/vi/GZ-oiwOeRc8/default.jpg)
![EI Seminar - Marco Pavone - Building Trust in AI for Autonomous Vehicles](https://i.ytimg.com/vi/HjOt-4k6haI/default.jpg)
![EI Seminar - Jacob Andreas - Good Old-fashioned LLMs (or, Autoformalizing the World)](https://i.ytimg.com/vi/_TrKARhF5cI/default.jpg)
![EI Seminar - Grey Yang - Tuning GPT-3 on a Single GPU via Zero-Shot Hyperparameter Transfer](https://i.ytimg.com/vi/xbCibcC9Ud0/default.jpg)
![EI Seminar - Maurice Fallon - Multi-Sensor Robot Navigation and Subterranean Exploration](https://i.ytimg.com/vi/4D4TbI1gGIg/default.jpg)
![EI Seminar - Chad Jenkins - Semantic Robot Programming... and Maybe Making the Worlda Better Place](https://i.ytimg.com/vi/UaTq6ojGuYo/default.jpg)
![EI Seminar - Joydeep Biswas](https://i.ytimg.com/vi/0vPNN0J8M44/default.jpg)
![MIT EI Seminar - Lerrel Pinto - Diverse data and efficient algorithms for robot learning](https://i.ytimg.com/vi/tRcwyC-ivMQ/default.jpg)
![EI Seminar - Yuan Gong - Audio Large Language Models: From Sound Perception to Understanding](https://i.ytimg.com/vi/uqsW2eK-Rms/default.jpg)
![Lawson Wong - High-Level Guidance for Generalizable Reinforcement Learning](https://i.ytimg.com/vi/8KGbtpkMBZc/default.jpg)
![EI Seminar - Monroe Kennedy - Collaborative Robotics: From Dexterity to Teammate Prediction](https://i.ytimg.com/vi/ii8ZNXaZ0hg/default.jpg)
![EI Seminar - Rob Fergus - Data Augmentation for Image-Based Reinforcement Learning](https://i.ytimg.com/vi/Ny2CpgPrtB8/default.jpg)
![EI Seminar - Oriol Vinyals - The Deep Learning Toolbox: from AlphaFold to AlphaCode](https://i.ytimg.com/vi/dOlbnrsQy_I/default.jpg)
![Daniel Wolpert - Computational principles underlying the learning of sensorimotor repertoires](https://i.ytimg.com/vi/wp3c1E6oCTM/default.jpg)
![EI Seminar - Jeannette Bohg - Scaling Robot Learning for Long-Horizon Manipulation Tasks](https://i.ytimg.com/vi/Ca-CxLZ2mq8/default.jpg)