Christopher D Manning: A Neural Network Model That Can Reason (ICLR 2018 invited talk)
Abstract: Deep learning has had enormous success on perceptual tasks but still struggles to provide a model for inference. To address this gap, we have been developing Memory-Attention-Composition networks (MACnets). The MACnet design provides a strong prior for explicitly iterative reasoning, enabling it to learn explainable, structured reasoning and to generalize well from a modest amount of data. The model builds on the great success of existing recurrent cells such as LSTMs: a MACnet is a single recurrent Memory, Attention, and Composition (MAC) cell applied in sequence. However, its design imposes structural constraints on the operation of each cell and on the interactions between them, incorporating explicit control and soft attention mechanisms. We demonstrate the model’s strength and robustness on the challenging CLEVR dataset for visual reasoning (Johnson et al. 2016), achieving a new state-of-the-art accuracy of 98.9% and halving the error rate of the previous best model. More importantly, we show that the new model is more data-efficient, achieving good results from even a modest amount of training data. Joint work with Drew Hudson.
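The abstract describes the MAC cell only at a high level, so the following is a toy sketch rather than the authors' model: it shows one recurrent step that updates a control state by soft attention over question words, reads from a knowledge base by soft attention, and composes the result into memory, with the same step applied repeatedly. All names (`mac_step`, `run_macnet`), the absence of learned weights, and the fixed 50/50 memory update are assumptions made for illustration.

```python
import math


def softmax(scores):
    """Turn raw scores into a probability distribution (soft attention)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def weighted_sum(weights, vectors):
    dim = len(vectors[0])
    return [sum(w * v[i] for w, v in zip(weights, vectors)) for i in range(dim)]


def mac_step(control, memory, question_words, knowledge):
    """One step of a toy MAC-style cell (hypothetical, no learned parameters).

    question_words and knowledge are lists of vectors of the same dimension.
    """
    # Control: attend over the question words, guided by the previous control.
    control_attn = softmax([dot(control, w) for w in question_words])
    new_control = weighted_sum(control_attn, question_words)

    # Read: attend over the knowledge base, guided by control and memory.
    read_attn = softmax(
        [dot(new_control, k) + dot(memory, k) for k in knowledge]
    )
    retrieved = weighted_sum(read_attn, knowledge)

    # Write: compose the retrieved information with the previous memory.
    # A fixed average stands in for a learned update (assumption).
    new_memory = [0.5 * m + 0.5 * r for m, r in zip(memory, retrieved)]
    return new_control, new_memory, control_attn, read_attn


def run_macnet(question_words, knowledge, steps=4):
    """Apply the same cell `steps` times; return final memory and attention trace."""
    dim = len(question_words[0])
    count = len(question_words)
    # Start from the mean question vector and an empty memory (assumption).
    control = [sum(w[i] for w in question_words) / count for i in range(dim)]
    memory = [0.0] * dim
    trace = []
    for _ in range(steps):
        control, memory, control_attn, read_attn = mac_step(
            control, memory, question_words, knowledge
        )
        trace.append((control_attn, read_attn))
    return memory, trace
```

The per-step attention distributions returned in `trace` are what make this kind of design inspectable: each step records which question words and which knowledge-base items it attended to.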
Video: Christopher D Manning: A Neural Network Model That Can Reason (ICLR 2018 invited talk), from the channel Steven Van Vaerenbergh