David Silver - Deep Reinforcement Learning from AlphaGo to AlphaStar
Recently, self-learning systems have achieved remarkable success in several challenging problems for artificial intelligence, by combining reinforcement learnng with deep neural networks. In this talk, I describe the ideas and algorithms that led to AlphaGo: the first program to defeat a human champion in the game of Go; AlphaZero: which learned, from scratch, to also defeat the world computer champions in chess and shogi; and AlphaStar: the first program to defeat a human champion in the real-time strategy game of StarCraft.
Bio: David Silver is a principal research scientist at DeepMind and a professor at University College London. David's work focuses on artificially intelligent agents based on reinforcement learning. David co-led the project that combined deep learning and reinforcement learning to play Atari games directly from pixels (Nature 2015). He also led the AlphaGo project, culminating in the first program to defeat a top professional player in the full-size game of Go (Nature 2016), and the AlphaZero project, which learned by itself to defeat the world's strongest chess, shogi and Go programs (Nature 2017, Science 2018). Most recently, he co-led the AlphaStar project, which led to the world's first grandmaster level StarCraft player (Nature 2019). His work has been recognised by the Marvin Minsky award, Mensa Foundation Prize, and Royal Academy of Engineering Silver Medal.
*Sponsors*
Man AHL: At Man AHL, we mix machine learning, computer science and engineering with terabytes of data to invest billions of dollars every day.
https://evolution.ai/ : Machines that Read - Intelligent data extraction from corporate and financial documents.
Видео David Silver - Deep Reinforcement Learning from AlphaGo to AlphaStar канала London Machine Learning Meetup
Bio: David Silver is a principal research scientist at DeepMind and a professor at University College London. David's work focuses on artificially intelligent agents based on reinforcement learning. David co-led the project that combined deep learning and reinforcement learning to play Atari games directly from pixels (Nature 2015). He also led the AlphaGo project, culminating in the first program to defeat a top professional player in the full-size game of Go (Nature 2016), and the AlphaZero project, which learned by itself to defeat the world's strongest chess, shogi and Go programs (Nature 2017, Science 2018). Most recently, he co-led the AlphaStar project, which led to the world's first grandmaster level StarCraft player (Nature 2019). His work has been recognised by the Marvin Minsky award, Mensa Foundation Prize, and Royal Academy of Engineering Silver Medal.
*Sponsors*
Man AHL: At Man AHL, we mix machine learning, computer science and engineering with terabytes of data to invest billions of dollars every day.
https://evolution.ai/ : Machines that Read - Intelligent data extraction from corporate and financial documents.
Видео David Silver - Deep Reinforcement Learning from AlphaGo to AlphaStar канала London Machine Learning Meetup
Показать
Комментарии отсутствуют
Информация о видео
29 января 2020 г. 14:37:43
01:32:11
Другие видео канала
David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86RL Course by David Silver - Lecture 8: Integrating Learning and PlanningDeepMind StarCraft II DemonstrationBenjamin Schrauwen: Deep learning for 3D printing manufacturingAlphaGo - The Movie | Full DocumentaryVincent Spruyt: Loc2Vec: Self-supervised metric learning through triplet-lossAlphaGo - RequiemTraining AI Without Writing A Reward Function, with Reward ModellingOpening Keynote: "The Story of Alpha Go", presented by Aja Huang 7d and Fan Hui 2pHow the US government is using blockchain to fight fraud | Kathryn Haun | TEDxSanFrancisco2020 Machine Learning Roadmap (still valid for 2021)Charles Blundell - Agent57: Outperforming the Atari Human BenchmarkRL Course by David Silver - Lecture 2: Markov Decision ProcessGoogle's self-learning AI AlphaZero masters chess in 4 hoursMatch 1 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGoMIT 6.S191 (2019): Deep Reinforcement LearningVirtual HLF 2020 – Talk: David SilverTutorial: Introduction to Reinforcement Learning with Function ApproximationPublic Lecture with Google DeepMind's Demis HassabisDeepmind AlphaZero - Mastering Games Without Human Knowledge