Human-in-the-Loop Reinforcement Learning
(Pieter Abbeel, UC Berkeley | Covariant)
Pieter Abbeel is Professor at UC Berkeley, where he is Director of the Berkeley Robot Learning Lab and Co-Director of the Berkeley Artificial Intelligence (BAIR) Lab. Abbeel’s research strives to build ever more intelligent systems, with main emphasis on deep reinforcement learning, meta-learning. His lab also investigates how AI could advance other science and engineering disciplines. Abbeel has founded several companies, including Gradescope (AI to help instructors with grading homework and exams), Covariant (AI for robotic automation of warehouses and factories). Abbeel is also the host of The Robot Brains Podcast. Abbeel has received many awards and honors, including the PECASE, NSF-CAREER, ONR-YIP, Darpa-YFA, TR35. His work is frequently featured in the press, including the New York Times, Wall Street Journal, BBC, Rolling Stone, Wired, and Tech Review.
Видео Human-in-the-Loop Reinforcement Learning канала Anyscale
Pieter Abbeel is Professor at UC Berkeley, where he is Director of the Berkeley Robot Learning Lab and Co-Director of the Berkeley Artificial Intelligence (BAIR) Lab. Abbeel’s research strives to build ever more intelligent systems, with main emphasis on deep reinforcement learning, meta-learning. His lab also investigates how AI could advance other science and engineering disciplines. Abbeel has founded several companies, including Gradescope (AI to help instructors with grading homework and exams), Covariant (AI for robotic automation of warehouses and factories). Abbeel is also the host of The Robot Brains Podcast. Abbeel has received many awards and honors, including the PECASE, NSF-CAREER, ONR-YIP, Darpa-YFA, TR35. His work is frequently featured in the press, including the New York Times, Wall Street Journal, BBC, Rolling Stone, Wired, and Tech Review.
Видео Human-in-the-Loop Reinforcement Learning канала Anyscale
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Redesigning Scheduling in Ray to Improve Cost-Efficiency at ScaleKubeRay: A Ray cluster management solution on KubernetesSkyML: Infra-less machine learning on any cloudDeveloping Ray Applications on Google Cloud TPUsOptimizing LLM Inference with AWS Trainium, Ray, vLLM, and AnyscaleKeynote: The Future of Ray - Robert Nishihara, AnyscaleAI In Production Key Challenges and Use Cases - Jonathan Berte RobovisionData Processing on RayGraphs at scale with Ray, for AI in ManufacturingLast Mile Data Processing for ML Training using RayKeynote: AlphaStar: How Google Infrastructure Made it Possible - Oriol Vinyals, Principal ScientistRay Summit 2022 - Day 1 Keynote - Kim HazelwoodScaling time-series forecasting models to cope with the multi-verse with Ray[Opening Keynote] State of RayLarge-scale deep learning to augment production RL workloads at Riot GamesEasy Access to SOTA NLP Models with Ray and Hugging Face - Thomas Wolf, Hugging FaceOffline RL with RLlibScaling Instacart fulfillment ML on RayFast Scheduling using Ray's Python API (0.8) by Edward OakesA Ray-based ML Platform for Early Cancer DetectionHow Meta scales distributed training of AI workloads on Ray