23. Accelerating Gradient Descent (Use Momentum)
MIT 18.065 Matrix Methods in Data Analysis, Signal Processing, and Machine Learning, Spring 2018
Instructor: Gilbert Strang
View the complete course: https://ocw.mit.edu/18-065S18
YouTube Playlist: https://www.youtube.com/playlist?list=PLUl4u3cNGP63oMNUHXqIUcrkS2PivhN3k
In this lecture, Professor Strang explains both momentum-based gradient descent and Nesterov's accelerated gradient descent.
License: Creative Commons BY-NC-SA
More information at https://ocw.mit.edu/terms
More courses at https://ocw.mit.edu
Video: 23. Accelerating Gradient Descent (Use Momentum), from the MIT OpenCourseWare channel