Загрузка...

4.3 Neural Tangent Kernel (NTK) — Spectral Decomposition of Gradient Flow

We turn gradient descent into a continuous-time model, then use the NTK spectrum to explain why some parts of a function are learned faster than others.

Watch the full series:
https://www.youtube.com/playlist?list=PLdMM2W7mxqIYDURvh37jLz8sd8sDjonk5

0:00 — NTK dynamics and ODE
6:03 — Spectral decomposition and training dynamics
14:28 — Interpreting the spectrum

Topics:
• Gradient flow (ODE)
• Spectral / eigen decomposition
• Learning dynamics
• Fast vs slow modes
• PCA / covariance view
• Kernel perspective (NTK)
• Limits of fixed features
• SGD as noisy kernel

Видео 4.3 Neural Tangent Kernel (NTK) — Spectral Decomposition of Gradient Flow канала Jingyuan Hu

neural tangent kernel NTK NTK spectrum NTK spectral analysis neural tangent kernel explained gradient flow gradient descent dynamics spectral decomposition principal component analysis fast vs slow learning modes kernel methods wide neural networks deep learning theory mathematics of machine learning

Комментарии отсутствуют

Информация о видео

20 марта 2026 г. 8:43:39

00:24:32

Правообладателям

Жалоба на материал Недопустимый материал Нарушение авторских прав

Комментарии

Поделиться

Другие видео канала

Visualizing Singular Value Decomposition (SVD), 4 different ways

Equal distances in high dimensions: why t-SNE can mislead? — Dimensionality Reduction (7/7)

3.2 Support Vector Machines — Geometry of Maximum Margin

2.3 Local Linear Regression

Why Entropy Is Hidden Information?

Laplacian Eigenmaps - Dimensionality Reduction (4/7)

1.5 Multi-Dimensional Splines

2.2 Kernel Density Estimation

Forward and Backward PDEs Come as a Pair, Connected by Adjoints - Stochastic Processes & PDEs (7/8)

Path Integrals and the Distribution of Random Paths - Stochastic Processes and PDEs (8/8)

SVD Low Rank Approximation #math #linearalgebra #datascience

3.4 Support Vector Machines — KKT Conditions

Ito Calculus: Why Brownian Motion Breaks Ordinary Calculus - Stochastic Processes and PDEs (3/8)

Multidimensional Scaling (MDS) and Isomap - Dimensionality Reduction (2/7)

How Expectations Evolve: Black–Scholes & Kolmogorov Backward - Stochastic Processes and PDEs (6/8)

1.3b Generalized Additive Models (GAM)

1.7 Linear Algebra in Infinite Dimensions

1.10 Gaussian Process Regression: Bayesian Ridge Regression

3.1 Regularized Logistic Regression

SVD is change of basis #math #linearalgebra #svd #machinelearning

Все заметки Новая заметка Страницу в заметки

Страницу в закладки Мои закладки

На информационно-развлекательном портале SALDA.WS применяются cookie-файлы. Нажимая кнопку Принять, вы подтверждаете свое согласие на их использование.

О Cookies Напомнить позже Принять