How do you minimize a function when you can't take derivatives? CMA-ES and PSO
What do you do when you want to minimize a function, such as the error function used to train a machine learning model, but its derivatives don't exist or are very hard to calculate? You can use gradient-free optimizers. In this video, I show you two of them:
- CMA-ES (Covariance Matrix Adaptation Evolution Strategy)
- PSO (Particle swarm optimization)
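To give a rough feel for the second method, here is a minimal sketch of a basic global-best particle swarm in plain Python. It is not taken from the video: the function name `pso_minimize`, the parameter names, and the default values (inertia 0.7, attraction weights 1.5) are all assumptions chosen for illustration, and the test objective is a made-up function with a kink at its minimum, so it has no derivative there.

```python
import random


def pso_minimize(f, bounds, n_particles=30, n_iters=200,
                 inertia=0.7, c_personal=1.5, c_global=1.5, seed=0):
    """Minimize f over a box with a basic global-best particle swarm.

    All names and default values here are illustrative assumptions.
    """
    rng = random.Random(seed)
    dim = len(bounds)
    # Random starting positions inside the box, zero starting velocities.
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds]
           for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    # Each particle remembers the best point it has visited so far.
    best_pos = [p[:] for p in pos]
    best_val = [f(p) for p in pos]
    # The swarm remembers the best point any particle has visited.
    g = min(range(n_particles), key=lambda i: best_val[i])
    g_pos, g_val = best_pos[g][:], best_val[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Keep some momentum, then pull toward the particle's own
                # best point and toward the swarm's best point.
                vel[i][d] = (inertia * vel[i][d]
                             + c_personal * r1 * (best_pos[i][d] - pos[i][d])
                             + c_global * r2 * (g_pos[d] - pos[i][d]))
                lo, hi = bounds[d]
                # Move, clamping the coordinate to stay inside the box.
                pos[i][d] = min(max(pos[i][d] + vel[i][d], lo), hi)
            val = f(pos[i])  # only function values are used, no gradients
            if val < best_val[i]:
                best_val[i], best_pos[i] = val, pos[i][:]
                if val < g_val:
                    g_val, g_pos = val, pos[i][:]
    return g_pos, g_val


def kinked(x):
    # Hypothetical objective: not differentiable at its minimum (1, 1).
    return sum(abs(xi - 1.0) for xi in x)


best_x, best_f = pso_minimize(kinked, bounds=[(-5.0, 5.0)] * 2)
print(best_x, best_f)
```

The optimizer only ever calls `f` to get values, which is the whole point of a gradient-free method. CMA-ES follows the same contract but samples candidates from a Gaussian whose mean and covariance it adapts over time; that is more involved and is not sketched here.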
This video is a sequel to "What is Quantum Machine Learning"
https://www.youtube.com/watch?v=j0DV_75LkFo
and also part of the blog post:
https://www.zapatacomputing.com/why-generative-modeling-is-leading-the-race-to-practical-quantum-advantage/
Introduction: (0:00)
CMA-ES: (1:23)
PSO: (9:17)
Conclusion: (14:00)
Video "How do you minimize a function when you can't take derivatives? CMA-ES and PSO" from the Serrano.Academy channel