Thompson sampling, one armed bandits, and the Beta distribution
Thompson sampling is a strategy to explore a space while exploiting the wins. In this video we see an application to winning at a game of one-armed bandits.
Beta distributions video: https://www.youtube.com/watch?v=juF3r12nM5A
Tom Denton blog: https://inventingsituations.net/
Icons made by Freepik from https://www.flaticon.com
Announcement: Book by Luis Serrano! Grokking Machine Learning. bit.ly/grokkingML
40% discount code: serranoyt
Видео Thompson sampling, one armed bandits, and the Beta distribution канала Serrano.Academy
Beta distributions video: https://www.youtube.com/watch?v=juF3r12nM5A
Tom Denton blog: https://inventingsituations.net/
Icons made by Freepik from https://www.flaticon.com
Announcement: Book by Luis Serrano! Grokking Machine Learning. bit.ly/grokkingML
40% discount code: serranoyt
Видео Thompson sampling, one armed bandits, and the Beta distribution канала Serrano.Academy
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Thompson Sampling : Data Science ConceptsA friendly introduction to Bayes Theorem and Hidden Markov ModelsThe Gini Impurity Index explained in 8 minutes!Clustering: K-means and HierarchicalBinomial distributions | Probabilities of probabilities, part 1Multi-Armed Bandits: A Cartoon Introduction - DCBA #1The Beta distribution in 12 minutes!Reinforcement Learning: Thompson Sampling & The Multi Armed Bandit Problem - Part 01Introduction to sampling distributions | Sampling distributions | AP Statistics | Khan AcademyA Breakthrough in Graph Theory - NumberphileNaive Bayes classifier: A friendly approachLatent Dirichlet Allocation (Part 1 of 2)The medical test paradox, and redesigning Bayes' ruleThe covariance matrixA.I. teaches itself to drive in TrackmaniaAI Learns to Park - Deep Reinforcement LearningMulti-Armed Bandit : Data Science ConceptsYou are much better at math than you thinkSampling Distributions: Deriving the Mean and Variance of the Sample Mean