Contextual Bandit: from Theory to Applications. - Vernade - Workshop 3 - CEB T1 2019
Claire Vernade (Google Deepmind) / 05.04.2019
Contextual Bandit: from Theory to Applications.
Trading exploration versus exploration is a key problem in computer science: it is about learning how to make decisions in order to optimize a long-term cost. While many areas of machine learning aim at estimating a hidden function given a dataset, reinforcement learning is rather about optimally building a dataset of observations of this hidden function that contains just enough information to guarantee that the maximum is being properly estimated. The first part of this talk reviews the main techniques and results known on the contextual linear bandit. We'll mostly rely on the recent book of Lattimore and Szepesvari (2019) [1]. Indeed, real-world problems often don't behave as the theory would like them to. In the second part of this talk, we want to share our experience in applying bandit algorithms in industry [2]. In particular, it appears that while the system is supposed to be interacting with its environment, the customers' feedback is often delayed or missing and does not allow to perform the necessary updates. We propose a solution to this issue, propose some alternative models and architecture, and finish the presentation with open questions on sequential learning beyond bandits.
[1] Lattimore, Tor, and Csaba Szepesvári. Bandit algorithms. preprint (2018).
[2] Vernade, Claire, et al. Contextual bandits under delayed feedback. arXiv preprint arXiv:1807.02089 (2018)
----------------------------------
Vous pouvez nous rejoindre sur les réseaux sociaux pour suivre nos actualités.
Facebook : https://www.facebook.com/InstitutHenriPoincare/
Twitter : https://twitter.com/InHenriPoincare
Instagram : https://www.instagram.com/instituthenripoincare/
*************************************
Langue : Anglais; Date : 05.04.2019; Conférencier : Vernade, Claire; Évenement : Workshop 3 - CEB T1 2019; Lieu : IHP; Mots Clés :
Видео Contextual Bandit: from Theory to Applications. - Vernade - Workshop 3 - CEB T1 2019 канала Institut Henri Poincaré
Contextual Bandit: from Theory to Applications.
Trading exploration versus exploration is a key problem in computer science: it is about learning how to make decisions in order to optimize a long-term cost. While many areas of machine learning aim at estimating a hidden function given a dataset, reinforcement learning is rather about optimally building a dataset of observations of this hidden function that contains just enough information to guarantee that the maximum is being properly estimated. The first part of this talk reviews the main techniques and results known on the contextual linear bandit. We'll mostly rely on the recent book of Lattimore and Szepesvari (2019) [1]. Indeed, real-world problems often don't behave as the theory would like them to. In the second part of this talk, we want to share our experience in applying bandit algorithms in industry [2]. In particular, it appears that while the system is supposed to be interacting with its environment, the customers' feedback is often delayed or missing and does not allow to perform the necessary updates. We propose a solution to this issue, propose some alternative models and architecture, and finish the presentation with open questions on sequential learning beyond bandits.
[1] Lattimore, Tor, and Csaba Szepesvári. Bandit algorithms. preprint (2018).
[2] Vernade, Claire, et al. Contextual bandits under delayed feedback. arXiv preprint arXiv:1807.02089 (2018)
----------------------------------
Vous pouvez nous rejoindre sur les réseaux sociaux pour suivre nos actualités.
Facebook : https://www.facebook.com/InstitutHenriPoincare/
Twitter : https://twitter.com/InHenriPoincare
Instagram : https://www.instagram.com/instituthenripoincare/
*************************************
Langue : Anglais; Date : 05.04.2019; Conférencier : Vernade, Claire; Évenement : Workshop 3 - CEB T1 2019; Lieu : IHP; Mots Clés :
Видео Contextual Bandit: from Theory to Applications. - Vernade - Workshop 3 - CEB T1 2019 канала Institut Henri Poincaré
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Tropical motivic integrationMaking sense of cross-scale dynamics... - Ringsmuth - Workshop 3 - CEB T3 2019Un petit aperçu des matrices de WignerPartitions, Modular Forms and Moduli Spaces - 4/8SHM 18/10/2019 - Comment une approche émique des textes mathématiques... - ProustWistan Marchadour - Réseaux de neurones et application en OncologieWhat is macroscopic quantum information?Jam Maths & SantéIsoperimetric inequalities in high dimensional convex sets (Lecture 1 - Part 1)Promises and challenges of Deep Learning in Cosmology - Lanusse - Workshop 2 - CEB T3 2018Planetary environmental boundaries and their modelling - Gerten - Workshop 3 - CEB T3 2019Nonlinear and Stochastic methods in climate and GFD- Takao - Workshop 1 - CEB T3 2019SHM - 16/01/15 - Constructivismes en mathématiques - Frédéric BrechenmacherEstimating with continuous quantum measurements20/10/2015 - Sergiu Klainerman - Lecture 2 - On The Mathematical Theory of Black HolesSeize the Moments: Enhancing Moment Estimation for Subdiffraction Incoherent ImagingAsymptotic and Probabilistic Aspects of Representation Theory - 1/8Journées Hénon - 19/21 - Pierre HénonThe vacuum-gap transmon qubit...Sochastic process models for precipitation processes... - Neelin - Workshop 1 - CEB T3 2019La recherche mathématique se prend au jeu, du 24 au 27 mai 2018, Place Saint Sulpice