Probabilistic Topic Models and User Behavior
Probabilistic topic models provide a suite of tools for analyzing large document collections. Topic modeling algorithms discover the latent themes that underlie the documents and identify how each document exhibits those themes. Topic modeling can be used to help explore, summarize, and form predictions about documents. Topic modeling ideas have been adapted to many domains, including images, music, networks, genomics, and neuroscience.

Traditional topic modeling algorithms analyze a document collection and estimate its latent thematic structure. However, many collections contain an additional type of data: how people use the documents. For example, readers click on articles in a newspaper website, scientists place articles in their personal libraries, and lawmakers vote on a collection of bills. Behavior data is essential both for making predictions about users (such as for a recommendation system) and for understanding how a collection and its users are organized.

In this talk, I will review the basics of topic modeling and describe our recent research on collaborative topic models, models that simultaneously analyze a collection of texts and its corresponding user behavior. We studied collaborative topic models on 80,000 scientists' libraries from Mendeley and 100,000 users' click data from the arXiv. Collaborative topic models enable interpretable recommendation systems, capturing scientists' preferences and pointing them to articles of interest. Further, these models can organize the articles according to the discovered patterns of readership. For example, we can identify articles that are important within a field and articles that transcend disciplinary boundaries.

More broadly, topic modeling is a case study in the large field of applied probabilistic modeling. Finally, I will survey some recent advances in this field. I will show how modern probabilistic modeling gives data scientists a rich language for expressing statistical assumptions and scalable algorithms for uncovering hidden patterns in massive data.
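The abstract does not spell out the model, so the following is only a rough sketch of how a collaborative topic model might score articles for a reader. It assumes a form in which each article is represented by its topic proportions (estimated from the text) plus an offset (estimated from readership), and a reader's predicted interest is the inner product of their preference vector with that article vector. The topic labels, article names, and all numbers are hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: every name and number here is made up.
# Assumed form: score(user, article) = user_prefs . (topic_props + offset)

def dot(a, b):
    """Inner product of two equal-length lists of floats."""
    return sum(x * y for x, y in zip(a, b))


def predict_score(user_prefs, topic_props, offset):
    """Predicted interest of a user in one article.

    topic_props: what the article's text is about (from the topic model).
    offset: how readership shifts the article away from its text alone.
    """
    article_vec = [t + o for t, o in zip(topic_props, offset)]
    return dot(user_prefs, article_vec)


def rank_articles(user_prefs, articles):
    """Return article names sorted from highest to lowest predicted score."""
    scores = {
        name: predict_score(user_prefs, a["theta"], a["offset"])
        for name, a in articles.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical topics, in order: [machine learning, genomics, networks]
articles = {
    "ml_paper": {"theta": [0.8, 0.1, 0.1], "offset": [0.0, 0.0, 0.0]},
    "genomics_paper": {"theta": [0.1, 0.8, 0.1], "offset": [0.0, 0.0, 0.0]},
    # Same text profile as genomics_paper, but (hypothetically) widely
    # saved by machine learning readers, hence a positive offset there.
    "crossover_paper": {"theta": [0.1, 0.8, 0.1], "offset": [0.5, 0.0, 0.0]},
}

# A hypothetical reader who mostly follows machine learning.
user = [1.0, 0.0, 0.2]

ranking = rank_articles(user, articles)
print(ranking)
```

In this toy setup the two genomics articles have identical text, yet the one with a readership offset ranks higher for the machine learning reader. That is one way to picture an article that "transcends disciplinary boundaries": its readership pulls it away from what its words alone would suggest.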
Video "Probabilistic Topic Models and User Behavior" from the Microsoft Research channel