Visualizing Data Using t-SNE
Google Tech Talk
June 24, 2013
(more info below)
Presented by Laurens van der Maaten, Delft University of Technology, The Netherlands
ABSTRACT
Visualization techniques are essential tools for every data scientist. Unfortunately, the majority of visualization techniques can only be used to inspect a limited number of variables of interest simultaneously. As a result, these techniques are not suitable for big data that is very high-dimensional.
An effective way to visualize high-dimensional data is to represent each data object by a two-dimensional point in such a way that similar objects are represented by nearby points, and that dissimilar objects are represented by distant points. The resulting two-dimensional points can be visualized in a scatter plot. This leads to a map of the data that reveals the underlying structure of the objects, such as the presence of clusters.
We present a new technique to embed high-dimensional objects in a two-dimensional map, called t-Distributed Stochastic Neighbor Embedding (t-SNE), that produces substantially better results than alternative techniques. We demonstrate the value of t-SNE in domains such as computer vision and bioinformatics. In addition, we show how to scale up t-SNE to big data sets with millions of objects, and we present an approach to visualize objects of which the similarities are non-metric (such as semantic similarities).
This talk describes joint work with Geoffrey Hinton.
Видео Visualizing Data Using t-SNE канала Google TechTalks
June 24, 2013
(more info below)
Presented by Laurens van der Maaten, Delft University of Technology, The Netherlands
ABSTRACT
Visualization techniques are essential tools for every data scientist. Unfortunately, the majority of visualization techniques can only be used to inspect a limited number of variables of interest simultaneously. As a result, these techniques are not suitable for big data that is very high-dimensional.
An effective way to visualize high-dimensional data is to represent each data object by a two-dimensional point in such a way that similar objects are represented by nearby points, and that dissimilar objects are represented by distant points. The resulting two-dimensional points can be visualized in a scatter plot. This leads to a map of the data that reveals the underlying structure of the objects, such as the presence of clusters.
We present a new technique to embed high-dimensional objects in a two-dimensional map, called t-Distributed Stochastic Neighbor Embedding (t-SNE), that produces substantially better results than alternative techniques. We demonstrate the value of t-SNE in domains such as computer vision and bioinformatics. In addition, we show how to scale up t-SNE to big data sets with millions of objects, and we present an approach to visualize objects of which the similarities are non-metric (such as semantic similarities).
This talk describes joint work with Geoffrey Hinton.
Видео Visualizing Data Using t-SNE канала Google TechTalks
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![StatQuest: t-SNE, Clearly Explained](https://i.ytimg.com/vi/NEaUSP4YerM/default.jpg)
![Design at Large - Laurens van der Maaten, Visualizing Data Using Embeddings](https://i.ytimg.com/vi/EMD106bB2vY/default.jpg)
![A.I. Experiments: Visualizing High-Dimensional Space](https://i.ytimg.com/vi/wvsE8jm1GzE/default.jpg)
![Machine Intelligence - Lecture 4 (LDA, t-SNE)](https://i.ytimg.com/vi/HyYZWP8S2bA/default.jpg)
![But what is the Fourier Transform? A visual introduction.](https://i.ytimg.com/vi/spUNpyF58BY/default.jpg)
![The Best Way to Visualize a Dataset Easily](https://i.ytimg.com/vi/yQsOFWqpjkE/default.jpg)
![UMAP Uniform Manifold Approximation and Projection for Dimension Reduction | SciPy 2018 |](https://i.ytimg.com/vi/nq6iPZVUxZU/default.jpg)
![Cytobank Overview](https://i.ytimg.com/vi/tVG2b_s4qas/default.jpg)
![StatQuest: Principal Component Analysis (PCA), Step-by-Step](https://i.ytimg.com/vi/FgakZw6K1QQ/default.jpg)
![Tutorial: Make fancy tSNE plots in FlowJo with flow cytometry data](https://i.ytimg.com/vi/GPR_QrevOMY/default.jpg)
![Visualizing Data Using Embeddings | Laurens Van Der Maaten | Design@Large](https://i.ytimg.com/vi/g72uroShwmI/default.jpg)
![Variational Autoencoders](https://i.ytimg.com/vi/9zKuYvjFFS8/default.jpg)
![What is tSNE](https://i.ytimg.com/vi/9_dIZtpkdFU/default.jpg)
![TidyTuesday: Computer Vision with TidyModels and t-SNE](https://i.ytimg.com/vi/864iNqFNzc0/default.jpg)
![](https://i.ytimg.com/vi/5Af_S5hA0oU/default.jpg)
![How to apply t-SNE and interpret its output: Dimensionality reduction Lecture 25@ Applied AI Course](https://i.ytimg.com/vi/PTNfRk0v6YM/default.jpg)
![scRNA-seq: Dimension reduction (PCA, tSNE, UMAP)](https://i.ytimg.com/vi/94ZMJ8tq1Wk/default.jpg)
![StatQuest: A gentle introduction to RNA-seq](https://i.ytimg.com/vi/tlf6wYJrwKY/default.jpg)
![Why T-distributed Stochastic Neighbor Embedding (TSNE) is great for visualization? Math Step By Step](https://i.ytimg.com/vi/9-sjlrNDtc8/default.jpg)
![Tutorial on tSNE and FlowSOM Step-by-Step tool usage in FlowJo V10](https://i.ytimg.com/vi/OQe04SiT8I0/default.jpg)