PyData Tel Aviv Meetup: Visualizing High Dimensional Data (t-SNE) - Gal Yona
Many of us are used to think about data visualization as creating simple 2-variable plots to accompany our slides. In practice, this is hardly the case. I claim that in today's ML landscape - rich with complicated (and often uninterpretable) models, and usually very high dimensional data - good knowledge of data visualization techniques is a crucial tool for any serious practitioner. Like many other concepts in ML, while applying these techniques is usually easy (no more than 2-3 lines in Python) - truly understanding what's going on in the background, and coming up with useful applications, is a much more challenging task. In the first part of the talk, I'll give a short introduction to t-SNE - a beautiful algorithm and more importantly, the state of the art approach to visualizing high dimensional data. We'll talk about how it differs from simple dimensionality reduction techniques (like PCA) and give some intuition into why it usually performs very well. In the second part I'll mention two example applications that will hopefully spark your interest and creativity. The first application is arranging a huge collection of images by their visual similarity in an unsupervised approach. I will focus on some practical challenges I ran into while trying to efficiently implement this solution. The second application is revolves around visualizing the representations learnt by a neural network in order to better understand what is bases its predictions on. It shows how visualization can play a role in the task of "peeking" into the Deep Learning black box.
PyData Tel Aviv Meetup #6
10 August 2017
Sponsored by Oracle
https://www.meetup.com/PyData-Tel-Aviv/
www.pydata.org
PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.
PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.
Видео PyData Tel Aviv Meetup: Visualizing High Dimensional Data (t-SNE) - Gal Yona канала PyData
PyData Tel Aviv Meetup #6
10 August 2017
Sponsored by Oracle
https://www.meetup.com/PyData-Tel-Aviv/
www.pydata.org
PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.
PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.
Видео PyData Tel Aviv Meetup: Visualizing High Dimensional Data (t-SNE) - Gal Yona канала PyData
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![Design at Large - Laurens van der Maaten, Visualizing Data Using Embeddings](https://i.ytimg.com/vi/EMD106bB2vY/default.jpg)
![Dimensionality Reduction: High Dimensional Data, Part 1](https://i.ytimg.com/vi/sIyMAzAHw6I/default.jpg)
![Kai Chang - Visually Exploring Multidimensional Data](https://i.ytimg.com/vi/ypc7Ul9LkxA/default.jpg)
![StatQuest: t-SNE, Clearly Explained](https://i.ytimg.com/vi/NEaUSP4YerM/default.jpg)
![A.I. Experiments: Visualizing High-Dimensional Space](https://i.ytimg.com/vi/wvsE8jm1GzE/default.jpg)
![PyData Ann Arbor: Leland McInnes | PCA, t-SNE, and UMAP: Modern Approaches to Dimension Reduction](https://i.ytimg.com/vi/YPJQydzTLwQ/default.jpg)
![6. Dimensionality reduction of scRNA-seq data](https://i.ytimg.com/vi/qcLJ_JO6bn8/default.jpg)
![Machine Intelligence - Lecture 4 (LDA, t-SNE)](https://i.ytimg.com/vi/HyYZWP8S2bA/default.jpg)
![Aapo Tanskanen - Zero shot text classification without GPT 3 | PyData Helsinki](https://i.ytimg.com/vi/kxf1aKXQYf4/default.jpg)
![Visualizing Data Using t-SNE](https://i.ytimg.com/vi/RJVL80Gg3lA/default.jpg)
![StatQuest: PCA main ideas in only 5 minutes!!!](https://i.ytimg.com/vi/HMOI_lkzW08/default.jpg)
![4 Basic Types of Cluster Analysis used in Data Analytics](https://i.ytimg.com/vi/Se28XHI2_xE/default.jpg)
![Why T-distributed Stochastic Neighbor Embedding (TSNE) is great for visualization? Math Step By Step](https://i.ytimg.com/vi/9-sjlrNDtc8/default.jpg)
![High Dimensional Data Visualization with Clustergrammer2 |SciPy 2020| Nicolas Fernandez](https://i.ytimg.com/vi/BFCj5SIuuRE/default.jpg)
![Ali Ghodsi, Lec 15: t-SNE](https://i.ytimg.com/vi/4GBgqmq0XAY/default.jpg)
![Craziest Game of 2020?](https://i.ytimg.com/vi/YqOGKKR32HE/default.jpg)
![Real-Time Data Pipelines Made Easy with Structured Streaming in Apache Spark | Databricks](https://i.ytimg.com/vi/wQfm4P23Hew/default.jpg)
![Pydata Berlin Meetup February 2021: The Foundation of our Machine Learning Platform at GetYourGuide](https://i.ytimg.com/vi/LJU9_hz9abo/default.jpg)
![Daniel Pyrathon - A practical guide to Singular Value Decomposition in Python - PyCon 2018](https://i.ytimg.com/vi/d7iIb_XVkZs/default.jpg)
![t-SNE vs UMAP](https://i.ytimg.com/vi/4NlvatkpV3s/default.jpg)