High Quality, High Performance Clustering with HDBSCAN | SciPy 2016 | Leland McInnes
Data clustering is a powerful tool for data analysis. It can be particularly useful in exploratory data analysis for helping to summarize and give intuition about a dataset. Despite it's power clustering is used for this task far less frequently than it could be. A plethora of options for clustering algorithms exist, and we will provide a survey of some of the more popular options, discussing their strengths and weaknesses, particularly with regard to exploratory data analysis. Our focus, however, is on a relatively new algorithm that appears to be the best equipped to meet the needs of exploratory data analysis: HDBSCAN* has the strengths of density based algorithms, has a small robust set of parameters, and with suitable implementation can be made highly scalable to large datasets. We will discuss how the algorithm works, taking a few different perspectives, and explain the techniques used for a high performance implementation. Finally we'll discuss ways to extend the algorithm, drawing on ideas from topological data analysis.
More info on HDBSCAN here: https://github.com/lmcinnes/hdbscan.
See the complete SciPy 2016 Conference talk & tutorial playlist here: https://www.youtube.com/playlist?list=PLYx7XA2nY5Gf37zYZMw6OqGFRPjB1jCy6
Видео High Quality, High Performance Clustering with HDBSCAN | SciPy 2016 | Leland McInnes канала Enthought
More info on HDBSCAN here: https://github.com/lmcinnes/hdbscan.
See the complete SciPy 2016 Conference talk & tutorial playlist here: https://www.youtube.com/playlist?list=PLYx7XA2nY5Gf37zYZMw6OqGFRPjB1jCy6
Видео High Quality, High Performance Clustering with HDBSCAN | SciPy 2016 | Leland McInnes канала Enthought
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![HDBSCAN, Fast Density Based Clustering, the How and the Why - John Healy](https://i.ytimg.com/vi/dGsxd67IFiU/default.jpg)
![](https://i.ytimg.com/vi/DO_nId_szEo/default.jpg)
![Leland McInnes, John Healy | Clustering: A Guide for the Perplexed](https://i.ytimg.com/vi/ayZQj4llUSU/default.jpg)
![COMP3425/8410 DBSCAN Clustering in R](https://i.ytimg.com/vi/kv8mNIUcu30/default.jpg)
![Christian Hennig - Assessing the quality of a clustering](https://i.ytimg.com/vi/Mf6MqIS2ql4/default.jpg)
![Leland Mcinnes: Topological Techniques for Unsupervised Learning | PyData LA 2019](https://i.ytimg.com/vi/7pAVPjwBppo/default.jpg)
![Clustering with DBSCAN, Clearly Explained!!!](https://i.ytimg.com/vi/RDZUdRSDOok/default.jpg)
![Clustering (2): Hierarchical Agglomerative Clustering](https://i.ytimg.com/vi/OcoE7JlbXvY/default.jpg)
![Brian Kent: Density Based Clustering in Python](https://i.ytimg.com/vi/5cOhL4B5waU/default.jpg)
![DBSCAN Explanation and Visualization](https://i.ytimg.com/vi/_A9Tq6mGtLI/default.jpg)
![StatQuest: Hierarchical Clustering](https://i.ytimg.com/vi/7xHsRkOdVwo/default.jpg)
![DBSCAN: Part 1](https://i.ytimg.com/vi/sKRUfsc8zp4/default.jpg)
![UMAP Uniform Manifold Approximation and Projection for Dimension Reduction | SciPy 2018 |](https://i.ytimg.com/vi/nq6iPZVUxZU/default.jpg)
![Diff #27 - HDBSCAN, DBSCAN, Density based clustering and metrics](https://i.ytimg.com/vi/VFxby39Vv3M/default.jpg)
![Spatial Machine Learning Explained: Density-Based Clustering and Outlier Detection](https://i.ytimg.com/vi/1f0Q4kbKaZE/default.jpg)
![Clustering in Machine Learning: K Means, Hierarchical, DBSCAN, Difference Between Classification](https://i.ytimg.com/vi/399EoFHrI5U/default.jpg)
![What is dimension reduction in machine learning?](https://i.ytimg.com/vi/cowHdW2-RkU/default.jpg)
![Matrix Expressions and BLAS/LAPACK; SciPy 2013 Presentation](https://i.ytimg.com/vi/nVt24G_2VC0/default.jpg)
![IAML19.4 Agglomerative clustering: dendrogram](https://i.ytimg.com/vi/1jW9xlEtQao/default.jpg)
![Machine Learning with R and TensorFlow](https://i.ytimg.com/vi/atiYXm7JZv0/default.jpg)