Leland McInnes, John Healy | Clustering: A Guide for the Perplexed
PyData DC 2016
Finding clusters is a powerful tool for understanding and exploring data. While the task sounds easy, it can be surprisingly difficult to do it well. Most standard clustering algorithms can, and do, provide very poor clustering results in many cases. We discuss how to do clustering correctly.
Finding clusters is a powerful tool for understanding and exploring data. While the task sounds easy, it can be surprisingly difficult to it well. Most standard clustering algorithms can, and do, provide very poor clustering results in many cases. Our intuitions for what a cluster is are not as clear as we would like, and can easily be lead astray. We will attempt to find a definition of clustering that makes sense for most cases, and introduce an algorithm for finding such clusters, along with a high performance python implementation of the algorithm, building up more intuition for what clustering really means as we go.
Видео Leland McInnes, John Healy | Clustering: A Guide for the Perplexed канала PyData
Finding clusters is a powerful tool for understanding and exploring data. While the task sounds easy, it can be surprisingly difficult to do it well. Most standard clustering algorithms can, and do, provide very poor clustering results in many cases. We discuss how to do clustering correctly.
Finding clusters is a powerful tool for understanding and exploring data. While the task sounds easy, it can be surprisingly difficult to it well. Most standard clustering algorithms can, and do, provide very poor clustering results in many cases. Our intuitions for what a cluster is are not as clear as we would like, and can easily be lead astray. We will attempt to find a definition of clustering that makes sense for most cases, and introduce an algorithm for finding such clusters, along with a high performance python implementation of the algorithm, building up more intuition for what clustering really means as we go.
Видео Leland McInnes, John Healy | Clustering: A Guide for the Perplexed канала PyData
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Christian Hennig - Assessing the quality of a clusteringHDBSCAN, Fast Density Based Clustering, the How and the Why - John HealyA Bluffer's Guide to Dimension Reduction - Leland McInnesBrian Kent: Density Based Clustering in PythonHigh Quality, High Performance Clustering with HDBSCAN | SciPy 2016 | Leland McInnesSebastiaan J. van Zelst: Process Mining in Python | PyData Eindhoven 2019Vincent Warmerdam: Winning with Simple, even Linear, Models | PyData London 2018Machine Learning - Unsupervised Learning - Density Based ClusteringDBSCAN Clustering for Identifying Outliers Using Python - Tutorial 22 in Jupyter NotebookThe Most Powerful Way to Remember What You Study12. ClusteringPyData Ann Arbor: Leland McInnes | PCA, t-SNE, and UMAP: Modern Approaches to Dimension Reduction4 Basic Types of Cluster Analysis used in Data AnalyticsModern Time Series Analysis | SciPy 2019 Tutorial | Aileen NielsenHow To Remember Everything You LearnHierarchical Agglomerative Clustering [HAC - Single Link]Propensity Score Matching in Stata - psmatch2Affinity PropagationLecture 1.3 — Unsupervised Learning — [ Machine Learning | Andrew Ng]