
CODING UNSUPERVISED LEARNING | AGGLOMERATIVE CLUSTERING | MACHINE LEARNING TUTORIAL #shorts 18

In unsupervised learning the data has no labels. The machine simply looks for whatever patterns it can find.

Supervised Learning / Unsupervised Learning
Supervised Learning
Deals with labelled data, where the output patterns are known to the system.
Less complex.
Conducts offline analysis.
Comparatively more accurate and reliable results.
Includes classification and regression.
Unsupervised Learning
Works with unlabelled data, where the output is based only on the patterns the system perceives in the inputs.
More complex.
Performs real-time analysis.
Moderately accurate but reliable results.
Includes clustering and association rule mining problems.
Clustering :
“Clustering” is the process of grouping similar entities together. The goal of this unsupervised machine learning technique is to find similarities among the data points and group similar data points together.
Need of clustering :
To determine the intrinsic grouping in a set of unlabelled data.
To organise data into clusters showing the internal structure of the data.
To partition the data points.
To understand and extract value from large sets of structured and unstructured data.
Types of Clustering :
Hierarchical Clustering : A tree structure made up of a set of nested clusters. There are two types: a.) Agglomerative b.) Divisive
Partitioned clustering : A division of the set of data objects into non-overlapping subsets (clusters) such that every data object is in exactly one subset. Two common examples: a.) K-means b.) Fuzzy C-means (a soft variant in which a point holds a degree of membership in several clusters).

Partitioned clustering / Hierarchical Clustering

Agglomerative Clustering :
In the agglomerative, or bottom-up, clustering method we first assign each observation to its own cluster. Then we compute the similarity (e.g., distance) between each pair of clusters and join the two most similar clusters. Finally, we repeat the compute and join steps until only a single cluster is left.
Divisive Clustering :
In the divisive, or top-down, clustering method we assign all of the observations to a single cluster and then partition that cluster into the two least similar clusters. We then proceed recursively on each cluster until there is one cluster for each observation. There is evidence that divisive algorithms produce more accurate hierarchies than agglomerative algorithms in some circumstances, but they are conceptually more complex.
Working : Hierarchical clustering
Assign each item to its own cluster, so that if you have N items, you now have N clusters.
Find the closest (most similar) pair of clusters and merge them into a single cluster. Now you have one cluster fewer.
Compute the distances (similarities) between the new cluster and every old cluster.
Repeat steps two and three until all items are clustered into a single cluster of size N.
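The four steps above can be sketched in plain Python. This is a minimal illustration, not the code from the video: it assumes Euclidean distance and single linkage, and the names `single_link` and `agglomerate` are made up for the example. It recomputes distances on every pass instead of caching them, which is simple but slow for large N.

```python
from math import dist  # Euclidean distance (Python 3.8+)


def single_link(a, b):
    """Smallest pairwise distance between two clusters (assumed linkage)."""
    return min(dist(p, q) for p in a for q in b)


def agglomerate(points, n_clusters=1):
    """Merge the closest pair of clusters until n_clusters remain.

    Returns the remaining clusters and the list of merges performed.
    """
    # Step 1: every item starts in its own cluster.
    clusters = [[p] for p in points]
    merges = []
    while len(clusters) > n_clusters:
        # Step 2: find the closest (most similar) pair of clusters.
        i, j = min(
            ((i, j) for i in range(len(clusters))
             for j in range(i + 1, len(clusters))),
            key=lambda ij: single_link(clusters[ij[0]], clusters[ij[1]]),
        )
        merges.append((clusters[i], clusters[j]))
        merged = clusters[i] + clusters[j]
        # Step 3 happens implicitly: distances to the merged cluster
        # are recomputed on the next pass through the loop.
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    # Step 4: the loop repeats until the requested number of clusters is left.
    return clusters, merges


points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (8.0, 8.0), (8.0, 9.0)]
clusters, merges = agglomerate(points, n_clusters=2)
for cluster in clusters:
    print(sorted(cluster))
```

Stopping at `n_clusters=2` rather than 1 is a convenience for reading the result; running down to a single cluster and keeping `merges` gives the full hierarchy.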
Distance Measures :
Complete-Linkage Clustering : Find the maximum possible distance between points belonging to two different clusters.
Single-Linkage Clustering : Find the minimum possible distance between points belonging to two different clusters.
Mean-Linkage Clustering (also called average linkage) : Find all possible pairwise distances for points belonging to two different clusters and then calculate the average.
Centroid-Linkage Clustering : Find the centroid of each cluster and calculate the distance between the centroids.
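The four measures differ only in how they reduce the point-to-point distances between two clusters to one number. A small sketch, assuming Euclidean distance and points stored as tuples; the function names are illustrative and not taken from any library.

```python
from itertools import product
from math import dist  # Euclidean distance (Python 3.8+)


def pairwise(a, b):
    """All distances between a point of cluster a and a point of cluster b."""
    return [dist(p, q) for p, q in product(a, b)]


def complete_linkage(a, b):
    return max(pairwise(a, b))  # farthest pair


def single_linkage(a, b):
    return min(pairwise(a, b))  # closest pair


def mean_linkage(a, b):
    d = pairwise(a, b)
    return sum(d) / len(d)  # average over all pairs


def centroid(cluster):
    """Coordinate-wise mean of the points in a cluster."""
    return tuple(sum(xs) / len(cluster) for xs in zip(*cluster))


def centroid_linkage(a, b):
    return dist(centroid(a), centroid(b))


a = [(0.0, 0.0), (0.0, 2.0)]
b = [(3.0, 0.0), (3.0, 2.0)]
for measure in (single_linkage, complete_linkage, mean_linkage, centroid_linkage):
    print(measure.__name__, round(measure(a, b), 3))
```

For any pair of clusters the single-linkage value is never larger than the mean-linkage value, which in turn is never larger than the complete-linkage value.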
K-Means Algorithm : An iterative clustering algorithm that moves toward a local optimum in each iteration by reducing the total squared distance between the points and their cluster centroids.
Steps :
Specify the desired number of clusters K.
Randomly assign each data point to a cluster.
Compute the cluster centroids.
Reassign each point to the closest cluster centroid and recompute the cluster centroids, repeating until the assignments stop changing.
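The steps above can be sketched in plain Python as follows. This is an illustrative sketch under stated assumptions: Euclidean distance, a fixed `seed` so runs are repeatable, a `max_iter` cap as a safety stop, and a simplification in which a cluster that ends up empty is simply dropped. The names `k_means` and `centroid` are made up for the example.

```python
import random
from math import dist  # Euclidean distance (Python 3.8+)


def centroid(points):
    """Coordinate-wise mean of a non-empty list of points."""
    return tuple(sum(xs) / len(points) for xs in zip(*points))


def k_means(points, k, max_iter=100, seed=0):
    """Return (labels, centroids) following the random-assignment steps."""
    rng = random.Random(seed)
    # Step 2: randomly assign each data point to one of the k clusters.
    labels = [rng.randrange(k) for _ in points]
    centroids = {}
    for _ in range(max_iter):
        # Step 3: compute the centroid of every non-empty cluster.
        # (Assumption: an empty cluster is dropped rather than re-seeded.)
        centroids = {}
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                centroids[c] = centroid(members)
        # Step 4: reassign each point to the closest centroid.
        new_labels = [
            min(centroids, key=lambda c: dist(p, centroids[c])) for p in points
        ]
        if new_labels == labels:  # assignments stopped changing
            break
        labels = new_labels
    return labels, centroids


points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
labels, centroids = k_means(points, k=2)
print(labels)
print(centroids)
```

Because the starting assignment is random, different seeds can lead to different final clusters, so it is common to run the procedure several times and keep the best result.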

Video "CODING UNSUPERVISED LEARNING | AGGLOMERATIVE CLUSTERING | MACHINE LEARNING TUTORIAL #shorts 18" from the channel Kartikeya