Week 10: Clustering Flashcards
Clustering
A method of grouping samples based on how similar they are to each other.
K-Means Clustering
A method of clustering where samples are grouped based on the closest centroid. Centroids are recalculated after each iteration by averaging the coordinates of all samples assigned to the centroid.
Fuzzy K-means Clustering
A method assigning a probability of each sample belonging to each centroid. Centroids and probabilities are updated in each iteration.
Iterative Clustering
A subset of clustering that focuses on local optimisation over global optimisation.
Basic Iterative Min-Squared-Error Clustering
It’s a type of iterative clustering that’s essentially K-means clustering done sequential for each sample.
Hierarchical Clustering
A type of clustering where the samples are split into subclusters that have a hierarchy.
Agglomerative Hierarchical Clustering
A type of hierarchical clustering that begins with each sample in its own cluster. Clusters are merged together until only one cluster remains.
Competitive Learning
Similar to K-means, except it’s done sequentially and only the assigned centroid is updated each step.
Basic-leader-follower Clustering
It’s done sequentially, with the first sample being the first centroid. If the distance to the centroid with within a threshold, the centroid is kept and updated. If the distance isn’t within the threshold, a new centroid is created with the current sample.