Hierarchical clustering Flashcards
Hierarchical clustering algorithm operates in _______ fashion and why
Hierarchical clustering algorithms typically operate in a greedy fashion, making locally optimal choices at each step (merging the closest clusters or spitting the largest clusters) without reconsidering previous steps.
Hierarchal clustering is __________-
divide and conquer clustering
Another name of agglomerative clustering
Bottom up approach
Another name of agglomerative clusetring
Top down approach
Hierarchical clustering can be used for what or cant’s be used for what
Hierarchical clustering can be used for outlier detection but not for finding missing values (NA) or detected fake values.
Hierarchical clustering is primarily used for ______ because ________
Hierarchical clustering is primarily used for exploration because it helps in understanding the natural grouping within data which can be very useful in exploratory data analysis.
Hierarchical clustering is _________ visualization
Dendogram visualization
In hierarchical clustering do we need to specify the number of clusters?
No need to specify the number of clusters in hierarchical clustering
How hierarchical clustering provides flexibility or not
It allows you to choose the number of clusters by cutting the dendrogram at different levels, providing flexibility to explore the data at different granularities.
Hierarchal clustering is deterministic or not
Hierarchal clustering is deterministic because it allows a fixed sequence of merging or splitting clusters based on defined criteria like distance
Linkage (definition and types)
Linkage is how to link the clusters
Linkage techniques are two types: Single linkage and complete linkage
Single linkage
* Another name
* Keyword
* Definition
* Formula
- Another name: Nearest neighbour method
- Keyword: shortest distance
- Definition: This linkage technique focused on the shortest distance between data points in each cluster.
Complete linkage
* Another name
* Keyword
* Definition
* Formula
- Another name: Farthest neighbour method
- Keyword: longest distance
- Definition: This linkage technique focused on the longest distance between data points in each cluster.
Agglomerative clustering keyword
Merging approach
Agglomerative clustering use which linkage
can use any linkage
Single linkage or complete linkage