Cluster Analysis Flashcards

1
Q

What type of learning is cluster analysis?

A

Unsupervised

2
Q

What does cluster analysis do?

A

Groups data points into clusters based on shared characteristics

3
Q

In terms of maximizing and minimizing, what does cluster analysis aim to do?

A

Maximize the similarity within a cluster and minimize the similarity between clusters

4
Q

What happens to intra-cluster distances (distances between points in the same cluster)?

A

Distances are minimized

5
Q

What happens to inter-cluster distances (distances between points in different clusters)?

A

Distances are maximized
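
The intra-cluster and inter-cluster cards can be checked numerically. The following is a minimal sketch on hypothetical toy data; the names `clusters`, `mean_intra_distance`, and `mean_inter_distance` are illustrative and not from the deck. It assumes Euclidean distance via `math.dist` (Python 3.8+).

```python
from itertools import combinations
from math import dist

# Hypothetical toy data: two well-separated 2-D clusters.
clusters = {
    "a": [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0)],
    "b": [(10.0, 10.0), (10.0, 11.0), (11.0, 10.0)],
}

def mean_intra_distance(clusters):
    """Average distance between pairs of points that share a cluster."""
    pairs = [
        dist(p, q)
        for pts in clusters.values()
        for p, q in combinations(pts, 2)
    ]
    return sum(pairs) / len(pairs)

def mean_inter_distance(clusters):
    """Average distance between pairs of points in different clusters."""
    pairs = [
        dist(p, q)
        for pts_a, pts_b in combinations(clusters.values(), 2)
        for p in pts_a
        for q in pts_b
    ]
    return sum(pairs) / len(pairs)

intra = mean_intra_distance(clusters)
inter = mean_inter_distance(clusters)
print(intra < inter)  # a good clustering keeps intra small and inter large
```

For a good clustering, `intra` should come out much smaller than `inter`.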

6
Q

What are the two types of clustering methods?

A

Partitional clustering and hierarchical clustering

7
Q

What are the 2 methods of partitional clustering?

A

K-means and k-medoids
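
As a rough illustration of partitional clustering, here is a minimal k-means sketch in plain Python. It assumes the caller supplies the starting centroids and that distance is Euclidean; the function name `k_means` and the sample data are hypothetical. K-medoids follows the same assign-then-update loop but restricts each center to an actual data point.

```python
from math import dist

def k_means(points, centroids, max_iter=100):
    """Lloyd-style k-means: alternate an assignment step and an update step."""
    centroids = [tuple(c) for c in centroids]
    groups = [[] for _ in centroids]
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        groups = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: dist(p, centroids[i]))
            groups[nearest].append(p)
        # Update step: move each centroid to the mean of its group
        # (an empty group keeps its previous centroid).
        new_centroids = [
            tuple(sum(coord) / len(g) for coord in zip(*g)) if g else c
            for g, c in zip(groups, centroids)
        ]
        if new_centroids == centroids:  # nothing moved: converged
            break
        centroids = new_centroids
    return centroids, groups

points = [(1, 1), (1, 2), (2, 1), (8, 8), (8, 9), (9, 8)]
centroids, groups = k_means(points, [(0, 0), (10, 10)])
print(groups)
```

The result depends on the starting centroids, which is why practical implementations usually try several initializations.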

8
Q

What are 2 methods of hierarchical clustering?

A

Agglomerative (bottom-up) and divisive (top-down)
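
The bottom-up idea can be sketched as follows: start with every point in its own cluster and repeatedly merge the two closest clusters. This sketch assumes single linkage (cluster distance is the closest pair of points), which is only one of several possible linkage rules; the function name `agglomerative` and the data are hypothetical.

```python
from math import dist

def agglomerative(points, n_clusters):
    """Bottom-up clustering with single linkage (closest-pair distance)."""
    clusters = [[p] for p in points]  # start: every point is its own cluster
    while len(clusters) > n_clusters:
        best = None
        # Find the pair of clusters with the smallest single-linkage distance.
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = min(dist(p, q) for p in clusters[i] for q in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]  # merge the closest pair
        del clusters[j]
    return clusters

points = [(0, 0), (0, 1), (5, 5), (5, 6), (20, 20)]
result = agglomerative(points, 3)
print(result)
```

A divisive method runs in the opposite direction: it starts from one cluster containing everything and splits repeatedly.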

9
Q

What are the 4 types of clusters?

A
  1. Center-based
  2. Contiguous
  3. Density-based
  4. Conceptual
10
Q

What are center-based clusters?

A

Clusters defined by their center (a centroid or a medoid)

11
Q

What are the two ways to define the center of a cluster?

A
  1. Centroid: the average of all the points in the cluster
  2. Medoid: the most representative point in the cluster
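
The difference between the two centers is easy to see in code. This sketch assumes "most representative" means the data point with the smallest total Euclidean distance to the other points, which is the usual medoid definition; the function names and data are hypothetical.

```python
from math import dist

def centroid(points):
    """Coordinate-wise mean; usually not one of the data points."""
    return tuple(sum(coord) / len(points) for coord in zip(*points))

def medoid(points):
    """The data point with the smallest total distance to all the others."""
    return min(points, key=lambda p: sum(dist(p, q) for q in points))

# Hypothetical cluster with one outlier at x = 14.
cluster = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (3.0, 0.0), (14.0, 0.0)]
print(centroid(cluster))  # (4.0, 0.0): pulled toward the outlier
print(medoid(cluster))    # (2.0, 0.0): an actual member of the cluster
```

Because the medoid must be a real data point, it is less sensitive to outliers than the centroid.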
12
Q

What is a contiguous cluster?

A

A cluster formed based on the proximity of data points, as in hierarchical clustering

13
Q

What is density-based clustering? Give an example.

A

Clusters are identified as dense regions of points; DBSCAN is an example
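
To make "dense regions" concrete, here is a simplified DBSCAN-style sketch in plain Python. It is an illustration under stated assumptions, not a reference implementation: the parameter names `eps` (neighborhood radius) and `min_pts` (minimum neighbors, counting the point itself) follow common usage, neighbors are found by brute force, and the data is hypothetical.

```python
from math import dist

def dbscan(points, eps, min_pts):
    """Label each point with a cluster id (0, 1, ...) or -1 for noise."""
    def neighbors(i):
        # Brute-force radius query; includes the point itself.
        return [j for j in range(len(points)) if dist(points[i], points[j]) <= eps]

    labels = [None] * len(points)
    cluster_id = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = -1  # provisional noise; may become a border point later
            continue
        cluster_id += 1  # i is a core point: start a new cluster
        labels[i] = cluster_id
        queue = list(seeds)
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster_id  # noise reachable from a core point: border point
            if labels[j] is not None:
                continue  # already assigned
            labels[j] = cluster_id
            j_neighbors = neighbors(j)
            if len(j_neighbors) >= min_pts:
                queue.extend(j_neighbors)  # j is also a core point: keep expanding
    return labels

points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11),
          (50, 50)]
labels = dbscan(points, eps=1.5, min_pts=3)
print(labels)  # two dense groups plus one noise point labeled -1
```

Unlike k-means, the number of clusters is not chosen in advance; it falls out of `eps` and `min_pts`.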

14
Q

What is conceptual clustering? Give an example.

A

Clusters are formed based on shared underlying concepts or models; latent class analysis is an example

15
Q

What notation expresses the similarity between two points in clustering?

A

d(x, y), the distance between points x and y

16
Q

What is the most popular similarity measure in clustering?

A

Euclidean distance
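
For reference, Euclidean distance is the square root of the summed squared coordinate differences. A minimal sketch, with the function name `euclidean` chosen here for illustration (the standard library's `math.dist` computes the same quantity):

```python
from math import dist, sqrt

def euclidean(x, y):
    """Straight-line distance between two equal-length numeric vectors."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

d = euclidean((0, 0), (3, 4))
print(d)                    # 5.0 (the 3-4-5 right triangle)
print(dist((0, 0), (3, 4))) # same value from the standard library
```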

17
Q

What kind of data does Euclidean distance work well with?

A

Numeric, continuous data in spherical clusters

18
Q

Name three more distance measures suited to clustering.

A
  1. Correlation
  2. Cosine similarity
  3. Manhattan distance
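
The three measures in this list can be sketched in a few lines. The function names and sample vectors are hypothetical; correlation is taken here to mean Pearson correlation, which is an assumption since the deck does not say which kind. Cosine and correlation are similarities, so a distance is commonly formed as 1 minus the similarity.

```python
from math import sqrt

def manhattan(x, y):
    """Sum of absolute coordinate differences (city-block distance)."""
    return sum(abs(a - b) for a, b in zip(x, y))

def cosine_similarity(x, y):
    """Cosine of the angle between x and y; ignores vector length."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = sqrt(sum(a * a for a in x))
    norm_y = sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)

def pearson_correlation(x, y):
    """Pearson correlation: cosine similarity of the mean-centered vectors."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    return cosine_similarity([a - mean_x for a in x], [b - mean_y for b in y])

x = [1.0, 2.0, 3.0]
y = [2.0, 4.0, 6.0]  # same direction and pattern as x, twice the magnitude
print(manhattan(x, y))            # 6.0
print(cosine_similarity(x, y))    # approximately 1.0
print(pearson_correlation(x, y))  # approximately 1.0
```

Here `x` and `y` are far apart by Manhattan distance yet maximally similar by cosine and correlation, because those two measures compare direction and pattern rather than magnitude.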
19
Q

What is the correlation distance measure?

A

Correlation-based similarity: two points are similar when their values are highly correlated

20
Q

What kind of data is the cosine similarity measure suited for?

A

For high-dimensional text data or sparse vectors