Clustering Flashcards

1
Q

What is the difference beetween unsupervided learning and supervised learning?

A

10 / 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is clustering? What are the most common techniques?

A

10 / 2-3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Formulate the clustering problem (input & output, distance and similarity)

A

10 / 10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define the k-means clustering problem, in addition specify the different objective cost functions

A

10 / 14 and 16

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Write down the formula for a center given a cluster C_i

A

10 / 17

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Describe verbally a naive way to solve the k-means clustering, what is the complexity of this approach?

A

10 / 18-19

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How the Lloyd’s algorithm works? Which are the most commonly used convergence criteria? What’s the complexity? What’s the upper bound on the number of iterations?

A

10 / 20-24

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Describe the k-means++ algorithm. What problem is trying to solve?

A

10 / 26

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How the linkage-based clustering works? What are the 2 parameters that it needs?

A

10 / 29-30

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a dendrogram? What it represents?

A

10 / 31

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the common approach to choose the number k of clusters? What is the most common score to evaluate clusters?

A

10 / 36

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe all the formulas in the silhouette of a clustering C and its meaning

A

10 / 37-38

How well did you know this?
1
Not at all
2
3
4
5
Perfectly