Book - Chapter 4 clustering Flashcards

1
Q

What is clustering

A

Is the uses unsupervised techniques for grouping similar objects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the centre of a K means cluster

A

Arithmetic average

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

In case it means are the clusters numerical or categorical

A

Numerical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the input of K means

A

Euclidean distance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the outcome of K means

A

A cluster centre.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Clustering is primarily an exploratory technique to discover what

A

Hidden structures of the data, possibly as a prelude to more focused analysis or decision processes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the use cases of K beans

A

Image processing, medical, and customer segmentation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How would you find out the value of K

A

By using within the sum of squares (WSS)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is WSS

A

The sum of the squares of the distances between each Datapoint and the closest centroid

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What do you do if you’re missing expected splits

A

Increase K

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What do you do if clusters have few data points

A

Decrease K

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What do you do if the centroids are close together

A

Decrease K

How well did you know this?
1
Not at all
2
3
4
5
Perfectly