K-means Flashcards
What type of learning method is k-means?
Unsupervised learning
What does an unsupervised learning method mean?
> All the data is unlabeled
This means that there is no correct answer
We may not know the answers
What is clustering?
> This is dividing a data set into groups which are similar to each other
There are lots of different ways of dividing a data set
We can plot graphs of the results
Why is scale imporant for k-means?
Because clusters may have sub-clusers if you look closer
What is the k-means method?
- Chose the number of clusters (k)
- Place k-centroids randomly
- Identify the points closest to the centroid
- Compute the new centroid clusers
- Repeat until centroids do not move
What are the pros of k-means?
> Very easy to implement
> Easy to explain
What are the cons of k-means?
> You have to chose the number of clusters
> Subject to local minima because of the initial positions of the centroids (so random restarts are required)