Unsupervised Learning Flashcards
Learning method where TARGET LABELS are unknown
Tries to find the innate structure of data
Eg clustering
Unsupervised learning
Groups the data into K CLUSTERS based on data’s DISTANCE FROM EACH CLUSTER’S CENTER
K means clustering
Each cluster’s centroid is considered a
Centroid
What is the algorithm of K Means clustering
> initialize k centroids RANDOMLY
for each data, compute distance from each centroid
get average of coordinates of data assigned to each centroid
if centroid stay the same, stop, else compute distances again
What do you do if a centroid has no data point?
Restrat the algorithm and choose different centroids
True or false: kmeans always produces the optimal solution
False. It can get stuck to a LOCAL optimum especially if value of k is TOO SMALL