Unsupervised learning Flashcards
For what tasks can we use unsupervised learning?
Dimensionality reduction
Anomaly detection
Visualization
What are some challenges of k-means clustering?
Clusters tend to be the same size
Depends on initialization
Handles anisotropic data and non-linearities poorly
How can we improve k-means for non-linear datasets?
Change the cost function from Euclidean distance to a geodesic / graph-based / kernel-based distance.
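A minimal sketch of the graph-based route using scikit-learn's SpectralClustering on the two-moons toy set; the dataset and parameter choices are illustrative assumptions, not part of the card:

```python
from sklearn.datasets import make_moons
from sklearn.cluster import KMeans, SpectralClustering

# Two interleaving half-moons: plain k-means with Euclidean distance splits
# them badly, while a nearest-neighbour graph affinity recovers both moons.
X, _ = make_moons(n_samples=300, noise=0.05, random_state=0)

km_labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
sc_labels = SpectralClustering(n_clusters=2, affinity="nearest_neighbors",
                               n_neighbors=10, random_state=0).fit_predict(X)
```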
What is the elbow method?
A method for determining the number of clusters: stop increasing the number of clusters once the marginal reduction in within-cluster variance (inertia) becomes small.
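A minimal sketch of the elbow method with scikit-learn's KMeans; the toy data and the range of k are assumptions:

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 2))          # toy data; replace with your dataset

# Within-cluster sum of squares (inertia) for increasing k;
# pick the k where the curve "bends" and further gains are small.
inertias = [KMeans(n_clusters=k, n_init=10, random_state=0).fit(X).inertia_
            for k in range(1, 11)]
```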
What are the steps of PCA?
- Create design matrix
- Center data
- U D V^T = SVD((1/(N-1)) X X^T), i.e. the SVD of the sample covariance (X centered, samples as columns)
- Keep the k eigenvectors (columns of U) with the largest eigenvalues
- Projection: v* = U^T x
- Reconstruction: x ≈ U v*
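A numpy sketch of these steps, following the card's convention that samples are the columns of X; the toy data and the choice k = 2 are assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 100))          # design matrix: 5 features, 100 samples

mu = X.mean(axis=1, keepdims=True)     # center the data
Xc = X - mu

C = (Xc @ Xc.T) / (Xc.shape[1] - 1)    # sample covariance (d x d)
U, D, _ = np.linalg.svd(C)             # symmetric PSD: columns of U are eigenvectors

k = 2
Uk = U[:, :k]                          # keep k eigenvectors with largest eigenvalues
V = Uk.T @ Xc                          # projection: v* = U^T x
X_rec = Uk @ V + mu                    # reconstruction: x ≈ U v* + mu
```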
Describe the steps in AAM (Active appearance models)
- Calculate the shape model (mean + eigenmodes across the dataset)
- Warp the image so it fits the landmark template
- Create the appearance model using PCA on the “shape-free patches”
- Apply PCA jointly to the shape and appearance parameters to capture correlations.
What are eigenfaces?
“Eigenvectors” created from face images using PCA. These eigenfaces can be used as a basis to reconstruct face images.
What are some interpretations of the PCA?
- Best L2 reconstruction error among all linear models of equal rank
- The k’th eigenvector is the direction of maximal variance orthogonal to all previous eigenvectors.
- PCA fits an ellipsoid to the data.
How can we interpret PCA probabilistically?
x = Uw + mu + e
w ~ N(0,I)
e ~ N(0, sigma^2*I)
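A small numpy sketch sampling from this generative model; the dimensions and parameter values are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
d, k, N = 10, 2, 500                            # observed dim, latent dim, samples
U = np.linalg.qr(rng.normal(size=(d, k)))[0]    # orthonormal loading directions
mu = rng.normal(size=(d, 1))
sigma = 0.1

w = rng.normal(size=(k, N))                     # w ~ N(0, I)
e = sigma * rng.normal(size=(d, N))             # e ~ N(0, sigma^2 I)
x = U @ w + mu + e                              # x = Uw + mu + e
```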
How can we enforce sparsity on the PCA?
Add a sparsity penalty on the weights (L0, L1, …)
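A minimal sketch using scikit-learn's SparsePCA, which adds an L1 penalty on the components; the toy data and the value of alpha are assumptions:

```python
import numpy as np
from sklearn.decomposition import SparsePCA

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 30))            # toy data; replace with your dataset

spca = SparsePCA(n_components=5, alpha=1.0, random_state=0).fit(X)
# Larger alpha -> stronger L1 penalty -> more zero entries in the components.
sparsity = np.mean(spca.components_ == 0)
```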
What is one of the main advantages of sparse penalties in the PCA setting?
They can remove noise.
What are the main disadvantages of autoencoders compared to PCA?
- Visualizing the latent space is harder
- Generating new samples is harder
How can AE be used for image anomaly detection?
Train the autoencoder on “normal” data only. At test time, reconstruct a new image and subtract the reconstruction from the original; large residuals indicate anomalies.
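A sketch of the test-time step; reconstruct is a hypothetical stand-in for a trained autoencoder's forward pass, and the threshold is an assumption:

```python
import numpy as np

def reconstruct(image):
    # Placeholder: a real autoencoder trained on "normal" data goes here.
    return image * 0.95

image = np.random.rand(64, 64)                 # new image to test
residual = np.abs(image - reconstruct(image))  # per-pixel reconstruction error
anomaly_mask = residual > 0.1                  # threshold chosen on validation data
score = residual.mean()                        # image-level anomaly score
```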
What is the difference between GMM and fully Bayesian GMM?
In a fully Bayesian GMM we place (hyper)priors on the mixing weights pi_k and on mu_k, sigma_k. A Dirichlet prior on the mixing weights usually favors fewer non-zero clusters, and priors on mu_k, sigma_k can prevent clusters from degenerating to zero variance.
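A minimal comparison sketch with scikit-learn; the toy data and the prior strength are assumptions. The Dirichlet-process prior typically leaves most of the 10 allowed components with near-zero weight:

```python
import numpy as np
from sklearn.mixture import GaussianMixture, BayesianGaussianMixture

rng = np.random.default_rng(0)
# Toy data with 3 true clusters
X = np.vstack([rng.normal(loc=m, scale=0.5, size=(100, 2)) for m in (-4, 0, 4)])

gmm = GaussianMixture(n_components=10, random_state=0).fit(X)
bgmm = BayesianGaussianMixture(n_components=10,
                               weight_concentration_prior_type="dirichlet_process",
                               weight_concentration_prior=0.01,
                               random_state=0).fit(X)
# gmm.weights_ spreads mass over all 10 components;
# bgmm.weights_ concentrates on roughly 3 of them.
```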
What is variational Bayesian inference?
When we cannot compute the posterior p(theta | X) analytically, we approximate it with a distribution q(theta) from a tractable family Q, typically chosen by minimizing the KL divergence KL(q(theta) || p(theta | X)) (equivalently, maximizing the ELBO).