Lecture 11 - Clustering: EM Algorithm for GMM Flashcards

1
Q

What is a Gaussian Mixture Model (GMM)?

A

A probabilistic model representing data as a mixture of Gaussian distributions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Priors - 𝜋𝑖

A

𝜋𝑖 : Probabilities of each cluster.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Means - 𝜇𝑖

A

𝜇𝑖: Centers of Gaussian clusters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Covariances - Σ𝑖

A

Σ𝑖: Shapes and spreads of clusters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why use GMM instead of K-Means?
GMM vs. K-Means

A

Soft Clustering: Assigns probabilities to clusters, not hard assignments.

Handles Arbitrary Shapes: Covariances allow ellipsoid-shaped clusters.

Probabilistic Reasoning: More robust for complex data distributions.

Mnemonic: “GMM Gives More Meaningful shapes.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the Expectation-Maximization (EM) Algorithm?

A

An iterative algorithm to optimize parameters (π,μ,Σ) in GMM

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What happens in the E-Step?

A

Compute the posterior probability hi,t that a point belongs to a cluster:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What happens in the M-Step?

A

Update parameters using hi,j.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How does EM converge?

A

Converges when the log-likelihood stabilizes.

Alternative: Stop when the change in log-likelihood is below a threshold.

Tip: Imagine EM climbing a mountain of likelihood to the peak (maximum).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Challenges with GMM and EM

A

Initialization Sensitivity: Poor starts may lead to local optima.

Choosing 𝑘: Methods include Elbow, BIC, and AIC.

Computational Cost: Iterative nature makes it slow for large datasets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Applications of GMM. Where is GMM used?

A

Image Segmentation: Dividing an image into meaningful regions.

Speech Recognition: Modeling phonemes or accents.

Anomaly Detection: Identifying outliers with low probabilities.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly