Gaussian Mixture Model Flashcards

1
Q

What is the goal of Gaussian mixture models (GMM)?

A

To model the probability distribution of data by assuming it is generated from a mixture of several Gaussian distributions. Each Gaussian component in the mixture has its own mean and covariance, and the GMM assigns a mixing probability (weight) to each Gaussian.

GMM aims for flexibility in modeling distributions.
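As a minimal sketch of this idea (1-D components; the mixture parameters below are made-up illustrations, not from the cards):

```python
import numpy as np

def normal_pdf(x, mu, var):
    # Density of a 1-D Gaussian with mean mu and variance var.
    return np.exp(-(x - mu) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)

def gmm_pdf(x, pis, mus, vars_):
    # p(x) = sum_k pi_k * N(x | mu_k, var_k)
    return sum(p * normal_pdf(x, m, v) for p, m, v in zip(pis, mus, vars_))

# Two-component mixture (hypothetical parameters)
pis, mus, vars_ = [0.3, 0.7], [0.0, 4.0], [1.0, 2.0]
xs = np.linspace(-10.0, 15.0, 2001)
density = gmm_pdf(xs, pis, mus, vars_)
```

Because each component is a valid density and the weights sum to 1, the mixture is itself a valid density.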

2
Q

What is the common strategy for obtaining a flexible distribution in GMM?

A

To make p(x) a combination of simpler, more tractable elements.

3
Q

What is the definition of the multivariate normal distribution for an M-dimensional vector?

A

For an M-dimensional vector x, the density is N(x | μ, Σ) = (2π)^(−M/2) |Σ|^(−1/2) exp(−½ (x − μ)ᵀ Σ⁻¹ (x − μ)), where μ is the M-dimensional mean vector and Σ is the M×M covariance matrix.
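A minimal numpy sketch of the multivariate normal density (the test point and parameters are made up for illustration):

```python
import numpy as np

def mvn_pdf(x, mu, Sigma):
    # N(x | mu, Sigma) for an M-dimensional x:
    # (2*pi)^(-M/2) * |Sigma|^(-1/2) * exp(-0.5 * (x-mu)^T Sigma^{-1} (x-mu))
    M = len(mu)
    diff = x - mu
    quad = diff @ np.linalg.solve(Sigma, diff)
    norm = (2 * np.pi) ** (-M / 2) * np.linalg.det(Sigma) ** (-0.5)
    return norm * np.exp(-0.5 * quad)

x = np.array([0.5, -0.2])
mu = np.zeros(2)
Sigma = np.eye(2)
p = mvn_pdf(x, mu, Sigma)
```

With Σ = I this reduces to a product of independent 1-D standard normals.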

4
Q

What does K represent in the context of GMM?

A

The number of components (or clusters) in the model.

5
Q

What is a one-hot vector in GMM?

A

A binary vector where exactly one entry is 1 and all other entries are 0.

6
Q

What does the binary variable z indicate?

A

It is a one-hot indicator of which Gaussian component generated the data point. For example, with K = 4 components,
z = [0, 1, 0, 0]
means the point was generated by component 2.
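The generative story (pick a component via z, then draw x from that Gaussian) can be sketched as follows; the 1-D components and their parameters here are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
pis = np.array([0.2, 0.5, 0.3])           # mixing probabilities, sum to 1
mus = np.array([-2.0, 0.0, 3.0])
sigmas = np.array([0.5, 1.0, 0.8])

def sample(n):
    xs, zs = [], []
    for _ in range(n):
        k = rng.choice(len(pis), p=pis)       # pick a component with prob pi_k
        z = np.eye(len(pis), dtype=int)[k]    # one-hot indicator, e.g. [0, 1, 0]
        xs.append(rng.normal(mus[k], sigmas[k]))
        zs.append(z)
    return np.array(xs), np.array(zs)

x, z = sample(1000)
```

Averaging the one-hot vectors recovers the mixing probabilities approximately.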

7
Q

What are the mixing probabilities in GMM?

A

They are the weights π_k of the Gaussian components in the mixture; π_k is the prior probability that a data point comes from component k, with π_k ≥ 0 and Σ_k π_k = 1.

8
Q

What does the log-likelihood function represent in the EM algorithm?

A

How well the model explains the observed data X.

9
Q

What are the two main steps of the EM algorithm?

A
  • Expectation step (E-step)
  • Maximization step (M-step)
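A compact sketch of both steps for a 1-D two-component mixture (the synthetic data and initial parameters are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(1)
# Synthetic 1-D data drawn from two well-separated groups
x = np.concatenate([rng.normal(-2, 1, 300), rng.normal(3, 1, 300)])

K = 2
pi = np.full(K, 1 / K)          # mixing probabilities
mu = np.array([-1.0, 1.0])      # initial means
var = np.ones(K)                # initial variances

for _ in range(50):
    # E-step: responsibilities gamma[n, k] ∝ pi_k * N(x_n | mu_k, var_k)
    dens = np.exp(-(x[:, None] - mu) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)
    gamma = pi * dens
    gamma /= gamma.sum(axis=1, keepdims=True)

    # M-step: re-estimate parameters from the soft counts N_k
    Nk = gamma.sum(axis=0)
    mu = (gamma * x[:, None]).sum(axis=0) / Nk
    var = (gamma * (x[:, None] - mu) ** 2).sum(axis=0) / Nk
    pi = Nk / len(x)
```

After convergence the estimated means sit near the two group centers and the mixing probabilities near 0.5 each.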
10
Q

What is the purpose of the E-step in the EM algorithm?

A

To calculate the probability (responsibility) that each data point belongs to component k.

11
Q

How does the M-step update model parameters in the EM algorithm?

A

Using the responsibilities as soft assignment weights, it re-estimates each component's mean, covariance, and mixing probability so that the model fits the data better with every iteration.

12
Q

What is the main advantage of GMM in density estimation?

A

It provides a flexible way to represent continuous densities.

13
Q

What is a kernel density estimator (KDE)?

A

A non-parametric tool for estimating the data’s probability density, providing a flexible alternative to the Gaussian mixture model without assuming a specific distribution.
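A minimal Gaussian-kernel KDE sketch (numpy only; the bandwidth value and synthetic data are assumptions for illustration):

```python
import numpy as np

def kde(x_eval, data, lam):
    # Average of Gaussian kernels of width lam centered at each data point.
    diffs = x_eval[:, None] - data[None, :]
    kernels = np.exp(-diffs ** 2 / (2 * lam ** 2)) / np.sqrt(2 * np.pi * lam ** 2)
    return kernels.mean(axis=1)

rng = np.random.default_rng(2)
data = rng.normal(0, 1, 500)
xs = np.linspace(-5, 5, 1001)
dens = kde(xs, data, lam=0.4)
```

No distributional assumption is made: the estimate is driven entirely by the data and the kernel width.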

14
Q

What does the kernel width λ control in KDE?

A

The size of the kernel’s ‘spread’ around each data point.

15
Q

What does a small λ in KDE lead to?

A

A very detailed density estimate, possibly leading to overfitting.

16
Q

What is the purpose of leave-one-out cross-validation in KDE?

A

To avoid overestimating the density at the training points: when evaluating the density at x_i, the i-th training sample is excluded from the estimate, so no point is used to explain itself.
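One way to sketch the leave-one-out idea for picking the kernel width: score each candidate λ by the log-density of every training point under a KDE built from all the other points (the bandwidth grid and data here are illustrative assumptions):

```python
import numpy as np

def loo_log_likelihood(data, lam):
    # Pairwise Gaussian kernel values; zero the diagonal so x_i never explains itself.
    diffs = data[:, None] - data[None, :]
    K = np.exp(-diffs ** 2 / (2 * lam ** 2)) / np.sqrt(2 * np.pi * lam ** 2)
    np.fill_diagonal(K, 0.0)
    p_loo = K.sum(axis=1) / (len(data) - 1)
    return np.log(p_loo).sum()

rng = np.random.default_rng(3)
data = rng.normal(0, 1, 400)
lams = [0.05, 0.3, 1.0, 3.0]
best = max(lams, key=lambda l: loo_log_likelihood(data, l))
```

Very small λ overfits (tiny leave-one-out densities in the tails) and very large λ oversmooths, so an intermediate width wins.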

17
Q

What is Average Relative Density (ARD)?

A

A method for identifying outliers by comparing density at a point x with average density of its K-nearest neighbors.

18
Q

How is density around a point x computed in ARD?

A

As the inverse of the average distance to the K-nearest neighbors.
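A sketch of the ARD computation (numpy only; the choice K = 5 and the 2-D test data with one planted outlier are illustrative assumptions):

```python
import numpy as np

def knn_density(X, i, K):
    # density(x_i) = 1 / (average distance to the K nearest neighbors of x_i)
    d = np.linalg.norm(X - X[i], axis=1)
    d[i] = np.inf                      # exclude the point itself
    nn = np.argsort(d)[:K]
    return 1.0 / d[nn].mean(), nn

def ard(X, i, K):
    # Ratio of the point's density to the average density of its neighbors.
    dens_i, nn = knn_density(X, i, K)
    neigh_dens = np.mean([knn_density(X, j, K)[0] for j in nn])
    return dens_i / neigh_dens

rng = np.random.default_rng(4)
X = rng.normal(0, 1, size=(100, 2))
X = np.vstack([X, [[8.0, 8.0]]])       # one far-away outlier
scores = [ard(X, i, K=5) for i in range(len(X))]
```

The planted outlier gets a much lower ARD score than the cluster points, which sit near 1.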

19
Q

What is a key limitation of GMM in outlier detection?

A

It can be affected by initialization and may have difficulties with regions of different density.

20
Q

What is the initialization process for GMM?

A

Start with K Gaussian components, each given an initial mean, covariance, and mixing weight (e.g., chosen randomly or from a k-means clustering).

21
Q

What are the benefits of using GMM?

A
  • Flexible modeling of complex distributions
  • Probabilistic framework
  • Unsupervised learning capability
22
Q

What are the limitations of GMM?

A
  • Assumption of Gaussianity
  • Sensitivity to initialization
  • Overfitting potential
23
Q

What does the ARD ratio indicate?

A

An ARD value well below 1 suggests that x lies in a lower-density region than its neighbors, flagging it as a potential outlier.

24
Q

What is GMM?

A

GMM is a probabilistic model that assumes the data is generated from a mixture of several Gaussian components (clusters).

25
Q

What is p(x | z_1 = 1) = N(x | μ_1, Σ_1)?

A

The probability density of the data point x given that it was generated by the first Gaussian component.

26
Q

What does π_k mean?

A

The mixing proportion (weight) of the k-th Gaussian component; the weights of all components must sum to 1.

27
Q

What is N(x | μ_k, Σ_k)?

A

The normal density of the k-th Gaussian component, giving the likelihood that the data point x was generated by component k.

28
Q

How is KDE more flexible?

A

KDE is more flexible when you have no prior knowledge of the data distribution and want to estimate the density without strong assumptions (i.e., without assuming the data is Gaussian).

29
Q

What is a sign of a bad GMM?

A

The clusters formed by the Gaussian components look visually poor, or the log-likelihood never stabilizes; both indicate a bad model, and KDE might be a useful alternative.

30
Q

When is GMM often used?

A

For tasks like clustering, density estimation, and anomaly detection.