LDA Flashcards

1
Q

assumptions of LDA

A

linear decision boundary (linear discriminant function, or the log odd ratio is a linear function of X)
the class-conditional distribution of X is a multivariate Gaussian with the same covariance matrix

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

discriminant function of LDA

A

a linear function of X

delta_k(x) = x^T \Sigma^-1 mu_k - 1/2 mu_k^T \Sigma^-1 mu_k + log(pi_k)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

how to estimate the parameters for X distribution

A
pi_k = n_k / N
mu_k = avg(x_i) when g_i = k
Sigma = avg_k (Sigma_k) where Sigma_k = Var(X) for X in class k
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

QDA (Quadratic Discriminant Analysis)

A

covariance matrix Sigma_k are different for differen k, the decision boundary is not linear but quadratic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Pros and Cons of LDA (Compared to Logistic Regression)

A

Pros:
* the estimated parameters have less variance if the assumption about X distribution are correct
* can use information from Xs without Y labels to inform Sigma
* well-defined in the case of perfectly separated data
Cons:
* more assumptions about X distribution
* less robust against outliers in the X space (far from decision boundary)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly