Principal Component Analysis Flashcards

1
Q

Curse of Dimensionality

A
  • standard regression/classification techniques can become:
  • ill-defined for M >> N (more features than samples)
  • ill-conditioned / numerically unstable even for M < N
  • increasing the dimensionality exponentially increases the volume of the space, so the data becomes sparse
  • the amount of data needed for a reliable result often grows exponentially with the dimensionality
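A minimal sketch of the exponential growth: to cover the unit hypercube [0,1]^M with a fixed resolution per axis, the number of grid cells (and hence the sample count needed for uniform coverage) grows exponentially with M. The choice of 10 bins per axis is an illustrative assumption.

```python
# Sketch: cells needed to cover [0,1]^M at a fixed per-axis resolution.
# The 10-bins-per-axis default is an arbitrary, illustrative choice.

def cells_needed(m, bins_per_axis=10):
    """Grid cells required to cover the unit hypercube in m dimensions."""
    return bins_per_axis ** m

for m in (1, 2, 5, 10):
    print(m, cells_needed(m))  # 10, 100, 100000, 10000000000
```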
2
Q

Regularization

A
  • Idea: impose constraints on the parameters to stabilize solution
  • example: introduce a prior probability over the parameters
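A sketch of this idea as ridge regression: a Gaussian prior on the weights corresponds to an L2 penalty, which makes the normal equations well conditioned even in the M >> N regime from the previous card. The data, variable names, and lam = 0.1 are illustrative assumptions.

```python
import numpy as np

# Sketch: regularization via an L2 penalty (Gaussian prior on w).
# With M = 10 features and only N = 5 samples, X^T X is singular,
# but (X^T X + lam I) is invertible, so the solution is stabilized.
rng = np.random.default_rng(0)
X = rng.standard_normal((5, 10))   # N = 5 samples, M = 10 features
y = rng.standard_normal(5)

lam = 0.1  # illustrative regularization strength
# Regularized normal equations: (X^T X + lam I) w = X^T y
w = np.linalg.solve(X.T @ X + lam * np.eye(10), X.T @ y)
print(np.isfinite(w).all())  # True: solution exists despite rank deficiency
```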
3
Q

Maximum a posteriori (MAP) approach

A
  • choose the parameters that maximize the posterior: w_MAP = argmax_w p(w | X) ∝ p(X | w) · p(w)
  • the prior p(w) acts as a regularizer on the likelihood fit
4
Q

Dimensionality Reduction

A
  • Goal: reduce the data to the features most relevant for the learning task
  • e.g. significance tests for single features; finding relevant directions/subspaces in correlated data

WHY?

  • visualization
  • better generalization
  • speeding up learning
  • data compression
5
Q

Principal Component Analysis

A
  • assume the data is centered (zero mean)
  • find the direction of maximum variance
  • leads to an eigenvalue problem: the direction of largest variance is the eigenvector of the scatter matrix with the largest eigenvalue
  • not robust to outliers
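The steps above can be sketched with NumPy: center the data, build the scatter matrix, take its eigendecomposition, and project onto the top eigenvectors. The synthetic data and component count are illustrative assumptions.

```python
import numpy as np

# Minimal PCA sketch via eigendecomposition of the scatter matrix.
# Rows of X are samples; the anisotropic scaling makes axis 0 the
# direction of largest variance by construction.
rng = np.random.default_rng(1)
X = rng.standard_normal((200, 3)) @ np.diag([3.0, 1.0, 0.1])

Xc = X - X.mean(axis=0)               # center the data
S = Xc.T @ Xc                         # scatter matrix
eigvals, eigvecs = np.linalg.eigh(S)  # eigh returns ascending eigenvalues
order = np.argsort(eigvals)[::-1]     # sort descending
W = eigvecs[:, order]                 # columns = principal directions
Z = Xc @ W[:, :2]                     # project onto the top 2 components
print(Z.shape)                        # (200, 2)
```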
6
Q

PCA applications

A
  • Dimensionality reduction
  • Eigenfaces
  • Denoising
7
Q

Power Iteration

A
  • Why: a full eigendecomposition of the scatter matrix is slow, and often only the first few principal components are of interest
  • Power Iteration Method: start with a random vector w, then repeat the update w <- Sw / ||Sw||
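A sketch of the update rule on a toy symmetric matrix: the normalized iterate converges to the eigenvector with the largest eigenvalue, provided that eigenvalue is strictly dominant. The tolerance, iteration cap, and diagonal test matrix are illustrative assumptions.

```python
import numpy as np

# Sketch of power iteration: w <- S w / ||S w|| converges to the
# dominant eigenvector of a symmetric matrix S (the first principal
# direction when S is the scatter matrix).
def power_iteration(S, n_iter=500, tol=1e-10, seed=0):
    rng = np.random.default_rng(seed)
    w = rng.standard_normal(S.shape[0])
    w /= np.linalg.norm(w)
    for _ in range(n_iter):
        w_new = S @ w
        w_new /= np.linalg.norm(w_new)     # keep the iterate unit length
        if np.linalg.norm(w_new - w) < tol:
            break
        w = w_new
    return w

S = np.diag([5.0, 2.0, 1.0])               # dominant eigenvector is ±e_0
w = power_iteration(S)
print(np.round(np.abs(w), 3))              # ≈ [1. 0. 0.]
```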