Principal Components Analysis Flashcards

Question 1

Q

How do we deal with high dimensionality (3)?

Answer

A

Use domain knowledge
- Feature engineering (e.g. color histograms for object detection)
Make assumptions
- Independence
- Smoothness
- Symmetry
Reduce dimensionality

Question 2

Q

What are the two methods for reducing dimensionality?

Answer

A

Feature selection
Feature extraction

Question 3

Q

What is feature selection?

Answer

A

Choosing a subset of the original features (e.g. those with the highest information gain)
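
A minimal sketch of selecting features by information gain, assuming discrete features and class labels. The toy data and the helper names (entropy, information_gain, select_top_k) are hypothetical and only illustrate the idea.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(feature_values, labels):
    """Entropy reduction obtained by splitting the labels on a discrete feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder

def select_top_k(features, labels, k):
    """Return the names of the k features with the highest information gain."""
    ranked = sorted(features,
                    key=lambda name: information_gain(features[name], labels),
                    reverse=True)
    return ranked[:k]

# Hypothetical toy data: "outlook" separates the classes, "windy" does not.
features = {
    "outlook": ["sun", "sun", "rain", "rain"],
    "windy": ["yes", "no", "yes", "no"],
}
labels = ["play", "play", "stay", "stay"]
selected = select_top_k(features, labels, k=1)
```

The selected features are original columns, unchanged; nothing new is constructed.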

Question 4

Q

What is feature extraction?

Answer

A

Construct a new set of dimensions, each a linear combination of the original features

Question 5

Q

What does PCA try to preserve?

Answer

A

The structure (variance) in the data

Question 6

Q

What are principal components?

Answer

A

The eigenvectors of the covariance matrix with the largest eigenvalues

Question 7

Q

What happens when you multiply a random vector with the covariance matrix?

Answer

A

It turns toward the direction of greatest variance (repeated multiplication converges to that direction)
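
A small sketch of this effect, assuming a hypothetical 2x2 covariance matrix whose direction of greatest variance is (1, 1)/sqrt(2). The vector is renormalized after each multiplication so only its direction is tracked. A start vector exactly orthogonal to that direction would not turn, which a random vector almost never is.

```python
import math

def mat_vec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * x for a, x in zip(row, vector)) for row in matrix]

def normalize(vector):
    """Scale a vector to unit length."""
    length = math.sqrt(sum(x * x for x in vector))
    return [x / length for x in vector]

# Hypothetical covariance matrix (eigenvalues 3 and 1).
covariance = [[2.0, 1.0],
              [1.0, 2.0]]

vector = normalize([1.0, 0.0])  # arbitrary starting direction
for _ in range(50):
    # Each multiplication rotates the vector toward the top eigenvector.
    vector = normalize(mat_vec(covariance, vector))
```

After the loop, vector is very close to (1, 1)/sqrt(2).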

Question 8

Q

What is an eigenvector?

Answer

A

A nonzero vector whose direction does not change when it is multiplied by the matrix; only its magnitude changes

Question 9

Q

What is an eigenvalue?

Answer

A

The scalar by which an eigenvector is scaled when multiplied by the matrix

Question 10

Q

How do you find eigenvalues?

Answer

A

Solve the characteristic equation det(A - lambda I) = 0 for lambda

Question 11

Q

What is the determinant of a 2x2 matrix?

Answer

A

For the matrix with rows (a, b) and (c, d): ad - bc

Question 12

Q

How do you find eigenvectors (given the eigenvalues)?

Answer

A

For each eigenvalue lambda, solve (A - lambda I)e = 0 for e

Question 13

Q

Which eigenvectors do we pick for principal components?

Answer

A

Unit-length eigenvectors
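
The eigenvalue, determinant and eigenvector cards can be tied together in one sketch for the 2x2 case. It assumes a symmetric matrix (as a covariance matrix is), so the roots are real; the example matrix and function names are hypothetical.

```python
import math

def det2(m):
    """Determinant of a 2x2 matrix [[a, b], [c, d]]: ad - bc."""
    (a, b), (c, d) = m
    return a * d - b * c

def eigenvalues_2x2(m):
    """Roots of det(M - lambda*I) = lambda^2 - trace*lambda + det = 0, largest first.
    Assumes the roots are real, which holds for symmetric matrices."""
    (a, b), (c, d) = m
    trace = a + d
    disc = math.sqrt(trace * trace - 4.0 * det2(m))
    return [(trace + disc) / 2.0, (trace - disc) / 2.0]

def unit_eigenvector(m, lam):
    """Unit-length solution e of (M - lambda*I) e = 0 for a symmetric 2x2 matrix."""
    (a, b), (c, d) = m
    if b != 0.0:
        e = [b, lam - a]   # satisfies (a - lam) * x + b * y = 0
    elif lam == a:
        e = [1.0, 0.0]     # diagonal matrix: eigenvectors are the axes
    else:
        e = [0.0, 1.0]
    length = math.hypot(e[0], e[1])
    return [e[0] / length, e[1] / length]

# Hypothetical covariance matrix.
covariance = [[2.0, 1.0],
              [1.0, 2.0]]
values = eigenvalues_2x2(covariance)
components = [unit_eigenvector(covariance, lam) for lam in values]
```

Any nonzero multiple of an eigenvector is also an eigenvector, so dividing by the length picks one representative per direction (up to sign).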

Question 14

Q

How do you project a coordinate x’ given eigenvectors e_1, …, e_m?

Answer

A

(x’ - mu)^T e_j for j = 1…m, where mu is the mean of the data
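
A minimal sketch of this projection, assuming the eigenvectors are given as unit-length lists and mu is the mean of the training data. The numbers are hypothetical.

```python
def project(point, mean, eigenvectors):
    """Return the m new coordinates (point - mean)^T e_j, one per eigenvector."""
    centered = [x - mu for x, mu in zip(point, mean)]
    return [sum(c * e for c, e in zip(centered, vec)) for vec in eigenvectors]

# Hypothetical mean and orthonormal eigenvectors in 2-D.
mean = [1.0, 2.0]
eigenvectors = [[0.6, 0.8],
                [-0.8, 0.6]]

point = [4.0, 6.0]
coords = project(point, mean, eigenvectors)          # both components
reduced = project(point, mean, eigenvectors[:1])     # keep only the first component
```

Here the centered point is (3, 4), which lies along the first eigenvector, so its first new coordinate is about 5 and its second is about 0.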

Question 15

Q

What property does the eigenvector for a principal component have?

Answer

A

It points in the direction along which the data is spread out the most

Question 16

Q

How do we pick the number of components to use (PCA)?

Answer

A

Pick the first m components that together explain some threshold fraction of the total variance
Use a scree plot (eigenvalues plotted in decreasing order; look for the elbow)
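
The threshold rule can be sketched as follows, assuming the eigenvalues of the covariance matrix are already known. The function name and the eigenvalues are hypothetical.

```python
def choose_num_components(eigenvalues, threshold=0.95):
    """Smallest m whose top-m eigenvalues reach the given fraction of total variance."""
    ordered = sorted(eigenvalues, reverse=True)
    total = sum(ordered)
    running = 0.0
    for m, lam in enumerate(ordered, start=1):
        running += lam
        if running / total >= threshold:
            return m
    return len(ordered)  # guard against rounding when threshold is 1.0

# Hypothetical eigenvalues; cumulative fractions are 0.5, 0.8, 0.95, 1.0.
eigenvalues = [5.0, 3.0, 1.5, 0.5]
m_90 = choose_num_components(eigenvalues, threshold=0.9)
m_75 = choose_num_components(eigenvalues, threshold=0.75)
```

With these values a 0.9 threshold keeps three components and a 0.75 threshold keeps two.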

Question 17

Q

What are typical variance threshold values (PCA)?

Answer

A

0.9/0.95

Question 18

Q

How do you compute what proportion of the variance m principal components explain (given there are d dimensions)?

Answer

A

(lambda_1 + … + lambda_m) / (lambda_1 + … + lambda_d), where lambda_i is the eigenvalue of the i-th principal component
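
Written out, the proportion is the sum of the m largest eigenvalues divided by the sum of all d eigenvalues. The worked numbers below are hypothetical and only illustrate the arithmetic.

```latex
\frac{\sum_{i=1}^{m} \lambda_i}{\sum_{i=1}^{d} \lambda_i}
\qquad \text{e.g. } d = 4,\ (\lambda_1, \lambda_2, \lambda_3, \lambda_4) = (5,\ 3,\ 1.5,\ 0.5),\ m = 2:
\quad \frac{5 + 3}{5 + 3 + 1.5 + 0.5} = \frac{8}{10} = 0.8
```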

Question 19

Q

What do we do before finding principal components (PCA)?

Answer

A

Center points (subtract mean)
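
Centering can be sketched like this, assuming the data is a list of equal-length points. The function name and the points are hypothetical.

```python
def center(points):
    """Subtract the per-dimension mean from every point; also return the mean."""
    n = len(points)
    dims = len(points[0])
    mean = [sum(p[j] for p in points) / n for j in range(dims)]
    centered = [[p[j] - mean[j] for j in range(dims)] for p in points]
    return centered, mean

# Hypothetical 2-D data set.
points = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]
centered, mean = center(points)
```

The mean is kept because new points must be shifted by the same amount before they are projected.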

Question 20

Q

What is the advantage of using eigenfaces for similarity?

Answer

A

Insensitive to lighting, expression, orientation

Question 21

Q

What are the practical issues with PCA?

Answer

A

Sensitive to scale (an attribute with large values has large variance, so it is always picked as the 1st component)
Always a linear projection (onto a line/hyperplane)
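
The scale issue can be seen in a small sketch. Standardizing each attribute to unit variance before PCA is one common remedy; it is an assumption here, not something these cards prescribe, and the data is hypothetical.

```python
import math

def variance(values):
    """Sample variance of a list of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / (len(values) - 1)

def standardize(values):
    """Rescale values to zero mean and unit variance (z-scores)."""
    mean = sum(values) / len(values)
    std = math.sqrt(variance(values))
    return [(v - mean) / std for v in values]

# Hypothetical attributes measured in very different units.
height_m = [1.6, 1.7, 1.8, 1.9]
income = [30000.0, 45000.0, 52000.0, 61000.0]

raw_variances = (variance(height_m), variance(income))
scaled_variances = (variance(standardize(height_m)), variance(standardize(income)))
```

Before scaling, income has by far the larger variance and would dominate the first component; after scaling, both attributes have variance 1.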

(21 cards)