Week 7 Flashcards

Principal Comp0onent Analysis

1
Q

What is Principal Component Analysis (PCA)?

A

Primarily used for Dimensionality Reduction.
Feature Selection.

Main Idea: Project given data onto a lower dimensional subspace such that
1. Reconstruction error is minimized
2. Variance of the projected data is maximised.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How does PCA work?

A

Fix a subspace and then find the best projection. (initial)
There is also an optimal subspace (later)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does PCA work?

A

If we have too many features, we may not be able to analyse using every feature. PCA combines features in a smart way and produces new features (aka Principal Components PC) which will reduce the loss of info.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the properties of principal Components (PCs)?

A

PCs are ordered, The first PC has higher weightage than the next. and so on. We may stop at the desired level of variance. Ideally, we want to get around 90% variance with just 2 to 3 PCs so that enough information is retained while we can still visualize our data on a plot.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

how are PCs determined? What are loadings?

A

Loadings indicate the contribution of the variables to each PC . Note: each variable will get

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are real-symmetric matrices?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly