Dimensionanility Reduction and Matrix Decomposition Flashcards

1
Q

What is the aim of PCA?

A

To find a set of orthogonal principal components which capture the direction of maximum variance in the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the variance of X in the direction v?

A

vT XT X v

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Write down the optimisation problem of PCA and its Lagrange form

A

Check notes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the difference between PCA and Whitening?

A

Whitening normalises the variance along each principal component.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the pros and cons of Non-negative matrix factorization?

A

Pros
* Natural fit for positive-valued data
* Can be interpreted (meaningful signs)

Cons
* Only applicable to non-negative data
* Optimization procedure is non-convex; requires initialization
* “Interpretability” is unreliable
* Learned components are not orthogonal nor naturally ordered

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How does independent component analysis work? When is it most useful?

A

First whiten X and then find X=AS where the components in S are statistically independent.
Good for time series with lots of signals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the aim of t-SNE?

A

To match distributions of distances between points in original high dimensional space to points in a lower dimensional space.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What do Cross-decomposition methods do?

A

They find associations across multi-view data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is PLS? Write it as an optimisation problem and an e/val problem

A

PLS is a method for finding the directions of the maximum shared covariance between two paired views of the data.
Check notes for formulation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is CCA? Write it as an optimisation problem and a SVD problem

A

Canonical Correlation Analysis is a method for finding the directions of the maximum correlation between two paired views of the data.
Check notes for formulation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Can PLS and CCA be computed when p>N?

A

PLS can, CCA cannot (within-view cov matrix cannot be inverted)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Write down the optimisation problem related to regularised PLS and CCA

A

Check notes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly