W11 - PCA - MULTIVARIATE Flashcards

1
Q

What does PCA stand for and what is it used for

A

Principal Component Analysis
It describes the variation in multivariate data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the benefits of using PCA’s

A

Allows for the descriptions of p >2 data clouds in fewer dimensions (reduces complexity)
Find biological interesting interactions between variables
Defines new variables free from multicollinearity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

In a PCA, what happens to the axis of the graph

A

They become vectors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is an eingenvector

A

The longest vector in space taken from a multivariate dataset
It accounts for the vector with the most variance in the dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Are eigenvectors correlated to one another?

A

no

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do we graph eigenvectors

A

Eigenvectors are unstructured data and we consider both variables as responses (plot variables on Y and X axis)
No biological reason to assume one variable causes change on the other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Describe eigenvalue

A

Eigenvalue = length
variance in direction described by the corresponding eigenvector

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Describe eigenvector

A

Eigenvector = direction
Coefficients or loading of measured variables to describe the direction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the assumptions of eigenanalysis

A

Best fitted for linear relationships between variables
Does not assume multivariate normality
Is influenced by outlier
Sensitive to sampling error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly