W10 - MULTIVARIATE DATA Flashcards

1
Q

What kind of variables could we used in a multivariate data analysis

A

Dependent or independent variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What defines a multivariate dataset

A

p > 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a 11 x 5 dimensional data? Give an example

A

11 objects and 5 variables
Eg. 11 species of plants and 5 leaf morphology variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What do the matrix values tell us

A

It defines the position of n points in p dimensional space
(N = Objects, P = Variables)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What three parameters does a cross product and sum square matrices describes?

A

Means (centroid)
Variance
Covariance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define data variation and how it is quantified

A

Data variation explains the amount of different values we can observe for an object as they do not always have the same values
It is quantified by the sum of squares

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define data association and how it is quantified

A

Variables collected for the same object share patterns of deviation from the mean
It is quantified by the sum of cross-product or covariance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How does a covariance plot differ from regression plot?

A

It is not a run over rise plot - we are interested in the shared variation between variables.
Either variable can be plotted on the horizontal axis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Name the following matrices of covariance
(x1 x2)
(x3 x4)

A

(var1 cov1,2)
cov2,1 var2)
OR
(sum sqr sum cross)
sum cross sum sqr)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the three parameters in the sum of cross-product tell us when a cloud is plotted?

A

Mean tells us the position
Variance tell us the size
covariance tells us the shape

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the trace?

A

sum of variances in a variance-covariance matrix.
It is the sum of diagonal elements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why is trace useful?

A

Helps us to understand the distribution of variance across multiple dimensions when dealing with high dimensional datasets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the numeral range of corelation coefficients?

A

Ranges from -1 to +1 with 0 being no corelations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the coefficient of determination?

A

Corelation squared = % of variation shared between variables
Interpreted the same was as Rsqr in regression analysis
For example a value of 0.83 means 83 percent of a variable is explained by the other variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly