L06 Correlations Flashcards

1
Q

Correlations - what?

A

Associations between variables

How closely the data points fall to the line of best fit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

4 measures of association

A
  1. Covariance
  2. Correlation:
    - Pearson’s coefficient of correlation
    - Variance accounted for (or explained)
  3. Spearman’s coefficient of rank correlation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Covariance - formula

A

Mean of the Product of Deviations
Cov(X,Y) = sum of (xi-xbar)*(yi-ybar) / n

Unit: of x*y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Covariance - features

A
  • Crude measure of correlation measured for calculating Pearson’s r
  • Parametric test
  • Unit depends on x and y: not standardized
  • +/- higher value means stronger correlation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

2 key tests of correlation

A
  1. Pearson’s coefficient of correlation

2. Spearman’s Rho Correlation Coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Pearson’s Coefficient of Correlation - when?

A

Linear relationship

2 continuous variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Spearman’s Rho Correlation Coefficient - when?

A

Correlation between ranked or ordinal data
Non-parametric; so no assumption of normality
No assumption of even spacing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Line of Best Fit

A

Added in a scatterplot
Line that best represents data
Correlation: how closely data points fall to this line
Regression: the characteristics or equation of the line
A good line of best fit - very small residuals for each point

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Correlation of magnitude 1? 0?

A

1- Perfect (+/-) relationship

0 - no linear relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Pearson’s correlation coefficient symbol

A

r

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Pearson’s correlation coefficient formula

A
r= Cov(X,Y) / sx sy
where Cov(X,Y) = Covariance of X and Y
 sx and sy = standard deviations of the variables

Unit: no unit; standardized
Independent of overall variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Pearson’s correlation coefficient features

A

Linear relationship between 2 continuous variables
r = covariance of the two divided by SDs
No unit; standardized
Because divided by SD, independent of overall variability (so it only shows how well-related the variables are and not how much they vary)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is R squared

A

Variance accounted for

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Variance accounted for

A

Amount of variance explained by the data
Expressed as % or a fraction
Rsquared - square of Pearson’s r

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does Rsquare close to 1 mean?

A

Variance accounted for near 1 -> almost all the variation in data is shared between variables/explained by data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Partial correlation

A

“Quantifies the relationship between two variables while accounting for the effects of a THIRD variable on BOTH variables in the original correlation”

The part of correlation that is not related to variable C

17
Q

Semi-partial correlation

A

“Quantifies the relationship between two variables while accounting for the effects of a third variable on only ONE of the variables in the original correlation”

18
Q

Variable C in a partial correlation is called

A

Variable that is partialized out or held constant

19
Q

Zero order correlation

A

Another term for a simple bivariate correlation

as opposed to partial correlations

20
Q

First or second order correlation

A

Partial correlation that partials out one/two variables

21
Q

Is significance of correlation equal to importance?

A

Correlation can be significant but weak.

Strength and significance of correlation are not linked,

22
Q

Why do we use correlations?

A
  1. May highlight possible causal relations (though doesn’t imply it)’
  2. Null effects / absence of correlations allow us to discount some theories
  3. presence: correlation -> investigate why: experiment
23
Q

Biserial correlation

A

For when a dichotomy has an underlying continuum
One variable is categorical, other continuous
r is called a “point-biserial r”

24
Q

Small, medium and large correlation values

A

+/- 0.1 small effect
+/- 0.3 medium effect
+/- 0.5 large effect

25
Q

Correlations assumptions

A
  1. Continuous level of measurement: ratio/interval
  2. Linear relationship between variables
  3. No outliers
  4. For small sample, normal population