Week 6: Correlation Flashcards

1
Q

What does correlation measure in comparison to linear regression?

A

Linear regression estimates the best fit of a straight line. While correlation measures the strength of the linear association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the range of the correlation coefficient (r)?

A

-1 to 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does r = 0 signify in correlation?

A

There is no linear correlation between the variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How are strength levels of correlation categorised?

A
  • Very strong ± 0.90 to ± 1
  • Strong ± 0.70 to ± 0.89
  • Moderate ± 0.40 to ± 0.69
  • Weak ± 0.10 to ± 0.39
  • No or very weak ± 0 to ± 0.09
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does the coefficient of determination (R2) indicate?

A

The amount of variance shared between two variables. By multiplying R2, we can get a percentage of the variability explained by the predictor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How is R2 calculated?

A

By squaring the coefficient (r)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is H0 in correlation hypothesis testing?

A

There is no correlation between the variables (r = 0)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Key assumptions for Pearson correlation

A
  1. Random sample
  2. Continuous data
  3. Paired sample data
  4. Independence of observations
  5. Approximate normal distribution
  6. Linear association
  7. Absence of outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What alternatives exist if Pearson correlation assumptions are violated?

A

Use non-parametric methods like Spearman Rank correlation or Kendall’s tau

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How does Spearman Rank correlation differ from Pearson correlation?

A

It assesses monotonic relationships (whether linear or not), ranking values instead of using the original measurements - used if data are measured on an ordinal scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why is visual inspection of data important before calculating correlation?

A

To confirm linearity and check for the presence of outliers - if there are outliers, can we justify removing them?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why is correlation not equivalent to causation?

A

Correlation only indicates a statistical relationship, not a cause-effect link

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does a strong positive Pearson correlation coefficient (e.g., r = 0.76) suggest?

A

A strong positive linear relationship between two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How should missing data in paired samples be handled when calculating r?

A

Use complete case analysis (only cases with data on x and y considered), but be aware of potential biases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What the main visual check to perform before calculating Pearson correlation?

A

Ensure the data appears linear in a scatter plot?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

When is Spearman’s rank correlation preferable over Pearson?

A

When data are not normally distributed or the relationship is monotonic but not linear

17
Q

What value is sufficient for hypothesis testing?

A

The estimate of r is often enough

18
Q

Name characteristic of correlation

A
  • Independent on units of measure
  • Symmetric (e.g., f for weight and plasma ~ r for plasma and weight)
  • Independent on scale (e.g., r between body weight in kg and plasma volume in litres is the same as r between body weight in g and plasma volume in ml)
  • If r = 0, there is no linear relation but could still be other relationships