LEC 10 Correlation Flashcards

1
Q

Correlation definition (3)

A
  • the quantification of the degree to which 2 random variables (continuous or ordinal) are related
  • variables must be numerical
  • provided that the relationship is linear
    (check scatter plot to check for potential linear relationship)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Scatter plot

A
  • plot y against x

- useful for visually examining whether a relationship exists between 2 numerical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Correlation coefficient

A
  • quantitative measure of the strength and direction of a linear relationship between 2 variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Types of correlation coefficient (2)

& the type of data analysed

A
  1. Pearson product-moment correlation coefficient
    - continuous normally distributed variables
    - r
  2. Spearman rank correlation coefficient
    - continuous non-normally distributed variables
    - ordinal
    - less sensitive to outlying values as it uses rank > definite values
    - rs
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Are r and rs dimensionless?

A

Yes, no units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Range of possible r and rs values

A

-1 to 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does the sign of r and rs indicates?

A

The direction of the linear relationship between the 2 variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does the magnitude of r and rs indicates?

A

The strength of the linear relationship between the 2 variables

<0.5 : weak
0.5-0.7 : strong
>= 0.7 : very strong

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

If r/rs = 0?

A
  • means no linear correlation
  • DOES NOT mean no correlation cos it can be other non-linear correlation (eg curve)
  • check scatter plot for relationship
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

If r/rs = 1

A

Perfect positive correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

If r/rs = -1

A

Perfect negative correlation
OR
Perfect inverse correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

If r/rs > 0 (positive)

A
  • means positive correlation

- both variables tend to increase together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

If r/rs < 0 (negative)

A
  • means negative/inverse correlation

- one variable increases as the other decreases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Potential misuse of the correlation coefficient (3)

A
  1. If correlation coefficient = 0, does not mean no relationship, only mean no LINEAR relationship
  2. If strong correlation coefficient, does not necessarily imply “linearity” as some parts of the graph might be non-linear
  3. Does not imply causation (cause-and-effect relationship)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Statistical test for correlation if BOTH data is :

  • continuous
  • normally distributed
A

Pearson product-moment correlation

(parametric test)

To test the null hypothesis that there is no correlation between the 2 numerical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Pearson product-moment correlation assumptions (3)

A
  1. The x and y values are independent
  2. The pairs of observations are randomly selected
  3. For Pearson product-moment correlation, the underlying populations of BOTH variables are normally distributed
17
Q

Pearson product-moment correlation hypothesis

A

Ho :
- There is no correlation between the 2 variables

H1 :

  • There is a correlation between the 2 variables (two-tailed)
  • There is a positive correlation between the 2 variables (one-tailed)
  • There is a negative correlation between the 2 variables (one-tailed)
18
Q

Spearman Rank correlation assumptions (2)

A
  1. The x and y values are independent

2. The pairs observed are randomly selected

19
Q

Spearman Rank correlation process

A

Involves ranking the values of x and y

20
Q

Correlation coefficient and p-value

A

Correlation =/ p-value

21
Q

What if your data is :
- 1 continuous normally distributed
- 1 ordinal / continuous non-normally distributed
OR
- both are ordinal / continuous non-normally distributed
?

A

Use Spearman Rank correlation

22
Q

Correlation assumption (1)

A

Linear relationship between the 2 variables

23
Q

Can you perform correlation analysis if scatter plot shows non-linear relationship?

A

No.

Correlation analysis assumes linear relationship between the 2 variables.

24
Q

If statistical report shows p = 0.000

A

p < 0.0005

25
Q

Advantage of Spearman Rank Correlation

A

Less sensitive to outlying values as compared to Pearson Product Moment Correlation
- use ranks rather than the actual values