RESS I: Data Analysis #3 Flashcards

1
Q

What is correlation?

A

Correlation measures the strength of the linear relationship between two numerical variables.

2
Q

When is the Pearson Correlation Coefficient used? When is Spearman’s Rank Correlation Test used?

A

Pearson’s: To measure the linear association between two normally distributed variables.

Spearman’s: When the variables are not normally distributed or strong outliers are present. Because it works on ranks, Spearman’s correlation is less sensitive to strong outliers than Pearson’s correlation.
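
A minimal sketch of the outlier point above, using only the Python standard library. The data and the helper names (pearson_r, ranks, spearman_rho) are made up for illustration; Spearman's coefficient is computed here as Pearson's coefficient applied to the ranks.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def ranks(values):
    """Rank values from 1 to n; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg_rank
        i = j + 1
    return result

def spearman_rho(x, y):
    """Spearman's rank correlation: Pearson's r computed on the ranks."""
    return pearson_r(ranks(x), ranks(y))

# Hypothetical data: y always rises with x, but the last point is an extreme outlier.
x = [1, 2, 3, 4, 5, 6]
y = [2, 4, 5, 7, 9, 100]

print("Pearson r:   ", round(pearson_r(x, y), 3))
print("Spearman rho:", round(spearman_rho(x, y), 3))
```

With this data the ranks of x and y agree exactly, so Spearman's coefficient is 1, while the single outlier pulls Pearson's r well below 1.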

3
Q

What are the requirements of the Pearson Correlation Coefficient?

A
  • The relationship must be linear
  • The variables must be normally distributed
  • The direction of the relationship will be positive or negative
  • The strength of the relationship will be from low to high (absolute value of r from 0 to 1)
4
Q

How do you interpret ‘r’, the Pearson Correlation Coefficient?

A

If r > 0 we have a positive correlation, implying that as one variable increases so does the other.

If r < 0 we have a negative correlation, implying that as one variable increases the other decreases.

If r = 0 we have no correlation, implying there is no linear association between the two variables.

If r = +1 there is a perfect positive correlation.

If r = -1 there is a perfect negative correlation.

5
Q

When should you not use correlation tests?

A
  1. When the relationship is non-linear
  2. When outliers are present
  3. When there are distinct sub-groups, e.g. healthy controls and diseased cases
6
Q

How would you describe the association between one numerical and one categorical variable?

A

Mean (or median) difference measures the strength of the relationship between one numerical variable and one categorical variable.
This can then be demonstrated in a comparative box plot or histogram.

7
Q

What statistical association tests can be done between one numerical and one categorical variable?

A
  • Independent samples t-test (also called the two-sample t-test)
  • Mann-Whitney U test

8
Q

What is the t-test?

A

This test measures the association between one normally distributed numerical variable and one binary variable
– Difference between two means
– Direction of relationship positive or negative compared to control
– Strength/magnitude of relationship low to high (0 to infinity)…in units of the continuous variable.

9
Q

What are the assumptions of the t-test?

A
  1. Two independent groups
  2. Numerical variable is Normally distributed in both groups
  3. Similar standard deviations in both groups
10
Q

What is the t-statistic?

A

The mean difference divided by the standard error of the mean difference.
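
As a worked illustration of that ratio, here is a small sketch using only the Python standard library. It assumes the pooled-variance form of the independent samples t-test (that is, similar standard deviations in both groups); the function name and the data are hypothetical.

```python
from math import sqrt
from statistics import mean, variance

def pooled_t_statistic(group_a, group_b):
    """t = (mean difference) / (standard error of the mean difference).

    Assumes two independent groups with similar standard deviations,
    so the two sample variances are pooled into a single estimate.
    """
    n_a, n_b = len(group_a), len(group_b)
    mean_diff = mean(group_a) - mean(group_b)
    pooled_var = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    se_diff = sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return mean_diff / se_diff

# Hypothetical measurements for two independent groups.
treated = [5.1, 4.9, 5.6, 5.8, 6.0]
control = [4.2, 4.8, 4.5, 5.0, 4.4]

t = pooled_t_statistic(treated, control)
print("t statistic:", round(t, 2))
```

The sign of t gives the direction of the difference relative to the second group, and swapping the groups flips the sign.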

11
Q

How do you describe the association between two categorical variables?

A

Proportion difference measures the strength of the relationship between two categorical variables.

12
Q

What tests are done to compare two categorical variables?

A
  • Chi-squared test (standard)
  • Chi-squared test (continuity/Yates’ correction)
  • Fisher’s exact test
13
Q

What is the Chi-squared test used for?

A

To test whether there is a statistically significant difference between the observed and expected frequencies.
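
A minimal sketch of the observed-versus-expected comparison for a 2x2 table, in plain Python. The counts and function names are hypothetical; expected counts are taken as (row total x column total) / grand total, which is what they would be if the two variables were unrelated.

```python
def expected_counts(table):
    """Expected cell counts under no association between rows and columns."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    return [[r * c / total for c in col_totals] for r in row_totals]

def chi_squared(table):
    """Chi-squared statistic: sum over cells of (observed - expected)^2 / expected."""
    expected = expected_counts(table)
    return sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )

# Hypothetical 2x2 table: rows are exposure groups, columns are outcome yes/no.
observed = [[30, 20],
            [10, 40]]

print("Expected counts:", expected_counts(observed))
print("Chi-squared:", round(chi_squared(observed), 2))
```

The larger the statistic, the further the observed counts sit from what would be expected with no association. Turning it into a p-value needs the chi-squared distribution with 1 degree of freedom for a 2x2 table, which this sketch leaves out.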

14
Q

What is the continuity correction (Yates’ correction)?

A

For small sample sizes the chi-squared test is too likely to reject the null hypothesis. A continuity correction can be made to allow for this.
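
To make the correction concrete, the sketch below computes the chi-squared statistic for a small 2x2 table with and without Yates' correction, which subtracts 0.5 from each absolute difference between observed and expected counts before squaring. The table, the function name and the `correction` flag are assumptions made for illustration.

```python
def chi_squared_2x2(table, correction=False):
    """Chi-squared statistic for a 2x2 table, optionally with Yates' correction."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            diff = abs(observed - expected)
            if correction:
                # Yates: shrink each |observed - expected| by 0.5, never below 0.
                diff = max(0.0, diff - 0.5)
            statistic += diff ** 2 / expected
    return statistic

# Hypothetical small-sample table (20 observations in total).
small_table = [[8, 2],
               [3, 7]]

print("Uncorrected:", round(chi_squared_2x2(small_table), 2))
print("With Yates: ", round(chi_squared_2x2(small_table, correction=True), 2))
```

The corrected statistic is smaller, so the test becomes less likely to reject the null hypothesis.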

15
Q

Summarise statistical tests for different variable relationships.

A

Two continuous = correlations
One continuous, one binary = t-tests
Two binary = chi-squared tests
