RESS I: Data Analysis #3 Flashcards

Question 1

Q

What is correlation?

Answer

A

Correlation measures the strength off the linear relationship between two numerical variables?

Question 2

Q

When is the Pearson Correlation Coefficient calculated? Where is the Spearman’s Rank Correlation Test done?

Answer

A

Pearsons: To find the association between two normally-distributed variables.

Spearman’s: When the variables are not normally distributed as Spearman correlation is less sensitive than Pearsons correlation to strong outliers.

Question 3

Q

What is are the requirements of the Pearson Correlation Coefficient?

Answer

A

The relationship must be linear
The variables must be normally distributed
The direction of the relationship will be positive or negative
The strength fo the relationship will be from low to hight (0 to 1)

Question 4

Q

How do you interpret ‘r’, the Pearsons Correlation Coefficient?

Answer

A

Ifr> 0 we have a positive correlation; implying that as one variable increases then so does the other.

Ifr< 0 we have a negative correlation; implying that as one variable increases then the other decreases.

Ifr= 0 we have no correlation; implying there is no association between the two variables.

If r=+1 there is a perfect positive correlation.

If r=-1 there is a perfect negative correlation.

Question 5

Q

When should you not use correlation tests?

Answer

A

When the relationship is non-linear
When there is the presence of outliers
There are distinct sub-groups e.g. health controls with diseased cases

Question 6

Q

How would you describe the association between one numerical and one categorical variable?

Answer

A

Mean (or median) difference measures the strength of the relationship between one numerical variable and one categorical variable.
This can then be demonstrated in a comparative box plot or histogram.

Question 7

Q

What statistical association tests can be done between one numerical and one categorical variable?

Answer

A

Independent samples t-test (also two-sample t-test)

- Mann-Whitney-U test

Question 8

Q

What is the t-test?

Answer

A

This test measure the association between one normally-distributed variable and one binary variable
– Difference between two means
– Direction of relationship positive or negative compared to control
– Strength/magnitude of relationship low to high (0 to infinity)…in units of the continuous variable.

Question 9

Q

What are the assumptions of the t-test/

Answer

A

Two independent groups
Numerical variable is Normally distributed in both groups
Similar standard deviations in both groups

Question 10

Q

What is the t-statistic?

Answer

A

The mean difference/standard error of mean difference

Question 11

Q

How do you describe the association between two categorical variables?

Answer

A

Proportion difference measures the strength of the relationship between two categorical variables.

Question 12

Q

What tests are done to compare two categorical variables?

Answer

A

Chi-squared test (standard)
Chi-squared test (continuity/Yates’ correction)
Fisher’s exact test

Question 13

Q

What is the Chi-squared test used for?

Answer

A

To test to see if there is a statistical difference between the observed and expected result.

Question 14

Q

What is the continuity correction (Yates’ correction)?

Answer

A

For small sample sizes the chi-squared test is too likely to reject the null hypothesis. A continuity correction can be made to allow for this.

Question 15

Q

Summarise statistical tests for different variable relationships.

Answer

A

Two continuous = correlations
One continuous, one binary = t-tests
Two binary = chi-squared tests

RESS I: Data Analysis #3 Flashcards

(15 cards)