3. Investigating Associations Flashcards

1
Q

Investigating Associations

A

on 2 measurements of a single sample, two different things can be investigated: whether the mean is different and whether the two measurements are associated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Scatter Plot

A

A Scatter plot is the most obvious way to compare data by plotting one against the other. This is easier to investigate an association between measurements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Limitations to visual examination

A

an apparent association can be obtained just by chance by choosing points which seem to follow a straight line.

In order to determine whether an association is likely to be real, a statistical test needs to be carried out.

  1. Work out the correlation coefficient.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Correlation Coefficient (r)

A

This is a single measurement describing the strength and direction of an association.

  • 1 -> Perfect negative association
    + 1 -> perfect positive association
    0 -> no association

The further r is from 0, the larger the sample size, the less likely is the association due to chance.

Calculation:

  1. Calculate the means of x and y
  2. Add up the sum of all and divide by a scaling factor.

zy)i = (yi – ȳ) / s y -> calculate a standardized value for each yi. Add the products from the last step together
n - 1 -> n is the total number of points.

Calculating the Correlation Coefficient
1) Calculate the means of x and y, ẋ and ẏ.
2) Add up the sum of all 
(x –ẋ)(y –ẏ) 
and divide by a scaling factor 
√[(x - ẋ)2(y -ẏ)2]
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why does calculating the correlation coefficient work?

A

positive association:

all points will be either above and to the right of the centre of the distribution, or below to the left.

negative association:

all points will be either above and the the left of the centre of the distribution or below to the right.

no association:

points will be randomly scattered all around the centre of the distribution.

Positive and negative values of (x – ẋ)(y – ẏ) will cancel each other out.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Carrying out a test

A
  1. ) Construct a null hypothesis
  2. ) Calculate the correlation coefficient (r)
  3. ) Calculate probability of r being this big just by chance if there is no real association.
  4. ) If the probability if >0.05 there is a greater than 5% probability of this happening by chance. => no evidence to reject null hypothesis

If the probability is <0.05 there is a less than 5% probability of this happening by chance => evidence to reject null hypothesis - can say that two measurements are significantly correlated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Cautions about using correlations

A

Correlation analysis cannot be used if one measurement is independent, meaning that it could not be altered by the other.

Example:

a) measuring cell mass at different times
b) If one variable is controlled by myself in an experiment - e.g. measuring the metabolic rate of cells at certain temperatures

If there is a correlation, it does not necessarily imply a casual relationship between the two sets of measurements. E.g. heart rate doesn’t control blood pressure.

The size of the correlation coefficient doesn’t reflect the slope of the relationship, just its closeness.

Correlation is only useful for linear relationships, so always look at graphs of the results first.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly