Comparison of two Means Flashcards
What is unpaired data?
Occurs when individual observations in one sample are independent of individual observations in the other
What is paired data?
Occurs when the individual observations in the first sample are matched to individual observations in the second sample. (i.e. not independent)
What is a z-test?
If we can assume the distribution of mean differences is normal we can use a z-test to test the null hypothesis. The result tells us how many standard errors away from 0 is the sample mean difference.
Give an example of paired data?
Measurements taken from the same individual. I.e. repeated blood pressure measurements.
How is paired quantitative data measured?
Calculate the difference between the the events (di).
Calculate the mean of the index of the differences
Carry out a hypothesis test on mean of di
How do you calculate a confidence interval for the sample mean difference and test the hypothesis that it is equal to zero (null hypothesis)
Must know the mean, standard deviation and standard error of the mean difference
What is the null hypothesis?
The null hypothesis is the assertion that the mean population difference is zero
How does a hypothesis test work?
If we can assume that the distribution of mean differences (sampling distribution) is normal. We can test the hypothesis using a z-test (when n> 30).
How is a z-test performed?
z = (sample mean difference - hypothesised mean difference) / standard error of the mean difference
What does the result of a z-test tell us?
How many standard errors away from zero the sample mean difference is/
How does the analysis of two independent means differ from the analysis of paired data?
We look at the difference between two independent means rather than the mean difference of paired observations
What can be said about the characteristics of the sampling distribution of the differences between the two independent means?
- The mean of the sampling distribution is the difference between the two population means
- The standard deviation of the sampling mean (standard error) depends on n1 and n2 (sample sizes)
- As n1 and n2 increase, the distribution becomes increasingly normal
How do we calculate the standard error of the mean of two independent sample distributions?
The SE of the difference between two means is a combination of the standard errors of the two independent sample distributions
What is the square of the standard error of the mean?
The variance
How do we calculate the confidence interval for the difference between two means?
The difference between the means of both samples +/- level of confidence * SE (where the standard deviation of the samples may be used instead of the population standard deviations as this is unknown in most scenarios)