Comparison of two Means Flashcards

1
Q

What is unpaired data?

A

Occurs when individual observations in one sample are independent of individual observations in the other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is paired data?

A

Occurs when the individual observations in the first sample are matched to individual observations in the second sample. (i.e. not independent)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a z-test?

A

If we can assume the distribution of mean differences is normal we can use a z-test to test the null hypothesis. The result tells us how many standard errors away from 0 is the sample mean difference.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Give an example of paired data?

A

Measurements taken from the same individual. I.e. repeated blood pressure measurements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How is paired quantitative data measured?

A

Calculate the difference between the the events (di).
Calculate the mean of the index of the differences
Carry out a hypothesis test on mean of di

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do you calculate a confidence interval for the sample mean difference and test the hypothesis that it is equal to zero (null hypothesis)

A

Must know the mean, standard deviation and standard error of the mean difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the null hypothesis?

A

The null hypothesis is the assertion that the mean population difference is zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How does a hypothesis test work?

A

If we can assume that the distribution of mean differences (sampling distribution) is normal. We can test the hypothesis using a z-test (when n> 30).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is a z-test performed?

A

z = (sample mean difference - hypothesised mean difference) / standard error of the mean difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the result of a z-test tell us?

A

How many standard errors away from zero the sample mean difference is/

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How does the analysis of two independent means differ from the analysis of paired data?

A

We look at the difference between two independent means rather than the mean difference of paired observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What can be said about the characteristics of the sampling distribution of the differences between the two independent means?

A
  1. The mean of the sampling distribution is the difference between the two population means
  2. The standard deviation of the sampling mean (standard error) depends on n1 and n2 (sample sizes)
  3. As n1 and n2 increase, the distribution becomes increasingly normal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do we calculate the standard error of the mean of two independent sample distributions?

A

The SE of the difference between two means is a combination of the standard errors of the two independent sample distributions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the square of the standard error of the mean?

A

The variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do we calculate the confidence interval for the difference between two means?

A

The difference between the means of both samples +/- level of confidence * SE (where the standard deviation of the samples may be used instead of the population standard deviations as this is unknown in most scenarios)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How does one obtain a P-value from a z-score?

A

Standard normal tables.

17
Q

In what scenario is it more prudent to use a t-test than a z-test?

A

When the sample size is small (conventionally n<30)

18
Q

What are degrees of freedom?

A

The number of values in the final calculation of a statistic that are free to vary. The number of independent ways by which a dynamic system can move, without violating any constraint imposed on it, is called number of degrees of freedom.

19
Q

How do you calculate the degrees of freedom?

A

n - (number of samples)

i.e. in a one sample test the degrees of freedom are equal to n-1. In two sample tests = (n1+n2)-2

20
Q

How do we calculate the variance in small samples?

A

One common variance is calculated using the data from two independent samples. It is a weighted average of the two sample variances.

21
Q

What assumptions are essential to make in using the z and t tests?

A
  1. The population is normally distributed

2. The standard deviations of the populations are known

22
Q

How can we test the assumption that the populations in a hypothesis test are normally distributed?

A

Using histograms or normal plots

23
Q

What specific assumption does the two sample t test make in regards to measures of spread?

A

It assumes that the variances of the two samples are equal

24
Q

What do we do if the assumptions of normality are not possilbe?

A
  1. Transformation (log or square root of data) to make it more approximately normal
  2. Non parametric methods (these do not make any assumptions)
25
Q

What special type of t test can be used when the variance of a two sample scenario are not equal?

A

Welch’s test

26
Q

What is the Wilcoxon Rank Sum Test?

A

A non-parametric test that corresponds to the two sample t test (equivalent to the Mann-Whitney test)

27
Q

How does one perform the Wilcoxon Rank Sum Test?

A
  1. Assume the two population distributions are the same
  2. Define hypothesis (H0 = the two distributions overlap and H1 = they are shifted)
  3. Rank all observations in ascending order (if the values are equal take an average)
  4. Calculate T as the sum of their ranks in the smaller sample.
  5. Compare T with tables of critical vaules for the test to obtain a P-vaule
28
Q

How do we interpret critical values for the Wilcoxon Rank Sum test?

A

If the T value is outside or at the limit of the critical value the value of P is less than the corresponding P-value of the aforementioned range. If it is inside the range then it is equivalent to P-value greater than that range P-value