Hypothesis testing for two means (independent samples, equal variances) Flashcards
what features does two samples have?
two samples are unrelated and drawn randomly from a different populations (independent)
eg. men and women, left and right handed people
samples are linked in some way (dependent/paired) mix up order and checking if it makes sense (before and after weights)
eg. two questions are asked of the same person,
independent- population standard deviations are known
1. Hypothesis
- Test Statistic
- Decision Test
- Conclusion
H0: μ1= μ2
HA: μ1≠ μ2
independent- population standard deviations are known
- Hypothesis
- Test Statistic
- Decision Test
4. Conclusion
in terms of whether sufficient evidence to reject null hypothesis.
independent- population standard deviations are known
- Hypothesis
- Test Statistic
3. Decision Test
- Conclusion
Compare to a standard normal distribution for decision rule.
independent- population standard deviations are known
- Hypothesis
2. Test Statistic
- Decision Test
- Conclusion

How to determine the confidence interval for two samples that are independent and their population standard deviations are known
100(1-α)% Confidence interval

what is the problem with the previous formulas for two indepedent samples where the population standard deviation is known?
Population variances are almost always unknown
Usually we have to estimate the population variance using the sample variance
<!--StartFragment-->
Independent – population standard deviations unknown<!--EndFragment-->
what does the test depend on?
Test to use depends on whether the population variances of the two different populations can be assumed to be equal or not.
If variances are equal, test can be performed by hand
If variances are not equal, test is complicated – we will rely on software (Minitab)
how to test if the two samples have equal population variances?
1.Hypotheses
- Test Statistic
- Decision Rule
- Conclusion
H0: σ1²=σ2² - that is, both populations have the same variance
HA: σ1²≠σ2² - that is, the variances of the populations are different
Alternative is (almost) always “≠”
how to test if the two samples have equal population variances?
- Hypotheses
- Test Statistic
* *3.Decision Rule** - Conclusion
Compare to an F distribution with numerator df = n1-1; denominator df=n2-1 (see Table 5 in text, pages 559-562)
F-distribution: Requires two degrees of freedom
Tables give upper cut-off point for α=0.05/0.025/0.01/0.005
two-sided test; look up table for α/2
E.g. for α=0.05, need to look up tables for α=0.025
how to test if the two samples have equal population variances?
- Hypotheses
* *2.Test Statistic** - Decision Rule
- Conclusion
General rule – put larger sample variance on top
This makes it easier to use the F tables

how to test if the two samples have equal population variances?
- Hypotheses
- Test Statistic
- Decision Rule
* *4.Conclusion**
EITHER
There is insufficient evidence to reject the null hypothesis, and we can assume the population variances are equal.
OR
There is sufficient evidence to reject the null hypothesis. We cannot assume the population variances are equal.

<!--StartFragment-->
If the two population variances are equal, we can test for equality <!--EndFragment-->
_1. Hypotheses _
- Test Statisic
- Decision Rule
- Conclusion
H0: μ1= μ2
HA: μ1≠μ2
<!--StartFragment-->
If the two population variances are equal, we can test for equality <!--EndFragment-->
- Hypotheses
2. Test Statisic : why pool the sample variances?
- Decision Rule
- Conclusion
If both populations have the same variance, then each sample is providing a different estimate of the same thing.
So, rather than choose one variance to use, we use a weighted average of both sample variances to estimate the (common) population variance.
Weighting is done according to sample size.
If the two population variances are equal, we can test for equality
- Hypotheses
2. Test Statisic
- Decision Rule
- Conclusion

<!--StartFragment-->
If the two population variances are equal, we can test for equality <!--EndFragment-->
- Hypotheses
- Test Statisic
3. Decision Rule
4. Conclusion
Decision Rule: Compare to a t-distribution with n1+n2-2 df (same as denominator in pooled variance formula)
Conclusion: In terms of whether evidence is sufficient to reject null hypothesis.
<!--StartFragment-->
If the two population variances are equal, we can test for equality <!--EndFragment-->
- Hypotheses
2. Test Statisic: what is the purpose of a pooled variance?
- Decision Rule
- Conclusion
Test statistic uses pooled variance to estimate standard error of the difference between the means
<!--StartFragment-->
If the two population variances are equal, we can test for equality <!--EndFragment-->
- Hypotheses
2. Test Statisic: what often happens to population mean?
- Decision Rule
- Conclusion
In general, if testing for equality of means, then μ1= μ2, so the μ1- μ2 term drops out (=0)
<!--StartFragment-->
Confidence interval for difference between two means
what do we have to assume?
– independent data, population variances assumed equal
Confidence interval for difference between two means – independent data, population variances assumed equal

<!--StartFragment-->
What if variances aren’t equal for different between two means?
Minitab (or other software packages) will compute a p-value (for hypothesis test), or a Confidence Interval (if required).
<!--StartFragment-->
How about paired samples?<!--EndFragment-->
i.e. have two samples, X1 and X2, which are paired.
For each observation, calculate Xd=X1-X2.
Then calculate X̄d and sd (mean and standard deviation of differences).
Conduct Hypothesis test on paired samples
1. Hypotheses
- Test Statistic
- Decision Test
- Conclusion
H0: μd=0
HA: μd≠0
Conduct Hypothesis test on paired samples
- Hypotheses
2. Test Statistic
- Decision Test
- Conclusion
- Test Statistic

Conduct Hypothesis test on paired samples
- Hypotheses
- Test Statistic
3. Decision Test
- Conclusion
- Decision Test
: Compare to a t-distribution with nd-1 df.
Confidence interval for the difference between two means, paired data

What about testing two proportions?
Our samples must be independent!
Use the idea that sample proportion is normally distributed, samples independent, then

Testing Two proportions
- Hypotheses
- Test Statistic
3. Decision Rule
- Conclusion
We compare this to a Z~N(0,1) distribution for the decision rule.
Testing Two proportions
- Hypotheses
2. Test Statistic
- Decision Test
- Conclusion

Testing Two proportions
1. Hypotheses
- Test Statistic
- Decision Test
- Conclusion
If the null hypothesis is true, both the sample proportions are estimating the same thing.
So, to find a standard error, we use a pooled sample proportion.

when may you use the pooled-variance t-test?
to determine whether there is a significant differnt between the means of two populations
(if population is not normally distributed, the pooled variance t test can still be used if sample sizes are large enough, typically greater than 30)
when should you use F test?
onl when each of the two populations are normally distributed. F test is very sensitive to normality assumption