Ch 18 - Two-sample problems for population means (σ unknown) Flashcards
Two samples situations
independent vs dependent two sample
We have 2 independent SRSs coming from 2 populations with (u1,s1) and (u2,s2) unknown. We use ____ and ___ to estimate (u1,s1) and (u2,s2) respectively.
(xbar1,s1) and (xbar2,s2)
for t distribution of 2 independent samples Both populations should be
Normally distributed.
BUT in practice, it is enough that both distributions have similar shapes and that the sample data contain no strong outliers.
The two-sample t statistic follows approximately a t distribution with a
standard error SE (denominator) reflecting variation from both samples.
Ho for 2 independent random samples
Two sample t-confidence interval
Because we have 2 independent samples we use the difference between both sample averages (x1-x2) to estimate (u1 − u2).
The two-sample statistic is most robust when
both sample sizes are equal and
both sample distributions are similar.
But even when we deviate from this, two-sample tests tend to remain quite robust.
As a guideline, a combined sample size (n1 + n2) of 40 or more will allow you to work even with the most skewed distributions.
two versions of the two-sample t procedures
one assuming equal variance (“pooled”) and one not assuming equal variance for the two populations. They have slightly different formulas and df.
Variance = SD^2
The pooled (equal variance) two-sample t test has degrees of freedom
n1 + n2 − 2
the assumption of equal variance is
hard to check, and thus the unequal variance test is safer.