Topic 6 Hypothesis Testing Flashcards
One sample testing vs 2 sample testing
Single sample against population e.g grades in Birmingham vs population in England
Two samples against EACH OTHER. E.g grades in Birmingham vs London
3 assumptions to conduct one sample hypothesis testing
Random sampling
Level of measurement is interval or ratio (to calc mean)
Sample distribution is normal (we can be sure of this if large size, using CLT)
Step 1 of one sample hypothesis testing
State the null and alternate hypothesis
Null hypothesis
Statement about the value of a population parameter
H0
Alternate hypothesis h1
Statement accepted if null is false.
Inequalities are always part of the alternate i.e <,>
Possible null and alternative hypothesis symbols
If h0 is = h1 is ≉
If h0 is greater or equal to, h1 is <
If h0 is less or equal to, h1 is >
Steps to one sample hypothesis testing
1.State null and alternative
2.Choose level of significance e.g 95% confidence so alpha=0.05
- Choose test statistic
- Formulate decision rule (critical value is where the null hypothesis is rejected)
- Make decisions
- calculate test statistic
Find z value (sample mean (x bar) - population mean/(pop SD/root N (sample size)
E.g if critical region is 1.96 and z= 8, it falls in critical region so we reject null hypothesis
What happens for one tailed tests
E.g left tail test, full 5% will lie in lower tale, changes the decision rule (critical value changes) now -1.65, 8>-1.65 so we now fail to reject null
T statistic and z score formulas ( the value that will be compared to the critical region)
Z score
Sample mean (X bar) - population mean (μ)
/
σ/√N.
It is similar to Z value, but Z value doesn’t have root N, as no sample size.
T statistic
Same but s instead of σ
When to use one vs two tailed
Want to know whether a different number of holidays happen in Edgbaston to Birmingham overall, use 2 tailed.
If we want to know whether edgbaston take MORE holidays then one tailed.
When to use sigma or s
If sigma unknown use S!
When to use z or t distribution, when unknown sigma
If large sample, Z
Small sample, T
P-value
Highest level of confidence we can reject the null
How to find p value example
Lower p value is, the more confident
If p value is 0.01, reject null up to 99% confidence.
3 sample hypothesis test assumptions e.g grades London vs Birmingham
Two independent random samples used (not related)
Level of measurement is interval or ratio (to calc mean)
Normal sampling distribution (confirm if large, by CLT)