SDA: Significance testing Flashcards
Null hypothesis H0
No difference between groups under study
Research hypothesis H1
There is a difference between groups under study
What are the two types of research hypotheses?
Two-tailed
One-tailed
Two-tailed research hypothesis
Stating there will be a difference but nothing else
e.g. there will be a difference in average levels of disposable income between two areas
One-tailed research hypothesis
Gives an indication of the direction of difference
e.g. People living in Hackney will have less disposable income than people living in Westminster
Why do we use hypotheses?
Assess whether the sample and calculated statistics are reasonable: not due to chance and generalisable to the population
Make inferences about a population using data for a sample
What is the most common significance level used?
95% i.e. happy to be wrong 5/100 times
May need to use the 99% significance level at times
Type 1 error in statistical testing
When H0 is found to be true, but the decision is made to reject H0 - this is a false positive
Type 2 error in statistical testing
When H0 is found to be false, but the decision is made to accept H0 - this is a false negative
What things should be remembered about sampling?
Larger the sample the more likely the chances of finding a statistically significant result
Always a possibility of chance findings as using sampled data: By using a more demanding significance level, we limit the chances of Type 1 errors. BUT this leads to a trade off - decreasing the chance of a Type 1 error, increases the chance of Type 2 errors, therefore should limit the number of comparisons and statistical calculations made
What are the two distinct types of statistical tests?
Parametric
Non-paramertic
Non-parametric tests should be used with what type of data?
Nominal
Weak ordinal
Strong ordinal
Parametric tests should be used with what type of data?
Interval/Ratio
What test is used when analysing NOMINAL data?
Chi-squared one sample test (This increases to a two-sampled and k-sampled when number of samples increases)
What test is used with WEAK ORDINAL (and nominal) data?
Kolomogrov-Smirnov one-sample/two-sample test
What test is used with ORDINAL data?
Mann-Whitney U test (when 2 samples)
Kruskal-Wallis test (more than 2 samples)
What test is used with one and two-samples of INTERVAL/RATIO data?
One sample T-test/Two sample T-test
What test is used when there are more than two samples of INTERVAL/RATIO data?
Snecedor F ratio test
What assumptions must be followed with parametric tests?
1) Observations must be drawn from a pop. with a normal distribution
2) Observations must be independent i.e. randomly sampled
3) The pop. must have the same amount of variability
Key points to remember about Parametric and Non-parametric tests:
Parametric:
demanding assumptions
interval/ratio data
Non-parametric:
less demanding/no assumptions
any data especially nominal/ordinal
fall-back situation if parametic assumptions cannot be met
What is the general process for significance testing?
- State H0
- Specify the desired significance level (usually 95%)
- Perform calculations to generate the test statistic
- Compare the test statistic with the critical value in the sampling distribution
- Accept or reject H0 - if the test statistic exceeds the critical value at the chosen significance level, then H0 is rejected