Confidence Intervals/Hypothesis Testing Flashcards
95% Confidence Interval
Range of scores constructed such that the population mean will fall within this range in 95% OF SAMPLES
NOT an interval within which we are 95% confident that the population mean will fall.
Null Hypothesis
An effect doesn’t exist
Alternative hypothesis
An effect exists
Test statistic
Calculate the probability that we would get a value as big as the one we have if the null hypothesis were true.
Type 1 error
We believe that there is an effect when there isn’t
Type 2 error
We believe there is no effect when there is
Power
Probability that a test will find an effect when one exists
Effect of sample size
Same effect will have different p-values in different-sized samples
Small differences can be deemed ‘significant’ in large samples
Large effects might be deemed ‘non-significant’ in small samples.
Disadvantages of NHST
All-or-nothing thinking (0.05 cut-off)
Biased by researchers deviating from their initial sampling frame (e.g., by stopping data collection earlier than planned).
Researcher degrees of freedom
Scientists can influence p-value:
selective exclusion of data
fitting different statistical models but reporting only the one with the most favourable results
stopping data collection at a point other than that decided at the study’s conception
including only control variables that influence the p-value in a favourable way.
P-hacking
Practices that lead to the selective reporting of significant p-values
i.e trying multiple analyses and reporting only the one that yields significant results
HARKing
Hypothesising after results are known