Mindless stats Flashcards
Replication
Can we replicate the results form the study
Reproducibility is an essential for scientific progress and knowsledge + theory dev+ essental for application in real world
Replication crisis
It involves many fields, including psychology
Failure to replicated results when implamenting the same study design
~36% of studies replicated when take same data and methods (based on whether p<0.05)
~47% studies replicated when using 95% of CI as statistical significance indicator
Type I error
Incorrect acceptce of false positives
Type II error
Incorrect acceptce of false negatives
Why the results do not replicate? (4)
- Sampling variability: no two samples are the same
- Hidden moderators across studies
* Different cultural contexts
* Time in history
* Demographic characteristics
* Methods - Low statistical power
* False positives in original study
* False negatives in replication study - Questionable research practices (QRPs) e.g. finding meaning in noise
Why do some researchers engage in QRPs?
There is a bias of publicating only novel and significant findings
Issues with using p
A p-value is (sort of) a measure of how incompatible the data are with a specified statistical model, not e.g. if the hypothesis is true nor an effect size measure
The value is not based on anything real, it’s a made up threshold
When it’s not significant = we don’t know = it is not support for null hypothesis
If you can report effect sizes alongisde p (e.g. ANOVA, r, t-test)
Statistical power
Probability of correctly identifying a false null hypothesis
Sample size: larger N, greater power to
detect weak effects
Effect size: larger effect size, greater power
to detect with small sample sizes
A priori power analysis (sample size estimation)
Pre-specify necessary parameters to estimate how many participants you will need
* Parameters
* Type of analysis
* Type of power analysis
* Tails: one- vs. Two-tailed
* Expected correlation (be conservative)
* Power: aim for high power
HARKing
Hypothesising After the Results are Known
p-hacking
The practice of iteratively analysing your data in order to find significant results
Difference between Exploration vs. Inference
Exploring: finding the structure in your data
* Can perform many analyses
Inference: using data to choose between different models
* Logic of p-values: 1 analysis to test a prediction
95% Confidence Intrevals
They estimage a range where the correct value is
We have 95% confidence in stating that the unknown “true” mean lies within this interval range
This can provide valuable information about the lower and upper range of the likely effect
* Important when considering replications
If CIs do not overlap with 0, then the value is significantly different from 0
Advantages of CI over p-values
- We don’t tend to make reasoning errors with CIs
- We see the full range - (un)certainty
- CIs provide a more reasonable prediction about future outcomes/measurement than p-values