Lecture 8 Flashcards
What is P-Hacking?
Deliberate and selective reporting of significant P-values-deliberate abuse of Df. Can include correlational methods and overfitting of data
What percentage of researchers admit to questionable practices?
10%
What percentage of researchers admit to dropping data based on a “gut feeling”?
15%
What percentage of researchers allowed industry funders to write their first draft?
30%
What percentage of researchers allowed an industry funder to decide when study is terminated?
34%
What percentage of researchers know someone who has failed to report data that contradicts their hypothesis?
69%
What percentage of researchers know someone who has chosen different statistical techniques that decreases the P-value?
46%
What percentage of researchers know someone who has reported only significant results?
46%
What percentage of researchers know someone who has excluded data based on a “gut feeling”
20%
What happens when we look at the predicted P-values graph?
The number of studies reporting something that just barely makes P < 0.05 is way higher than predicted. There are fewer P values over 0.05 than expected as well. Suggests that researchers are messing with data to put it in the “just less than 0.05 section”
What is HARKing?
Hypothesizing After Results ae Known. Present a hypothesis that was made after data collection as though it was made beforehand. Collect data on a bunch of variables and see what sticks
What are some solutions to the problems with data collection?
1) Effect sizes
2) Transparency
3) Providing all materials needed for replication
4) Make raw data available
How can effect size help us?
Effect sizes tell you the size of the effect in an objective and standardized way-independent of sample size, allows for comparison across studies (even one’s with different measures and sample sizes)
What is Cohen’s d?
Measure of effect size wherein the difference between 2 comparison means is divided by the pooled SD. Tells us how overlapped our two populations are. Effect size will remain the same regardless of sample size
What is Pearsons r?
Correlation coefficient that tells us how strongy related 2 variables are-only used if one of the variables is categorical with 2 categories.