Mindless stats Flashcards

1
Q

Replication

A

Can we replicate the results form the study

Reproducibility is an essential for scientific progress and knowsledge + theory dev+ essental for application in real world

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Replication crisis

A

It involves many fields, including psychology
Failure to replicated results when implamenting the same study design

~36% of studies replicated when take same data and methods (based on whether p<0.05)
~47% studies replicated when using 95% of CI as statistical significance indicator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Type I error

A

Incorrect acceptce of false positives

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Type II error

A

Incorrect acceptce of false negatives

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why the results do not replicate? (4)

A
  1. Sampling variability: no two samples are the same
  2. Hidden moderators across studies
    * Different cultural contexts
    * Time in history
    * Demographic characteristics
    * Methods
  3. Low statistical power
    * False positives in original study
    * False negatives in replication study
  4. Questionable research practices (QRPs) e.g. finding meaning in noise
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why do some researchers engage in QRPs?

A

There is a bias of publicating only novel and significant findings

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Issues with using p

A

A p-value is (sort of) a measure of how incompatible the data are with a specified statistical model, not e.g. if the hypothesis is true nor an effect size measure

The value is not based on anything real, it’s a made up threshold

When it’s not significant = we don’t know = it is not support for null hypothesis

If you can report effect sizes alongisde p (e.g. ANOVA, r, t-test)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Statistical power

A

Probability of correctly identifying a false null hypothesis

Sample size: larger N, greater power to
detect weak effects
Effect size: larger effect size, greater power
to detect with small sample sizes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

A priori power analysis (sample size estimation)

A

Pre-specify necessary parameters to estimate how many participants you will need
* Parameters
* Type of analysis
* Type of power analysis
* Tails: one- vs. Two-tailed
* Expected correlation (be conservative)
* Power: aim for high power

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

HARKing

A

Hypothesising After the Results are Known

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

p-hacking

A

The practice of iteratively analysing your data in order to find significant results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Difference between Exploration vs. Inference

A

Exploring: finding the structure in your data
* Can perform many analyses

Inference: using data to choose between different models
* Logic of p-values: 1 analysis to test a prediction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

95% Confidence Intrevals

A

They estimage a range where the correct value is

We have 95% confidence in stating that the unknown “true” mean lies within this interval range

This can provide valuable information about the lower and upper range of the likely effect
* Important when considering replications

If CIs do not overlap with 0, then the value is significantly different from 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Advantages of CI over p-values

A
  • We don’t tend to make reasoning errors with CIs
  • We see the full range - (un)certainty
  • CIs provide a more reasonable prediction about future outcomes/measurement than p-values
How well did you know this?
1
Not at all
2
3
4
5
Perfectly