Resampling and Open Science Flashcards
What are resampling methods?
Use repeated sampling from the observed data to set confidence intervals and critical values for tests
Non-parametric
Relatively assumption-free
Computationally demanding
Bootstrapping vs Permutation
Both involve computer simulation
Bootstrapping samples with replacement
Permutation samples without replacement
Simulating the Null Hypothesis Distribution: Permutation Tests
A common statistical question: are two groups different?
Null Hypothesis
States that the group means are equivalent
Alternative Hypothesis
States that the group means are not equivalent
P-value
Indicates the probability of obtaining a difference in means at least as extreme as that observed by chance, assuming the two samples came from the same population
Null Hypothesis Testing using Parametric Methods
Simulate or select theoretical null hypothesis sampling distribution
Determine where our observed test statistic lies within this distribution, and the probability of observing it if the null hypothesis were true
Null Hypothesis Testing using Permutation Tests
- calculate the real difference between the means of the two groups
- simulate the null hypothesis sampling distribution by permutation
- determine where the real difference lies within this distribution
Simulating H0: Random Shuffling of Observations
Pool the samples together, then draw new groups from the pooled sample
Repeat this process many times to see what differences arise by chance alone
This builds the distribution of mean differences expected under the null hypothesis
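The shuffling procedure above can be sketched as follows. This is a minimal illustration with hypothetical data (`a` and `b` are made-up samples, not from the notes):

```python
import numpy as np

rng = np.random.default_rng(0)

# Two hypothetical groups (illustrative data only)
a = rng.normal(0.5, 1.0, size=30)
b = rng.normal(0.0, 1.0, size=30)

observed = a.mean() - b.mean()       # the real difference in means
pooled = np.concatenate([a, b])      # pool the samples together

n_perm = 10_000
null_diffs = np.empty(n_perm)
for i in range(n_perm):
    shuffled = rng.permutation(pooled)                       # shuffle group labels
    null_diffs[i] = shuffled[:len(a)].mean() - shuffled[len(a):].mean()

# Two-sided p-value: proportion of shuffled differences
# at least as extreme as the one observed
p_value = np.mean(np.abs(null_diffs) >= abs(observed))
print(p_value)
```

Each shuffle produces one mean difference that could have occurred if group membership were arbitrary; together they form the simulated null distribution.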
Bootstrapping
Resampling method for assessing statistical accuracy of an estimate
Treat your own sample as if it were the entire population and resample from it
Does not work well for small samples
Typically used to estimate quantities associated with the sampling distribution of estimates
The original sample is drawn from with replacement; repeating this many times creates bootstrap samples
Basic Ideas of Bootstrapping
Treat particular sample as the entire population
Repeatedly sample with replacement to generate samples
Analyse each resampled dataset to get a distribution of estimates
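The three steps above can be sketched as a percentile bootstrap confidence interval for a mean. A minimal sketch, assuming a hypothetical sample `data` (illustrative only):

```python
import numpy as np

rng = np.random.default_rng(0)
data = rng.normal(10.0, 2.0, size=100)   # treat this sample as the "population"

n_boot = 10_000
boot_means = np.empty(n_boot)
for i in range(n_boot):
    # Repeatedly sample with replacement from the original sample
    resample = rng.choice(data, size=len(data), replace=True)
    boot_means[i] = resample.mean()      # analyse each bootstrap sample

# Percentile 95% confidence interval from the bootstrap distribution
lo, hi = np.percentile(boot_means, [2.5, 97.5])
print(lo, hi)
```

The spread of `boot_means` approximates the sampling distribution of the mean, which is exactly the quantity bootstrapping is typically used to estimate.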
How Science Should Work?
Thought to be a reliable way to answer questions about the world
Start with hypothesis
Collect data
Do statistics to test null hypothesis
Make conclusion based on the data
Reproducibility Crisis in Science
Findings do not replicate
Most significant effects should replicate
97% of 100 papers reported significant findings but only 37% were significant in the replication study
Findings should reproduce: repeating an experiment should replicate the significant effect - in practice this is rare
Publication Bias
Not all research is equally publishable - editorial bias and incentive structures increase the likelihood of false positives being published
Wrong incentive structures - academic success tied to ‘significant’ results, publish or perish
Distorts meta-analyses - bad for estimating effect sizes
Not publishing negative results skews meta-analyses
P-Hacking
Actively searching for ‘something significant’ in data
Cherry-picking - experiments, subjects, stopping rules
Exploiting researcher degrees of freedom in the analysis
Variant of multiple comparisons problem
Focusing on analyses that give significant findings and ignoring those that do not inflates the chance of false positives
Unless controlled for, multiple comparisons inflate the probability of false positives
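The inflation from uncorrected multiple comparisons can be shown with a short simulation. Under a true null hypothesis, p-values are uniformly distributed, so running 20 uncorrected tests per "study" produces at least one "significant" result far more often than 5% of the time (assumed numbers, for illustration only):

```python
import numpy as np

rng = np.random.default_rng(0)
n_studies, n_tests, alpha = 10_000, 20, 0.05

# Under H0, p-values are uniform on [0, 1]; every null here is true
p_values = rng.uniform(size=(n_studies, n_tests))

# Did at least one of the 20 tests come out "significant" by chance?
any_hit = (p_values < alpha).any(axis=1)
print(any_hit.mean())   # close to 1 - 0.95**20 ≈ 0.64, not 0.05
```

This is why cherry-picking among many tests, subjects, or stopping rules makes false positives far more likely than the nominal alpha suggests.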
HARKing
Hypothesising After the Results are Known
Related to p-hacking
Hides the reality of multiple comparison problem
Apophenia
Tendency to see patterns in random data
Confirmation Bias
Tendency to focus on evidence that is in line with our expectations or favoured explanation
Hindsight Bias
Tendency to see an event as having been predictable only after it has occurred
Why do Findings Fail to Replicate?
When low-powered studies show significant effects, the estimated effect sizes tend to be overestimates
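This overestimation can be demonstrated by simulation. A minimal sketch, assuming a small true effect of 0.2 and many low-powered studies of n = 20 (all numbers are illustrative assumptions); only estimates that pass a crude z-style significance filter are kept:

```python
import numpy as np

rng = np.random.default_rng(0)
true_effect, n, n_sims = 0.2, 20, 20_000

significant_estimates = []
for _ in range(n_sims):
    sample = rng.normal(true_effect, 1.0, size=n)
    se = sample.std(ddof=1) / np.sqrt(n)
    if abs(sample.mean() / se) > 1.96:       # crude z-style significance test
        significant_estimates.append(sample.mean())

# The average "significant" estimate is well above the true effect of 0.2
print(np.mean(significant_estimates))
```

Because a small study can only reach significance when sampling error happens to exaggerate the effect, conditioning on significance selects inflated estimates, and replications then regress back toward the smaller true effect.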
Open Science - Open Hypothesis Testing and Analysis Decision Making
Pre-registration - publicly commit to your hypothesis and analysis pipeline before conducting study - treats p-hacking and HARKing
Registered reports - pre-registration coupled to publication; can also treat publication bias
Open Science - Open Analysis Tools and Data
Make all materials open access so other researchers can double check conclusions
Treats honest mistakes
Open Science - Open Evaluation: Peer Review Published with Article
More information is always better
Open Science - Open Access
Make the research outcomes (published articles) accessible to everyone