Week 4 - Bootstrap Flashcards

Question 1

Q

General format for CIs

Answer

A

estimate ± quantile × se(estimate)

If we know the quantiles of the estimate's sampling distribution, we can calculate a CI
The quantiles come from the CLT or from a known sampling distribution

However, the bootstrap is an alternative method when we can’t rely on the CLT
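
As a rough illustration of the formula on this card, here is a minimal sketch of a CLT-based CI for a sample mean. The function name `normal_ci` and the data values are made up for the example, and it assumes the normal approximation is reasonable for the data.

```python
import math
import statistics

def normal_ci(data, alpha=0.05):
    """CI of the form: estimate ± quantile × se(estimate), for a sample mean."""
    n = len(data)
    estimate = statistics.mean(data)
    # Standard error of the mean: sample standard deviation / sqrt(n)
    se = statistics.stdev(data) / math.sqrt(n)
    # Quantile from the standard normal distribution (CLT assumption)
    quantile = statistics.NormalDist().inv_cdf(1 - alpha / 2)
    return estimate - quantile * se, estimate + quantile * se

# Made-up illustrative values, not data from the course
sample = [4.1, 5.3, 3.8, 6.0, 5.1, 4.7, 5.5, 4.9]
lower, upper = normal_ci(sample)
print(f"95% CI for the mean: ({lower:.2f}, {upper:.2f})")
```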

Question 2

Q

Bootstrap

Answer

A

Create new datasets (bootstrap samples) by randomly picking data points from the original dataset, allowing the same point to be picked more than once (with replacement)
Each new dataset will have the same size as the original one
Compute the estimate on each bootstrap sample and repeat this process many times to get an approximate sampling distribution of the estimate
We can use this bootstrap distribution to understand the uncertainty of the estimate and calculate a CI
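
The steps above can be sketched as follows. This is only one possible version: it assumes the percentile method for the CI (the card does not say which bootstrap interval is meant), and the name `bootstrap_ci`, the number of resamples, and the data are all made up for illustration.

```python
import random
import statistics

def bootstrap_ci(data, stat=statistics.median, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI (assumed method) for an arbitrary statistic."""
    rng = random.Random(seed)
    n = len(data)
    boot_stats = []
    for _ in range(n_boot):
        # Resample with replacement, same size as the original dataset
        resample = rng.choices(data, k=n)
        boot_stats.append(stat(resample))
    # The sorted bootstrap statistics approximate the sampling distribution
    boot_stats.sort()
    lo_idx = round((alpha / 2) * (n_boot - 1))
    hi_idx = round((1 - alpha / 2) * (n_boot - 1))
    return boot_stats[lo_idx], boot_stats[hi_idx], boot_stats

# Made-up illustrative values, not data from the course
sample = [4.1, 5.3, 3.8, 6.0, 5.1, 4.7, 5.5, 4.9, 12.4, 3.9]
lower, upper, boot_stats = bootstrap_ci(sample)
print(f"95% bootstrap CI for the median: ({lower}, {upper})")
```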

Question 3

Q

What if the bootstrap sample size is not the same as the original?

Answer

A

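The bootstrap distribution then reflects the wrong sample size, which leads to unreliable and biased estimates of uncertainty (standard errors and CIs)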
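
A small experiment can make this concrete. The sketch below is an illustration under assumptions: the helper `bootstrap_se`, the simulated data, and the resample sizes are all invented for the example. It compares the bootstrap standard error of the mean when resamples are smaller than, equal to, or larger than the original dataset.

```python
import random
import statistics

def bootstrap_se(data, resample_size, n_boot=2000, seed=0):
    """Bootstrap standard error of the mean for a given resample size."""
    rng = random.Random(seed)
    means = [
        statistics.mean(rng.choices(data, k=resample_size))
        for _ in range(n_boot)
    ]
    return statistics.stdev(means)

# Made-up dataset of 50 points, generated only for this illustration
data_rng = random.Random(1)
data = [data_rng.gauss(10, 2) for _ in range(50)]

se_small = bootstrap_se(data, 10)    # resamples too small: spread too wide
se_same = bootstrap_se(data, 50)     # same size as the original
se_large = bootstrap_se(data, 200)   # resamples too large: spread too narrow
print(se_small, se_same, se_large)
```

Only the same-size resamples give a spread that matches the uncertainty of an estimate based on 50 observations.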

Question 4

Q

Why use bootstrap?

Answer

A

The bootstrap uses all of the observed data and does not assume normality, so it is more versatile when we have non-normal data

Question 5

Q

Difference between simulation and bootstrap

Answer

A

Simulation starts with an assumed or known model (e.g., normal distribution, Poisson process) to generate data

Bootstrap, by contrast, resamples the original dataset, treating it as the only “population” available
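
The contrast can be shown in a few lines. This is a minimal sketch: the standard normal model and the observed values are assumptions chosen only for illustration.

```python
import random

rng = random.Random(42)

# Simulation: generate data from an assumed model (here, a standard normal)
simulated = [rng.gauss(0.0, 1.0) for _ in range(20)]

# Bootstrap: no model; resample the observed data with replacement
observed = [2.3, 1.9, 3.1, 2.8, 2.2, 4.0, 2.5, 3.3]  # made-up values
resample = rng.choices(observed, k=len(observed))

print(simulated[:3])  # new values drawn from the model
print(resample)       # only values that already appear in `observed`
```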

(5 cards)