bootstrapping Flashcards
parameter (N units)
numerical summary for the population (e.g., population slope β1)
statistic (n units)
numerical summary calculated from sample data (e.g., estimated slope in the sample, β̂1)
good samples
drawn at random, unbiased, and representative of the population
statistics from them should be good estimates of the population parameter without the need for a census
how to get a bootstrap sample
choose with replacement from the existing sample, using the same sample size
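A minimal sketch of drawing one bootstrap sample, assuming the existing sample is held in a Python list. The name `bootstrap_sample` and the data values are hypothetical, chosen only for illustration.

```python
import random

def bootstrap_sample(sample, rng=None):
    """Draw one bootstrap sample: same size as `sample`, chosen with replacement."""
    rng = rng or random.Random()
    # random.choices samples WITH replacement, so values can repeat
    return rng.choices(sample, k=len(sample))

sample = [2.1, 3.4, 1.9, 5.0, 4.2]            # hypothetical observed sample
resample = bootstrap_sample(sample, random.Random(0))
print(resample)
```

Because the draw is with replacement, some original observations usually appear more than once in the resample and others not at all.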
bootstrap statistic
statistic computed for each bootstrap sample
bootstrap distribution
collection of bootstrap statistics from many bootstrap samples
how to get bootstrap distribution
start with a sample of size n, take k resamples, and calculate the statistic on each; as k -> infinity, the distribution of the k resample statistics approximates the sampling distribution
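The steps above can be sketched as follows, assuming the statistic of interest is the sample mean. The function name `bootstrap_distribution`, the data, and the choice k = 2000 are illustrative assumptions, not part of the original cards.

```python
import random
import statistics

def bootstrap_distribution(sample, stat, k=1000, seed=0):
    """Return k bootstrap statistics, one per resample of size n = len(sample)."""
    rng = random.Random(seed)
    n = len(sample)
    # each resample is drawn with replacement and has the original sample size
    return [stat(rng.choices(sample, k=n)) for _ in range(k)]

sample = [2.1, 3.4, 1.9, 5.0, 4.2, 3.3, 2.8]   # hypothetical observed sample
boot_means = bootstrap_distribution(sample, statistics.mean, k=2000)
print(len(boot_means), statistics.mean(boot_means))
```

A larger k gives a smoother picture of the bootstrap distribution, but it does not add information beyond what the original n observations contain.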
standard error
standard deviation of the sampling distribution of a statistic (measure of sampling variability)
bootstrap standard error
the standard deviation of the bootstrap distribution
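Written out as a formula, under assumed notation (the cards do not define these symbols): let \hat\theta^{*}_{i} be the statistic from the i-th of k bootstrap samples and \bar\theta^{*} their average. The bootstrap standard error is then the usual sample standard deviation of those k values.

```latex
\mathrm{SE}_{\text{boot}}
  = \sqrt{\frac{1}{k-1}\sum_{i=1}^{k}\left(\hat\theta^{*}_{i} - \bar\theta^{*}\right)^{2}},
\qquad
\bar\theta^{*} = \frac{1}{k}\sum_{i=1}^{k}\hat\theta^{*}_{i}
```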
what does the bootstrap assess
whether sample results are statistically significant, so that inferences can be drawn from the regression model to the population
based on sampling repeatedly with replacement from the data at hand and computing the regression coefficients from each resample
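A rough sketch of this idea for a simple regression slope, assuming whole (x, y) pairs are resampled and a 95% percentile interval is used to judge significance. The percentile interval is one common choice and is an assumption here, as are the data and the names `slope` and `bootstrap_slopes`.

```python
import random
import statistics

def slope(pairs):
    """Least-squares slope of y on x for a list of (x, y) pairs."""
    xs = [x for x, _ in pairs]
    ys = [y for _, y in pairs]
    mx, my = statistics.mean(xs), statistics.mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in pairs)
    return sxy / sxx

def bootstrap_slopes(pairs, k=2000, seed=0):
    """Resample (x, y) pairs with replacement and refit the slope k times."""
    rng = random.Random(seed)
    n = len(pairs)
    slopes = []
    while len(slopes) < k:
        resample = rng.choices(pairs, k=n)
        if len({x for x, _ in resample}) < 2:
            continue  # slope is undefined when every x is equal; redraw
        slopes.append(slope(resample))
    return slopes

# hypothetical data with a roughly linear trend
pairs = [(1.0, 2.3), (2.0, 3.9), (3.0, 6.4), (4.0, 7.8), (5.0, 10.1),
         (6.0, 12.2), (7.0, 13.7), (8.0, 16.3), (9.0, 17.9), (10.0, 20.4)]

slopes = sorted(bootstrap_slopes(pairs))
k = len(slopes)
lo, hi = slopes[int(0.025 * k)], slopes[int(0.975 * k) - 1]  # 95% percentile interval
print(f"sample slope = {slope(pairs):.3f}, 95% interval = ({lo:.3f}, {hi:.3f})")
```

If the interval excludes 0, the sample slope would typically be read as statistically significant at roughly the 5% level.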