Chapter 2 Flashcards
If we have data from just one sample, which three approaches can be used to obtain the sampling distribution?
- bootstrapping
- exact approaches
- theoretical approximations
Bootstrap sample
A random sample drawn from the initial sample by sampling with replacement.
How big should a bootstrap sample be?
The same size as the initial sample
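A minimal sketch of drawing one bootstrap sample, using only the Python standard library. The candy colors and the function name `bootstrap_sample` are made up for illustration.

```python
import random

def bootstrap_sample(initial_sample, rng):
    """Draw one bootstrap sample: same size as the initial sample, with replacement."""
    return rng.choices(initial_sample, k=len(initial_sample))

rng = random.Random(42)  # fixed seed so the sketch is reproducible
# Hypothetical initial sample of 10 candies (illustrative data, not from the course).
candy_colors = ["red", "red", "yellow", "green", "yellow",
                "red", "blue", "yellow", "red", "green"]
resample = bootstrap_sample(candy_colors, rng)
print(resample)
```

Because sampling is done with replacement, the same candy can appear more than once in the bootstrap sample and some candies may not appear at all.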
In bootstrapping, what happens when we have an unrepresentative initial sample?
The bootstrapped sampling distribution will be biased if the proportion in the initial sample is very different from the proportion in the population
Apart from the proportion in the initial sample, what else affects the accuracy of the sampling distribution?
The size of the initial sample. Bigger samples will yield better results, but will still not be perfectly accurate.
Note: The population should be much larger than the sample size!
What is the main limitation of bootstrapping?
We are never sure if our initial sample reflects the population well
What is the main advantage of bootstrapping?
Every sample statistic can be bootstrapped
E.g. the median weight of candies
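A rough sketch of bootstrapping the median, assuming a small made-up list of candy weights in grams. The helper name `bootstrap_distribution` and the number of resamples (5000) are illustrative choices, not prescribed values.

```python
import random
import statistics

def bootstrap_distribution(initial_sample, statistic, n_resamples, rng):
    """Approximate the sampling distribution of `statistic` by bootstrapping."""
    size = len(initial_sample)
    return [statistic(rng.choices(initial_sample, k=size))
            for _ in range(n_resamples)]

rng = random.Random(0)  # fixed seed so the sketch is reproducible
# Hypothetical candy weights in grams (illustrative data only).
candy_weights = [2.6, 2.9, 3.1, 2.8, 3.0, 2.7, 3.3, 2.9, 3.2, 2.8]
medians = bootstrap_distribution(candy_weights, statistics.median, 5000, rng)

# The spread of the bootstrapped medians estimates how much the sample
# median varies from sample to sample.
spread = statistics.stdev(medians)
print(statistics.mean(medians), spread)
```

Swapping `statistics.median` for any other function of a sample (mean, range, and so on) bootstraps that statistic instead.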
When do we use bootstrapping?
- When the assumptions for theoretical approximation are not met
- When we have a continuous variable
- When SPSS does not have a test for the sample statistic we want to examine
- When we know nothing about the population proportion
When do we use exact approaches?
When we (think we) know the proportion of the variable in the population.
What are limitations of exact approaches?
- Only works for discrete or categorical variables
- Needs a lot of computing power
What do we actually do in an exact approach?
We calculate the exact probability of every possible sample outcome
Describe the binomial distribution
It is a probability distribution of the possible number of successes (vs. failures) in N repeated, independent trials that all have the same probability of success. Each trial always has 2 possible outcomes
E.g. the number of heads in 3 coin tosses. Each toss is either heads or tails.
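The coin-toss example can be worked out exactly. This sketch assumes a fair coin (probability of heads 0.5) and uses the standard binomial formula; the function name `binomial_pmf` is illustrative.

```python
from math import comb

def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n trials with success probability p."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

# All possible outcomes for the number of heads in 3 tosses of a fair coin.
probabilities = {k: binomial_pmf(k, 3, 0.5) for k in range(4)}
print(probabilities)
# {0: 0.125, 1: 0.375, 2: 0.375, 3: 0.125}
```

The four probabilities add up to 1 because 0, 1, 2, and 3 heads are the only possible outcomes.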
What would we use an exact binomial test for?
Questions involving population proportions
E.g. “Are 50% of the candies in the population sticky?”
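A sketch of how such a test could be computed by hand, assuming a made-up sample of 10 candies of which 2 are sticky. The two-sided p-value here sums the probabilities of all outcomes that are at most as likely as the observed one; this is one common convention, and statistical software may use a different one.

```python
from math import comb

def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n trials with success probability p."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

def exact_binomial_p_value(successes, n, p_hypothesized):
    """Two-sided exact p-value: total probability of outcomes no more likely than the observed one."""
    observed = binomial_pmf(successes, n, p_hypothesized)
    tolerance = 1 + 1e-9  # guards against floating-point ties
    return sum(binomial_pmf(k, n, p_hypothesized)
               for k in range(n + 1)
               if binomial_pmf(k, n, p_hypothesized) <= observed * tolerance)

# Hypothetical data: 2 sticky candies in a sample of 10, hypothesized proportion 50%.
p_value = exact_binomial_p_value(2, 10, 0.5)
print(p_value)  # 0.109375
```

Here the outcomes 0, 1, 2, 8, 9, and 10 sticky candies are counted, giving (1 + 10 + 45) × 2 / 1024 = 0.109375.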
What would we use an exact cross tabs test for?
To examine the association between 2 categorical variables
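For a 2x2 table, a well-known exact test of association is Fisher's exact test. Whether this is the exact cross tabs test meant on this card is an assumption. The sketch below uses made-up counts and exact fractions; the function name `fisher_exact_two_sided` is illustrative.

```python
from fractions import Fraction
from math import comb

def fisher_exact_two_sided(table):
    """Two-sided Fisher's exact test p-value for a 2x2 table [[a, b], [c, d]]."""
    (a, b), (c, d) = table
    row1, row2, col1 = a + b, c + d, a + c
    total = row1 + row2

    def table_probability(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return Fraction(comb(row1, x) * comb(row2, col1 - x), comb(total, col1))

    observed = table_probability(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # Sum every table that is at most as likely as the observed table.
    extreme = [table_probability(x) for x in range(lowest, highest + 1)
               if table_probability(x) <= observed]
    return sum(extreme, Fraction(0))

# Hypothetical cross tab: rows = candy color (red / yellow), columns = sticky (yes / no).
p_value = fisher_exact_two_sided([[3, 1], [1, 3]])
print(p_value, float(p_value))  # 17/35, about 0.486
```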
What would we use an exact chi-squared test for?
To test whether 1 categorical variable follows a hypothesized population distribution
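One way to sketch an exact test against hypothesized population proportions is to enumerate every possible set of category counts, compute its multinomial probability, and sum the probabilities of all outcomes whose chi-squared statistic is at least as large as the observed one. The data (6 candies in 3 colors, hypothesized as equally common) and all function names are made up for illustration, and real software may order outcomes differently.

```python
from fractions import Fraction
from math import factorial

def compositions(total, parts):
    """Yield every way to split `total` items over `parts` categories."""
    if parts == 1:
        yield (total,)
        return
    for first in range(total + 1):
        for rest in compositions(total - first, parts - 1):
            yield (first,) + rest

def chi_squared(counts, proportions):
    """Chi-squared statistic: sum of (observed - expected)^2 / expected."""
    n = sum(counts)
    return sum((c - n * p) ** 2 / (n * p) for c, p in zip(counts, proportions))

def multinomial_probability(counts, proportions):
    """Exact probability of these category counts under the hypothesized proportions."""
    coefficient = factorial(sum(counts))
    for c in counts:
        coefficient //= factorial(c)
    probability = Fraction(coefficient)
    for c, p in zip(counts, proportions):
        probability *= p ** c
    return probability

def exact_goodness_of_fit(observed, proportions):
    """Sum the probabilities of all outcomes at least as extreme as the observed one."""
    observed_stat = chi_squared(observed, proportions)
    extreme = [multinomial_probability(counts, proportions)
               for counts in compositions(sum(observed), len(observed))
               if chi_squared(counts, proportions) >= observed_stat]
    return sum(extreme, Fraction(0))

# Hypothetical sample: 5 red, 1 yellow, 0 green; hypothesis: each color is 1/3 of the population.
equal_thirds = [Fraction(1, 3)] * 3
p_value = exact_goodness_of_fit([5, 1, 0], equal_thirds)
print(p_value, float(p_value))  # 13/243, about 0.053
```

The number of possible outcomes grows quickly with sample size and number of categories, which is why exact approaches need a lot of computing power.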