Sampling and Distributions Flashcards
Representative Sample
Attempt to have the sample look just like the population; every member of the population has an equal chance of being selected
(ex. simple random sampling -> lists every member of population and randomly select from list; stratified random sampling -> list every member of pop., identify subgroups, randomly select proportionally from subgroups)
Non-Representative Sample
Systematically different from the population (ex. voluntary response sample -> individuals choose to respond to a general appeal for information; convenience sample -> includes the people that are easiest to contact and measure)
Quota Sample
Select same number of people regardless of their prevalence in pop. (50 democrats and 50 republicans)
Sampling Error
Differences between sample and population due to random chance; increasing sample size decreases sampling error; results from collecting data from some (not all) members of population
Sampling Distribution
Distribution of sample central tendency from a large number of sample (take low samples from pop., then calculate mean for each sample and make distribution of those means); builds a sampling dis. centered at pop. mean; normally distributed
Standard Normal Distribution
Every individual/observation is under curve; area=100%; the proportion of population in a range of values is based on M and SD of normal distribution; tells us that 68% of pop. is +/- 1 SD of M
Error
Difference between a sample and population
Making Comparisons
You don’t just compare the actual sample average (proportion), also compare underlying population distribution
Margin of Error
How much the sample mean (or proportion) differs from the value in population; each sample includes some random error so we need M.O.E.
Error Bars
If they overlap: there is no evidence of a significant difference; if we had a different random sample, there might be no difference between the groups
If no overlap: there is likely evidence of a significant difference
MOE Calculations
For categorical variables: MOE is based on sample size so if sample size increases, MOE decreases
For continuous variables: MOE is based on sample size and variation; can be represented using standard error of mean and confidence interval
Standard Error of Mean (SEM)
Based on both sample size and variation
Confidence Intervals (95%)
Confidence interval: the interval within which the population parameter is believed to be
Confidence Level
The probability that the confidence interval contains the parameter; multiply the SEM to reach C.I.
Calculate 95% CI
Calculate SEM, then MOE, then CI (mean+/- MOE)