sampling and randomisation Flashcards
population
parameters (unknown and hard to measure); the large group of units about which inferences are made eg. all married women
mew
sample
statistics - the smaller group of units that are measured
x bar
used to make inferences if representative
advantages of sample survey over a census
census is not always possible
speedier
less costly
can devote more resources to getting an accurate sample
sampling data
define population of concern
specific the sampling frame - electron roll, telephone list etc. (representative)
specify sampling method - haphazard and convenience or probably and randomness
determine sample size to achieve desired accuracy - error is alpha 1/squrt(n)
sample data
apply statistical description and inference
types of sampling
- convinience and haphazard
- random (probability sample)
- stratified random (probability sample)
convenience and haphazard
selection bias: readers of column where survey is are only a subset of population
voluntary response bias: people with opinions do surveys
non-response bias: people don’t respond or can’t be contacted
undercoverage bias: small sample, does not represent population
questionnaire bias: bias can be introduced on how something is worded
simple random sampling
all units in population have the same chance of being in the sample
usually representative if the population is homogeneous
can do randomisation in excel
stratified random sampling
sort into groups (age, sex, some confounding variable) and then randomly allocate (proportional to stratum in population)
- cluster sampling - sample an entire group
- systematic sampling - randomise list and select a random starting point and choose every tenth unit
- multistage - combine methods
- helps with a heterogenous population