Lecture 4 (SAMPLING AND SAMPLING DISTRIBUTIONS) Flashcards
SAMPLING
A means for gathering useful information about a population - information gathered and conclusions drawn
What is the advantage of sampling over census?
Sampling saves money and time
Research process is sometimes destructive so can save product.
It is the only option when accessing a population is impossible.
What are the reasons for taking a census over a sample?
Eliminates the possibility that a random sample is not representative of the population.
The person authorising the study is uncomfortable with sample information.
Safety of customer.
POPULATION FRAME
A list, map, directory, or other source used to represent the population.
OVER REGISTRATION (population frame)
The frame contains all members of the target population and some additional elements.
UNDER REGISTRATION (population frame)
The frame does not contain all members of the target population.
The goal is to minimise differences between target population and frame.
RANDOM SAMPLING
Every unit of the population has the same probability of being included in the sample.
A chance mechanism is used in the selection process.
Eliminates bias
Also known as probability sampling.
NON-RANDOM SAMPLING
Every unit of the population does not have the same probability of being included in the sample.
Open to selection bias
Not appropriate data collection method for most statistical methods
Non-probability sampling.
What are the 4 random sampling techniques?
Simple Random Sampling
Stratified Random Sampling
Systematic Random Sampling
Cluster (or Area) Sampling
SIMPLE RANDOM SAMPLE
Basis for other random sampling techniques
Each unit is numbered 1 to n
A random number generator can be used to select n items from the sample.
Easier to perform for small populations
Cumbersome for large populations.
STRATIFIED RANDOM SAMPLE
Population is divided into non-overlapping sub populations called strata
A random sample is selected from each stratum
Proportionate (% of the sample taken from each stratum is proportionate to the % that each stratum is within the whole population)
Disproportionate (when the % of the sample taken from each stratum is not proportionate to the % that each stratum is within the whole population.
has the potential to match the sample closely to the population
Stratified sampling is more costly
Stratum should be relatively homogeneous (i.e. race, gender, religion
SAMPLING ERROR
A sample does not represent the population.
SYSTEMATIC RANDOM SAMPLING
Convenient and relatively easy to administer.
Population elements are an ordered sequence (at least conceptually)
The first sample element is selected randomly from the first k population elements.
Thereafter, sample elements are selected at a constant interval, k, from the ordered sequence frame.
k = N/n
n = sample size
N = population size
k = size of selection interval
Advantages/Disadvantages of systematic sampling
Ad: Convenience
Speed
Evenly distributed sampling across frame.
Dis: It is biased if the samples are ranked.
CLUSTER (AREA) SAMPLING
Population is divided into non-overlapping clusters or areas
Each cluster is a miniature, or microcosm, of the population.
A subset of the clusters is selected randomly for the sample
If the number of elements in the subset of clusters is larger than the desired value of n, these clusters may be subdivided to form a new set of clusters and subjected to a random selection process.