Statistics: Chapter 1 - Data Collection Flashcards
What is a Population
A population consists of all the items we are interested in.
(Note, there aren’t always people)
What is a Sample
A sample is a subset of items chosen from a population.
What is a Sampling Unit
Each individual item in the population that can be sampled is known as a sampling unit.
What is a Sampling Frame
Often sampling units of a population are individually named or numbered to form a list called the sampling frame.
Advantages and Disadvantages of a Census
Advantages:
- Should give completely accurate result.
Disadvantages:
- Time consuming and expensive
- Can not be used when testing involves destruction
- Large volume of data to process
Advantages and Disadvantages of a Sample
Advantages:
- Less expensive
- Less time consuming
- Less data to process
Disadvantages:
- Data may not be accurate
- Sample mag not be large enough to represent small sun-groups of the population
What is Random Sampling
Where each thing in your sample frame has an equal chance of being chosen.
Method of how to do Simple Random Sampling
To carry out a simple random sample, size n, from a population size N, firstly you need a sampling frame.
Each item is assigned a different number from 1 to N. Use a random number generator to select ‘n’ unique numbers or ‘lottery sampling’ (names in a hat).
Choose the items corresponding to these numbers to form the sample.
Advantages of Simple Random Sampling
- Bias free
- Easy and cheap to implement for small populations and samples
- Each sampling unit has a known equal chance of being selected
Disadvantages of Simple Random Sampling
- Not suitable when population size is large
- Sample may not accurately reflect the population
- A sampling frame is needed
Method of how to do Systematic Sampling
In systematic sampling, the required elements are chosen at regular intervals from an ordered list.
To carry out a systematic sample, size n, from a population, size N, you need a sampling frame.
Each item is assigned a different number from 1 to N. Starting at a random between 1 and k, take every k^th elements to form the sample k = pop size(N) / samp size (n)
Advantages of Systematic Sampling
- Simple and quick to use
- Suitable for large samples and populations
Disadvantages of Systematic Sampling
- Can introduce bias if sampling frame is small and not random as patterns can be picked up in the data
- A sampling frame is needed
Method of how to do Stratified Sampling
Population divided into groups (strata) and a simple random sample carried out in each group.
To carry out a stratified sample, size n, from population size N, you need a sampling frame and distinct strata. The same proportion n/N is to be sampled from each strata.
Within each strata, each item is assigned a different number and a random number generator is used to select the number of unique numbers required.
Choose the items corresponding to these numbers to form the sample.
Advantages of Stratified Sampling
- Sample accurately reflects population structure
- Guarantees proportional representation of groups within population