Statistics - Data collection Flashcards
What is a population?
The whole set of items that are of interest
What is a census?
A survey that observes/measures every member of a population
What is a sample?
A selection of observations taken from a subset of a population which is used to find out info about the whole population
What is the advantage of a census?
Should give a completely accurate result
What are the disadvantages of a census?
Time consuming & expensive
Hard to process large quantities of data
Cannot be used when testing process destroys item
What are the advantages of taking a sample?
Less time consuming & expensive than a census
Fewer people have to respond
Less data to process
What are the disadvantages of taking a sample?
Data may not be accurate
Sample maybe not large enough to represent all small sub-groups in the population
What happens as the sample size increase?
The more accurate it is
More representative of the sample
What is the individual units of a population called?
Sampling units
What are done to sampling units in order to distinguish them?
They are individually named or numbered to form a list
What is a statistic?
A value taken from a single sample
What is a sampling frame?
A list of the sample units
What happens in random sampling?
Each member of the population has an equal chance of being selected
What are the advantages of using random sampling?
Representative of the population
Removes bias
What are the three methods of random sampling?
Simple random sampling
Systematic sampling
Stratified sampling
How can you perform simple random sampling?
Number each sampling unit
Random number generator or numbers put into a “hat” and chosen at random
What are the advantages of simple random sampling?
Free of bias
Easy and cheap for small populations/samples
Each sampling unit has a known/equal chance of selection
What are the disadvantages of simple random sampling?
Not suitable when population/sample size is large
Sampling frame is needed
What is systematic sampling?
Required elements are chosen at regular intervals from an ordered list
E.g Data taken every nth value
What are the advantages of systematic sampling?
Simple and quick to use
Suitable for large samples/populations
What are the disadvantages of systematic sampling?
Sampling frame is needed
Can introduce bias if sampling frame is not random
What is stratified sampling?
Population is divided into mutually exclusive strata, and a random sample is taken from each
Strata example - male & female
What rules should be followed for obtaining strata?
Proportion of each strata should be the same
What is the formula to calculate the number of people should be sampled from each strata?
Number sampled in strata = (number in strata / number in population) x overall sample size
What are the advantages of stratified sampling?
Sample accurately reflects the population structure
Guarantees proportional representation of groups within a population
What are the disadvantages of stratified sampling?
Population must be clearly classified into distinct strata
Selection within each stratum has disadvantages of simple random sampling