Statistics 1.1 - Data collection Flashcards
What is a population?
The whole set of items that are of interest.
What is a census?
A census is a form of data collection that observes or measures every member of a population.
What is a sample?
A selection of observations taken from a subset of the population which is used to find out information about the population as a whole.
What is/are the advantage/s of a census?
It should give a completely accurate result.
What is/are the advantage/s of a sample?
It is less time consuming and expensive than a census, fewer people have to respond and as such there is less data to process than in a census.
What is/are the disadvantage/s of a census?
It is time-consuming and expensive, it cannot be used when the testing process destroys the item and it is hard to process such a large quantity of data.
What effects does sample size have on a sample?
The size of the sample depends on the required accuracy and available resources. Generally, the larger the sample, the more accurate it is, but the higher the resource requirement. If the population is very varied, a larger sample is needed that a uniform population to account for the variation. As such different samples can lead to different conclusions.
What is a sampling unit?
A sampling unit is an individual unit of a population, they are often named or numbered to form a list known as a sampling frame.
Describe random sampling.
In random sampling, every member of the population has an equal chance of being selected. The sample should therefore be representative of the population. Random sampling also helps to remove bias from a sample.
There are three main methods of random sampling:
Simple random sampling
Systematic sampling
Stratified sampling
Describe simple random sampling.
A simple random sample of size n is one where every sample of size n has an equal chance of being selected.
To carry out a simple random sample, a sampling frame is needed, usually a list of people or things. Each person or thing is allocated a unique number and a selection of these numbers is chosen at random.
There are two methods of choosing the numbers: generating random numbers (using of a calculator, computer or random number table) or lottery sampling.
In lottery sampling, the members of the sampling frame could be written on tickets and placed into a ‘hat’. The required number of tickets would then be drawn out.
Describe systematic sampling.
In systematic sampling, the required elements are chosen at regular intervals from an ordered list.
Describe stratified sampling.
In stratified sampling, the population is divided into mutually exclusive strata and a random sample is taken of each.
The proportion each strata sampled should be the same. A simple formula can be used to calculate the number of units that should be sampled from each stratum;
The number stratified in a stratum=(number in stratum/number in population)×overall sample size
What is/are the advantage/s of simple random sampling?
It is free of bias, easy and cheap to implement for small population and small samples and each sampling unit has a known and equal chance of selection.
What is/are the disadvantage/s of simple random sampling?
It isn’t suitable when the population size or the sample size is large. It also requires a sampling frame.
What is/are the advantage/s of systematic sampling?
It is simple and quick to use and suitable for large samples and large populations. It also garuntees proportional representation of groups within a population.