DATA COLLECTION Flashcards
Population
The whole set of items that are of interest
Census
Measuring every item of a population
Sample
Selection of observations taken from a portion of a population
Sampling unit
Each item that can be sampled
Explain: Simple Random Sampling
Every unit in the sampling frame has an equal chance of being selected. Use a number generator, or pull names from a hat, etc.
Pros and Cons: Simple Random Sampling
Pros: Bias free, easy and cheap. Cons: Not suitable for large populations
Explain: Systematic Sampling
Required elements are selected at regular intervals in an ordered list. Take every kth element where k=(Pop. size)/(Sample size) or N/n.
Pros and Cons: Systematic Sampling
Pros: Easy, suitable for large sample sizes. Cons: Sampling frame needed, can introduce bias.
Explain: Stratified Sampling
Sample size divided into groups/strata and then divided by the proportion needed overall (e.g 20% of 1000 units would be split into x amount of units, then 20% of the units would be observed)
Pros and Cons: Stratified Sampling
Pros: Good for large populations, gives an unbiased and fair view as proportional representation is guaranteed. Cons: dividing into groups can be difficult.
Explain: Quota Sampling
Sample size divided into groups based on characteristic, then chosen through deciding which units fit the quota the best. An interviewer selects the sampling units.
Pros and Cons: Quota Sampling
Pros: Allows a small sample to be representative of the population, can be inexpensive, allows for easy intercomparisons. Cons: When the sample size is larger, cost and time expense are added, can introduce bias, sorting into groups can cause bias and room for error.
Explain: Opportunity Sampling
Sampling units selected by interviewer at time of observance. Units all fit the criteria
Pros and Cons: Opportunity Sampling
Pros: Inexpensive, convenient, fast. Cons: Unlikely to provide a valid study, can be exposed to many forms of bias
Name the three methods of random sampling
Simple random, Stratified, Systematic
Name the two methods of non random sampling
Quota, Opportunity
A lake contains three different types of carp.
There are an estimated 450 mirror carp, 300 leather carp and 850 common carp.
Tim wishes to investigate the health of the fish in the lake.
He decides to take a sample of 160 fish.
Give a reason why stratified random sampling cannot be used.
A sampling frame is needed, and since it is impossible to find the exact population of fish in the lake, the population of fish cannot be divided into strata.
Helen is studying one of the qualitative variables from the large data set for Heathrow from
2015.
She started with the data from 3rd May and then took every 10th reading.
There were only 3 different outcomes with the following frequencies
State the sampling technique Helen used.
Systematic sampling
Name the four types of data values.
Qualitative, quantitative, continuous, discrete.
What is a qualitative value?
A value that cannot be described with a number. E.G: Hair colour, which can be brown, blonde, red.
What is a quantitative value?
A value that can be described by a number. E.G: Height, which can be 150cm, 156cm, 173cm, 185cm.
What is a discrete variable?
A number that can only have a specific value. For example, you can only have a certain number of children; a family cannot have 4.5 children.
What is a continuous variable?
A number that can have any value. For example, time: it can take 2.45 seconds to carry out a task, or 6.004, or 3677.6, or 0.00695, or 8.
State the names of the five domestic stations in the Large Data Set.
Leeming, Leuchars, Camborne, Hurn, Heathrow
State the names of the three foreign weather stations in the Large Data Set.
Beijing, Jacksonville, Perth.
What three domestic stations in the LDS are on the coast?
Camborne, Hurn, Leuchars
Units: Rainfall
millimeters. tr/trace means less than 0.05 mm.
Units: Mean windspeed
knots. 1kn=1.15mph
Units: total sunshine
1/10 of an hour
Units: maximum gust
knots. 1kn=1.15mph
Units: humidity
% of air saturated with water
Units: mean visibility
decametres Dm of visibility
Units: cloud cover
Oktas, meaning 1/8 of the sky covered in cloud
Units: pressure
Hectopascal (hPa)
What time periods were the LDS recorded within?
- May-October 1987. 2. May-October 2015
What dates were the two Florida hurricanes?
- 12 October 1987. 2. 1-2 October 2015
When was the UK Great Storm
15-16 October 1987