Applied Year 1 Flashcards
what is a population
a whole set of items that are of interest
what is a sample
some subset of the population intended to represent the population
what is a sampling unit
each individual thingin the population is known as a sampling unit
what is a sampling frame
when sampling units are individually named or numbered to form a list is called the sampling frame.
advantages of a census
- completely accurate results
disadvantages of a census
-time consuming and expensive
-cannot be used if involves destruction
-a large volume of data to process
advantages of a sample
- cheap
-quick
-less data to process
disadvantages of a sample.
-may not be accurate
-data may not be large enough to represent small subgroups of a population.
what are the different types of random sampling
- simple random sampling
-stratified sampling
-systematic sampling
why we use random Sampling
we use it when we want each sampling unit in our sampling frame to have an equal chance of being chosen, to avoid bias.
Simple Random Sampling (what, how +advantages/disadvantages)
what? every sampling unit in the sampling frame has an equal chance of being selected.
how? Each item in the sampling frame has an identifying number. use a random number generator, or lottery sampling to pick the item
advantages
- bias-free
- easy and cheap to implement
- equal chance of each item to be selected.
disadvantages
-not suitable when the population size is large
- sampling frame needed.
Systematic Random Sampling (what, how +advantages/disadvantages)
what? required elements are chosen at regular intervals in an ordered list.
how? i.e. take every k^th elements where k=pop sixe (N)/ sample size (N) starting at a random item between 1 and k
advantages:
- simple and quick to use
- suitable for large samples /populations
disadvantages
-sampling frame needed
-can introduce bias if sampling frame is not random
Stratified Random Sampling (what, how +advantages/disadvantages)
What? A population is divided into groups (strata), and a simple random sample carried out in each group
is used when the sample is large, and the population naturally divides into groups
How is? the same proportion
of samp size (n)/ pop size (N) sampled from each stratum
Advantages:
-reflects population structure
-guarantees proportional representation of groups within the population
Disadvantages:
- population must be classified into distinct strata
- selection within each stratum has the same disadvantages as simple random sampling.
what are the two types of non-random sampling
- Quota Sampling
-opportunity/ convenience sampling
Quota sampling
quota: a fixed share/ number of something
What? The population is divided into groups according to characteristics. A quota of items/people in each group is set to try to reflect the group’s proportion of the whole population.
How? The interviewer selects the actual sampling units.
advantages
- allows the small sample to still be representative of the population
- no sampling frame required
- quick, easy, inexpensive
-allows for easy comparison between different groups in the population
disadvantages
- non-random sampling can introduce bias
-population must be divided into groups, which can be costly or inaccurate.
- increasing scope of study increases the number of groups, adding time/expense
-non-responses are not recorded.
Opportunity/ Convenience Sampling
What? Sample taken from people who are available at the time of study, who meet criteria
how? The interviewer selects the actual sampling units according to the set criteria.
advantages
- easy to carry out
-inexpensive
disadvantages
-unlikely to produce a representative sample
-highly dependent on individual researchers.
qualitative/ categorical data
non-numerical values
quantitative data
numerical values
Quantitative data can be two things.
discrete: can only take specific values, e.g shoe size, number of children
Continuous: can take any decimal value (possible with a specified range)
Large Data Set (LDS)
Edexcel’s data set concerns weather data from a number of weather stations.
Remember
-Camborne
-Hern
-Heathrow
-Leeming
-Leuchars
International:
Jacksonville
Beijing
Perth
total rainfall
tr/trace means less than 0.05 mm.
meaning allowed to use 0 in calcluations
mean windspeed
kn/knot is a ‘nautical mile per hour’ 1kn =1.15mph
mean cloud cover
oktas means the number of 1/8ths of the sky covered.
range of values for each location
UK
temp range:
-Camborne: 10-20
-Heathrow:8-29
-Leeming:4-23
-Leuchars: 4-19
-Beijing: 8-33
-Jacksonville: 15-31
-Perth: 8-25
Wind speed:
-Camborne: 3-18
-heathrow:3-19
-hurn: 2-19
-Leeming: 3-17
-launchers: 3-23
-Beijing: 2-9
-Jacksonville: 1-12
-Perth: 4-14
measures of location
single values which describe a position in a data set
eg. mean, median, mode