Statistics 3 Flashcards
Normal distributions X and Y:
P(X > Y) ?
Assuming X and Y are independent: X - Y ~ N(μ_X - μ_Y, σ_X^2 + σ_Y^2)
P(X > Y) = P(X - Y > 0)
Z = (μ_X - μ_Y)/sqrt(σ_X^2 + σ_Y^2)
Φ(Z) is your answer
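A minimal sketch of this calculation in Python, assuming X and Y are independent. The function name and the example numbers are made up for illustration; `NormalDist` comes from the standard library.

```python
from statistics import NormalDist

def prob_x_greater_than_y(mu_x, sigma_x, mu_y, sigma_y):
    """P(X > Y) for independent normal X and Y."""
    # X - Y is normal with mean mu_x - mu_y and variance sigma_x^2 + sigma_y^2
    z = (mu_x - mu_y) / (sigma_x**2 + sigma_y**2) ** 0.5
    # Phi(z): the standard normal cumulative distribution function
    return NormalDist().cdf(z)

# Hypothetical example: X ~ N(50, 3^2), Y ~ N(46, 4^2), so z = 4/5 = 0.8
p = prob_x_greater_than_y(50, 3, 46, 4)
print(round(p, 4))
```

If the two means are equal, z = 0 and the answer is 0.5, which is a quick sanity check.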
Degrees of Freedom (calculating expected values from table)
Normally:
v = (number of cells after combining) - (number of constraints)
Number of constraints is usually 1 (the expected total must match the observed total), but every time a parameter is estimated from the sample, one more degree of freedom is used up. If the estimate of a parameter is CALCULATED from the sample then it IS a restriction. If a parameter is GUESSED by using a value that seems sensible from observations then it IS NOT a restriction. E.g. having to calculate p from a sample to test a binomial distribution B(n, p) is a restriction; merely guessing a value for p is not a restriction.
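The counting rule above can be sketched as a small Python helper. The function and parameter names are hypothetical, and it assumes the only constraints are the matching totals plus one per parameter calculated from the sample.

```python
def degrees_of_freedom(cells_after_combining, estimated_parameters=0):
    """Degrees of freedom for a goodness-of-fit test."""
    # One constraint always applies: expected total = observed total.
    # Each parameter CALCULATED from the sample adds one more constraint.
    constraints = 1 + estimated_parameters
    return cells_after_combining - constraints

# Hypothetical binomial test with 6 cells after combining:
print(degrees_of_freedom(6, estimated_parameters=1))  # p calculated from the sample
print(degrees_of_freedom(6, estimated_parameters=0))  # p guessed
```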
For a contingency table:
v = (rows -1)*(columns-1)
What the hell is a contingency table?
It’s got 2 “criteria” - one across the columns and one in the rows
e.g. The school a pupil attends (rows)
The grades the pupils get (columns)
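A short Python sketch of how the expected values and degrees of freedom could be worked out for a contingency table. It uses the standard rule expected = (row total × column total) / grand total; the function names and the observed counts are invented for illustration.

```python
def expected_frequencies(observed):
    """Expected count for each cell: row total * column total / grand total."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]

def contingency_dof(observed):
    """v = (rows - 1) * (columns - 1)."""
    return (len(observed) - 1) * (len(observed[0]) - 1)

# Hypothetical table: 2 schools (rows) by 3 grade bands (columns)
observed = [[20, 30, 10],
            [30, 20, 10]]
print(expected_frequencies(observed))
print(contingency_dof(observed))
```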
Define population
Whole set of items that are of interest
What is a census?
A census observes or measures every member of a population
What is a sample?
A sample is a selection of observations taken from a subset of the population, which is used to find out information about the population as a whole. An investigation carried out this way is known as a SAMPLE SURVEY.
What is the key property of a random sample?
In a random sample, every possible sample of size n has an equal chance of being selected.
What is a Sampling Frame?
A sampling frame is a list identifying every single sampling unit that could be included in the sample.
Random number sampling?
In random number sampling, each element is given a number to identify it and the numbers of the required elements are selected by using random number tables or other random number generators.
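As a rough illustration, random number sampling might look like this in Python, with the standard library's generator standing in for random number tables. The function name, the `seed` parameter and the pupil list are assumptions made for the example.

```python
import random

def random_number_sample(sampling_frame, n, seed=None):
    """Number every unit in the frame, then draw n distinct numbers at random."""
    rng = random.Random(seed)
    # Give each element an identifying number: 1, 2, 3, ...
    numbered = dict(enumerate(sampling_frame, start=1))
    # Sampling without replacement, so each unit is included at most once
    chosen_numbers = rng.sample(sorted(numbered), n)
    return [numbered[k] for k in chosen_numbers]

# Hypothetical sampling frame of 50 pupils
frame = [f"pupil_{i}" for i in range(1, 51)]
print(random_number_sample(frame, 5, seed=1))
```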
Systematic Sampling?
In systematic sampling the required elements are chosen at regular intervals from an ordered list.
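One common way to do this, sketched in Python: pick a random starting point in the first interval, then take every k-th element. The function name and the choice of interval k = (list length) // n are assumptions for the example, and it assumes n is no larger than the list.

```python
import random

def systematic_sample(ordered_list, n, seed=None):
    """Take n elements at regular intervals from an ordered list."""
    k = len(ordered_list) // n                # sampling interval
    start = random.Random(seed).randrange(k)  # random start within the first interval
    return [ordered_list[start + i * k] for i in range(n)]

# Hypothetical ordered list of 100 units, sample of 10: every 10th unit
units = list(range(1, 101))
print(systematic_sample(units, 10, seed=2))
```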
Stratified Sampling?
In stratified sampling the population is divided into mutually exclusive strata and a simple random sample is taken from each. The proportion of units sampled from each stratum is the same as the proportion of that stratum in the total population.
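A minimal Python sketch of proportional stratified sampling. The function name and the year-group strata are hypothetical; note that rounding the stratum sample sizes can make the total differ slightly from n when the proportions do not divide evenly.

```python
import random

def stratified_sample(strata, n, seed=None):
    """Simple random sample from each stratum, sized in proportion to the stratum."""
    rng = random.Random(seed)
    population_size = sum(len(units) for units in strata.values())
    sample = {}
    for name, units in strata.items():
        # Stratum sample size = n * (stratum size / population size), rounded
        size = round(n * len(units) / population_size)
        sample[name] = rng.sample(units, size)
    return sample

# Hypothetical population of 100 split into strata of 60, 30 and 10
strata = {
    "year_7": [f"y7_{i}" for i in range(60)],
    "year_8": [f"y8_{i}" for i in range(30)],
    "year_9": [f"y9_{i}" for i in range(10)],
}
result = stratified_sample(strata, 20, seed=3)
print({name: len(units) for name, units in result.items()})
```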
Quota Sampling?
In quota sampling the population is divided into groups in terms of gender, social class etc. The number of people in each group is set to try and reflect the group’s proportion in the whole population. But it is the interviewer who selects the actual sampling units.
Advantages/Disadvantages of random sampling
RANDOM NUMBER SAMPLING:
ADVANTAGES:
•Numbers are truly random and free from bias
•Each number has a known equal chance of selection
•It is easy to use
•Standard formulae can be used to analyse the results
•Each person or unit is included only once
DISADVANTAGES:
•It is not suitable where the population size is large
The same is true of LOTTERY SAMPLING. ADVANTAGES: tickets are drawn at random; it is an easy process to use; each ticket has a known chance of selection. DISADVANTAGES: it is not suitable where the population size is large; a sampling frame is needed.
SIMPLE RANDOM SAMPLING (in general):
ADVANTAGES (provided population is small):
•It is cheap to do
•It is easy to do
•Standard formulae can be used to analyse results
•Each person/unit is only included once
DISADVANTAGES:
•It is not suitable where the population size is large
•A sampling frame is required.
Advantages/Disadvantages of Systematic Sampling
ADVANTAGES:
•It is simple to use
•It is suitable for large samples
DISADVANTAGES:
•It is only random if the ordered list is truly random
•It can introduce bias.
Advantages/Disadvantages of stratified sampling
ADVANTAGES:
•It can give more accurate estimates than simple random sampling where there are clear strata present
•It reflects the population structure
DISADVANTAGES:
•Within the strata, the problems are the same as for any simple random sample
•If the strata are not clearly defined they may overlap