AS Stats Flashcards
define population
the whole set of items that are of interest
define census
observes or measures every member of a population
what is the advantage of using a census?
it should give a completely accurate result
what are the disadvantages of using a census?
time-consuming & expensive
cannot be used if the testing process destroys the item
difficult to process a large quantity of data
define sample
a selection of observations taken from a subset of the population, which is used to find out information about the whole population
what are the advantages of using a sample?
less time-consuming & expensive than a census
fewer people have to respond
less data to process than a census
what are the disadvantages of using a sample?
data might not be as accurate
sample might not be large enough to give info about small subsets of the population
define sampling units
individual units of a population
define sampling frame
a list of individually named or numbered sampling units of a population
(how does sample size affect the validity of the conclusions?)
sample size depends on required accuracy & resources
larger samples generally give more accurate results
a varied population requires a larger sample than a uniform population
different samples produce differing results due to natural variation within populations
what are the 3 types of random sampling?
simple random
systematic
stratified
define simple random sampling
every sample of size n has an equal chance of being selected
need a sampling frame
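A minimal sketch of simple random sampling, assuming a made-up sampling frame of 50 numbered units. The function name and the seed parameter are illustrative only; Python's random.sample draws without replacement, so every subset of size n is equally likely.

```python
import random

# Hypothetical sampling frame: 50 numbered sampling units.
sampling_frame = list(range(1, 51))

def simple_random_sample(frame, n, seed=None):
    """Pick n distinct units so that every sample of size n is equally likely."""
    rng = random.Random(seed)  # seed is only here to make the example repeatable
    return rng.sample(frame, n)

sample = simple_random_sample(sampling_frame, 5, seed=1)
print(sample)
```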
what are advantages of simple random sampling?
no bias
easy & cheap for small sample
each sampling unit has a known & equal chance of selection
what are disadvantages of simple random sampling?
not suitable for a large sample or population because it is time-consuming, disruptive & expensive
need sampling frame
define systematic (random) sampling
the required elements are chosen at regular intervals from an ordered list
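A rough sketch of systematic sampling, assuming an ordered frame of 100 numbered units and a random starting point within the first interval (a common convention, not stated on the card). The names below are hypothetical.

```python
import random

def systematic_sample(frame, n, seed=None):
    """Take every k-th unit from an ordered list, starting at a random point."""
    k = len(frame) // n            # sampling interval
    rng = random.Random(seed)
    start = rng.randrange(k)       # random start among the first k units
    return [frame[start + i * k] for i in range(n)]

# Hypothetical ordered sampling frame of 100 units.
frame = list(range(1, 101))
sample = systematic_sample(frame, 10, seed=0)
print(sample)  # 10 units, each 10 places apart in the list
```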
what are advantages of systematic sampling?
simple & quick to use
suitable for large samples/populations
what are disadvantages of systematic sampling?
need sampling frame
can be biased if sampling frame is not random
define stratified (random) sampling
population is divided into mutually exclusive strata & a random sample is taken from each
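A possible sketch of stratified sampling with proportional allocation, i.e. number sampled from a stratum = (stratum size / population size) x sample size. The strata, their sizes and the function name are invented for illustration.

```python
import random

def stratified_sample(strata, n, seed=None):
    """Take a simple random sample from each stratum, in proportion to its size."""
    rng = random.Random(seed)
    total = sum(len(units) for units in strata.values())
    sample = {}
    for name, units in strata.items():
        # Proportional allocation; rounding may need adjusting by hand
        # when the shares are not whole numbers.
        size = round(len(units) * n / total)
        sample[name] = rng.sample(units, size)
    return sample

# Hypothetical population split into two mutually exclusive strata.
strata = {
    "year 12": [f"Y12-{i}" for i in range(60)],
    "year 13": [f"Y13-{i}" for i in range(40)],
}
sample = stratified_sample(strata, 10, seed=0)
print({name: len(units) for name, units in sample.items()})
```

With 60 and 40 units and a sample of 10, the strata contribute 6 and 4 units respectively.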
what are advantages of stratified sampling?
sample accurately reflects the population structure
guarantees proportional representation of groups within the population
what are disadvantages of stratified sampling?
population must be clearly classified into distinct strata
selection within each stratum is random, so it has the same disadvantages as simple random sampling
what are the 2 types of non-random sampling?
quota
opportunity
define quota sampling
researcher selects a sample that reflects the characteristics of the whole population
what are advantages of quota sampling?
allows a small sample to be representative of the population
no sampling frame needed
quick, easy & cheap
easy comparison b/w different groups within population
what are disadvantages of quota sampling?
non-random can introduce bias
population must be divided into groups, which can be expensive or inaccurate
increasing the scope of the study increases the # of groups, which increases time & cost
non-responses not recorded
define opportunity/convenience sampling
take sample from people available at the time of study & who fit the criteria
what are advantages of opportunity sampling?
easy
cheap
what are disadvantages of opportunity sampling?
unlikely to be representative of the population
dependent on individual researcher
define quantitative variables
variables/data associated with numerical observations
define qualitative variables
variables/data associated with non-numerical observations
define continuous variable
can take any value within a given range
define discrete variable
can take only specific values within a given range
grouped frequency table
data is grouped into classes
class boundaries show max. & min. values in each class
midpoint is the average of the lower & upper class boundaries
class width is the difference b/w the upper & lower class boundaries
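A small illustration of midpoint and class width, assuming a made-up class with boundaries 10 and 20.

```python
def midpoint(lower, upper):
    """Average of the lower and upper class boundaries."""
    return (lower + upper) / 2

def class_width(lower, upper):
    """Difference between the upper and lower class boundaries."""
    return upper - lower

# Hypothetical class with boundaries 10 and 20.
print(midpoint(10, 20))     # 15.0
print(class_width(10, 20))  # 10
```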
when is it best to use mean, median or mode?
mean: quantitative data with no extreme values
median: quantitative data with extreme values
mode: qualitative or quantitative data with 1 or 2 modes
what is the formula for the mean & for mean of data in frequency table?
Σx / n
n = Σf
Σxf / Σf
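A worked sketch of Σxf / Σf on an invented frequency table (the values and frequencies are not from the deck).

```python
# Hypothetical frequency table: value x occurs f times.
values = [0, 1, 2, 3]
freqs = [4, 7, 6, 3]

n = sum(freqs)                                      # n = Σf = 20
sum_xf = sum(x * f for x, f in zip(values, freqs))  # Σxf = 0 + 7 + 12 + 9 = 28
mean = sum_xf / n                                   # Σxf / Σf

print(n, sum_xf, mean)  # 20 28 1.4
```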
how do you calculate median from frequency table?
arrange data points in ascending order
add 1 to the # of data points (n = Σf), then divide by 2
the median is the (n + 1)/2 th value, i.e. the (Σf/2 + 0.5)th value
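One way to sketch the (n + 1)/2 th value rule for an ungrouped frequency table. The table is invented, and averaging the two middle values when n is even is the usual convention assumed here.

```python
def median_from_table(values, freqs):
    """Median of discrete data given as a frequency table."""
    # Rebuild the ordered list of data points from the table.
    data = []
    for x, f in sorted(zip(values, freqs)):
        data.extend([x] * f)
    n = len(data)  # n = Σf
    if n % 2 == 1:
        # (n + 1)/2 is a whole number: take that value (1-based position).
        return data[(n + 1) // 2 - 1]
    # (n + 1)/2 ends in .5: average the two values either side.
    return (data[n // 2 - 1] + data[n // 2]) / 2

# Hypothetical table: n = 20, so the median is the 10.5th value.
print(median_from_table([0, 1, 2, 3], [4, 7, 6, 3]))  # 1.0
```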
how do you calculate the mode from frequency table?
x value with the highest frequency
value that appears the most
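A short sketch of reading the mode off a frequency table, using an invented table. If two values tie for the highest frequency, this version only reports the first one.

```python
# Hypothetical frequency table: value x occurs f times.
values = [0, 1, 2, 3]
freqs = [4, 7, 6, 3]

# Index of the highest frequency, then the x value at that index.
highest = max(range(len(freqs)), key=lambda i: freqs[i])
mode = values[highest]

print(mode)  # 1
```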
how do you calculate mean, median & mode from grouped frequency table?
mean: Σ(midpoint x f) / Σf
median: linear interpolation
or find the Σf/2 th value & identify which class it falls in (the median class)
mode: class with the highest f
linear interpolation if specified
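A sketch of the grouped mean and of the median by linear interpolation, assuming continuous data, the Σf/2 th value as the target, and values spread evenly within each class. The classes and frequencies are invented.

```python
def grouped_mean(classes, freqs):
    """Estimate the mean as Σ(midpoint x f) / Σf."""
    total = sum(freqs)
    return sum((lo + hi) / 2 * f for (lo, hi), f in zip(classes, freqs)) / total

def grouped_median(classes, freqs):
    """Estimate the median by linear interpolation within the median class."""
    target = sum(freqs) / 2          # position of the median, Σf / 2
    cumulative = 0                   # frequency below the current class
    for (lo, hi), f in zip(classes, freqs):
        if f > 0 and cumulative + f >= target:
            # Move through the class in proportion to how far the target lies in it.
            return lo + (target - cumulative) / f * (hi - lo)
        cumulative += f
    raise ValueError("frequency table is empty")

# Hypothetical grouped table: (lower boundary, upper boundary) and frequency.
classes = [(0, 10), (10, 20), (20, 30), (30, 40)]
freqs = [5, 8, 12, 5]

print(grouped_mean(classes, freqs))    # 620 / 30, about 20.67
print(grouped_median(classes, freqs))  # 20 + (15 - 13) / 12 * 10, about 21.67
```

Here the 15th value falls in the 20 to 30 class, which has 13 values below it and a frequency of 12.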
what are the other measures of location?
Q1 - lower quartile (25% of the data lies below it)
Q2 - median (50% of the data lies below it)
Q3 - upper quartile (75% of the data lies below it)
P10 - 10th percentile (10% of the data lies below it)