module 3 Flashcards
variable
- any measurable characteristic of an observational unit
- it can vary
what are the three pieces of info for variables
- what the variable represents
- the measurement unit
- a description of the observation unit
data for plural
datum
central tendency
- typical value in your sample (mean)
- 2nd quartile
dispersion
- spread of values (variance)
how are categorical data catergorized
- counts (# of sampling units in each category)
- proportions (share of total sampling units in each category)
______ and _______ indicate the central tendency of categorical data wheras _____ is used to indicate dispersion
counts and proportions, range
what symbol is mean often given
Y with a line over top
n=?
total number of observations/samples
formula for mean
Y=variables/n
how to calculate the variance
- calculate the mean
- calculate the difference between each data point and the mean, then square that answer
- sum the squares of the difference and divide by the number of observations/data points
-o^2=1/n(Yi-Y)^2 - each variable is subtracted by the mean the squared then divided by how many variables there are, then square rooted lol
by dividing by the number of observations, we are calculating what is known as the ___________
population variance
population vs sample variance
sample variance’s equation’s denominator is n-1 not just n
steps to calculate quartiles
- sort the data from lowest to highest
- find the 2nd quartile by splitting data in half (if even number the 2nd quartile is the mean of the two middle numbers)
- find the first quartile (create subset of data, split bottom in half)
- find the 3rd quartile by repeating 3 but for upper half
calculate interquartile range
-subtract 1st quartile from the 3rd quartile