Data management pt. 2 Flashcards
Statistical measure that represents the entire distribution or a data set using a single value
Measures of Central Tendency
sum of all observations/total number (average)
Mean
mean that takes into consideration each of the item value without regard to their importance
Simple Mean
takes in consideration the proper weights assigned to the observed values according to their relative importance
Weighted Mean
the midpoint of the distribution, sometimes denoted by Me or Md
Median
number that occurs the most
Mode
indicates the extent to which values in a distribution are spread around the central tendency
Low variability = consistent and accurate
High variability = less consistent
Measures of Variation
division of observations into 4 defined intervals based on the values of the data and how they compare to the entire set of observation
Quartiles
measures the variability around the median
it is the distance between the first and third quartiles
Interquartile range
visual summaries of a data set
represents the quartiles and the lowest and highest observations
Box and Whisker Plot
ave of the square of the distance each value is from the mean
Variance
square root of the variance, it is also the measure of how spread out the data is — distance to mean
Standard Deviation