Intro Descriptive Stats/Data Sum Flashcards
Measures of center and location
mean median mode weighted mean
measures of variation
range IQR variance standard deviation
negative of mean
affected by extreme outliers
pth percentile in data array means
p% are less than or equal to this value
quartiles do what to the data
split ranked data into 4 equal groups
formula to find path percentile in ordered array
p/100 (n+1) = position
disadvantages of range
ignores way in which data distributed
sensitive to outliers
advantage of IQR over range
not sensitive to outliers, provides sense of typicality (range of middle 50% of values)
variance
average of squared deviations of values form the mean
sample variance:
sum of diffs between each observation and mean squared and divided by samples - 1
difference in formula b/w sample variance and pop variance
population divide by N
standard deviation shows
variation about the mean
whiskers of box and whisker plots extend to
1.5 IQRs from 1st and 3rd queartile
scaled data types
values assigned by measurement (potassium) or contain (number of kids)
categorical data type
values assigned buy classification (blood type)