Chapter 4 Flashcards
range of data set
largest to smallest observation
sample variance
s^2, sum of squared deviations from mean divided by n-1 and found w/ (refer to formula)
standard deviation
s, positive square root of variance
quartile
the three points that divide the data set into four equal groups, each group comprising a quarter of the data
using random number generator
assign each person a number from 001 to 200, use RNG to pick X amount of people to use in sample. IMPORTANT: you may not repeat numbers!!
why and how do we trim the mean of a sample?
a trimmed mean is computed by first ordering the data values from smallest to largest, deleting a selected # of values from each end & finally averaging remaining values. used when there are outliers or highly skewed date. trimming % is % of values deleted from each end
boxplot
method of summarizing data that gives details to center and spread without being heavily influenced by outliers
lower quartile
median of lower half of sample
upper quartile
median of upper half of sample
interquartile range (IQR)
upper quartile - lower quartile
outlier
when observation is more than 1.5 IQR away from nearest quartile = mild. it’s extreme if more than 3 IQR from nearest end of box
how to construct box plot
- ) draw X axis (# line)
- ) rectangle - left edge at Lower Quartile, right edge at Upper Quartile. the width is the IQR.
- ) draw line inside at the median
- ) whiskers from each end to smallest and largest observation in data
how to modify box plot
- ) determine any mild/extreme outliers
- ) whiskers extend to most extreme observation that isn’t an outlier
- ) solid circle for mild outlier
- ) open circle for extreme outlier