Ch.3 Representations of data Flashcards
what is the formula for an outlier?
a value greater than Q³ + k(Q³ - Q¹)
or
a value less than Q¹ - k(Q³ - Q1)
how would you work out if something is an outlier?
if the value is greater than the upper quartile plus (k times IQR) or lesser than the lower quartile minus (k times IQR)
what is an outlier?
an extreme value
what is the sign for much lesser than?
«
what is the sign for much greater than?
> >
what is an anomaly?
a clearly extreme outlier
what is the process of removing anomalies from a data set called?
cleaning the data
how is an outlier plotted on a box plot?
as an x on the spot
how would you calculate the height of each bar (frequency density) in a histogram?
area of bar = k x frequency
how is a frequency polygon made?
joining the middle of the top of each bar of a histogram
when comparing data sets what can you comment on?
a measure of location: median and IQR
or
a measure of spread: mean and standard deviation