Lecture 4 Flashcards
Percentile
For any set of n measurements arranged in ascending order, the pth percentile is a number such that p% of the measurements fall below that number and (100-p)% fall above it
Quartiles
Let n denote the number of observations in a data set. Arrange the observed values in ascending order.
First quartile (Q1): the value at position (n+1)/4
Second quartile (Q2, the median): the value at position (n+1)/2
Third quartile (Q3): the value at position 3(n+1)/4
If a position is not a whole number, what is used?
linear interpolation
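A minimal sketch of the position rule with linear interpolation, assuming 1-based positions in the sorted data. The function names are made up for illustration, and other textbooks and software use slightly different quartile conventions.

```python
def value_at_position(sorted_values, position):
    """Return the value at a 1-based position, interpolating linearly
    between neighbours when the position is not a whole number."""
    n = len(sorted_values)
    if n == 0:
        raise ValueError("need at least one observation")
    # Clamp so very small samples do not index outside the data.
    position = min(max(position, 1), n)
    lower = int(position)          # whole-number part of the position
    fraction = position - lower    # fractional part, 0 if position is whole
    if fraction == 0:
        return sorted_values[lower - 1]
    below = sorted_values[lower - 1]
    above = sorted_values[lower]
    return below + fraction * (above - below)


def quartiles(values):
    """Q1, Q2, Q3 at positions (n+1)/4, (n+1)/2 and 3(n+1)/4."""
    data = sorted(values)          # arrange in ascending order first
    n = len(data)
    q1 = value_at_position(data, (n + 1) / 4)
    q2 = value_at_position(data, (n + 1) / 2)
    q3 = value_at_position(data, 3 * (n + 1) / 4)
    return q1, q2, q3


# n = 8, so the positions are 2.25, 4.5 and 6.75
print(quartiles([2, 4, 6, 8, 10, 12, 14, 16]))  # (4.5, 9.0, 13.5)
```

Here Q1 sits a quarter of the way from the 2nd value (4) to the 3rd value (6), which gives 4.5.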
Interquartile Range (IQR)
the sample interquartile range of the variable, denoted IQR, is the difference between the third and first quartiles of the variable; it gives the range of the middle 50% of the observed values
IQR=Q3-Q1
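A short sketch of the IQR using Python's standard library. It assumes the "exclusive" method of statistics.quantiles, which places the quartiles at the (n+1)-based positions with linear interpolation; the function name is made up for illustration.

```python
import statistics


def interquartile_range(values):
    """IQR = Q3 - Q1, with quartiles from the (n+1)-based positions."""
    q1, _q2, q3 = statistics.quantiles(values, n=4, method="exclusive")
    return q3 - q1


# Q1 = 4.5 and Q3 = 13.5, so the middle 50% of the data spans 9 units
print(interquartile_range([2, 4, 6, 8, 10, 12, 14, 16]))  # 9.0
```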
Q1 is roughly what percentile?
25th percentile
Q2 is what percentile?
50th percentile
Q3 is what percentile?
75th percentile
Five number summary
consists of the minimum, the maximum, and the three quartiles, written in increasing order
Min, Q1, Q2, Q3, Max
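A possible way to collect the five-number summary in one step, again assuming the (n+1)-based quartile positions (the "exclusive" method of statistics.quantiles). The function name is made up for illustration.

```python
import statistics


def five_number_summary(values):
    """Return (Min, Q1, Q2, Q3, Max) in increasing order."""
    q1, q2, q3 = statistics.quantiles(values, n=4, method="exclusive")
    return min(values), q1, q2, q3, max(values)


print(five_number_summary([2, 4, 6, 8, 10, 12, 14, 16]))
# (2, 4.5, 9.0, 13.5, 16)
```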
Sample Z score
Z = (x - sample mean)/s, where s is the sample standard deviation
Population Z score
Z = (x - population mean)/(population standard deviation)
Interpretation of Z score
Sign: whether score is above (+) or below (-) the mean
Number: Distance between the score and mean in standard deviation units
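A small sketch of both z-score formulas. It assumes statistics.stdev (n-1 divisor) for the sample version and statistics.pstdev (n divisor) for the population version; the function names are made up for illustration.

```python
import statistics


def sample_z(x, values):
    """z = (x - sample mean) / s, with s the sample standard deviation."""
    return (x - statistics.mean(values)) / statistics.stdev(values)


def population_z(x, values):
    """z = (x - population mean) / population standard deviation."""
    return (x - statistics.mean(values)) / statistics.pstdev(values)


data = [2, 4, 4, 4, 5, 5, 7, 9]   # mean 5, population standard deviation 2
print(population_z(9, data))      # 2.0  (two standard deviations above the mean)
print(population_z(4, data))      # -0.5 (half a standard deviation below the mean)
```

The sign tells which side of the mean the score is on, and the size tells how many standard deviations away it is.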
What is not affected by an extreme value in the data set?
median
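A quick illustration with made-up numbers: replacing the largest value with an extreme one pulls the mean upward but leaves the median where it was.

```python
import statistics

original = [10, 12, 14, 16, 18]
with_extreme = [10, 12, 14, 16, 180]   # largest value replaced by an extreme one

print(statistics.mean(original), statistics.median(original))          # 14 14
print(statistics.mean(with_extreme), statistics.median(with_extreme))  # 46.4 14
```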
Outlier
an observation that is unusually large or small relative to the other values in a data set. Outliers are typically attributable to one of the following causes:
1) the measurement was observed, recorded, or entered into the computer incorrectly
2) the measurement comes from a different population
3) the measurement is correct but represents a rare event
Box plot
based on the five number summary and can be used to provide a graphical display of the center and variation of the observed values of variables in a data set
How to construct a box plot
1) determine the five number summary
2) draw a horizontal (or vertical) axis on which the numbers obtained in step 1 can be located. Above the axis, mark the quartiles and the minimum and maximum with vertical (horizontal) lines
3) connect the quartiles to each other to make a box, and then connect the box to the minimum and maximum with lines
4) determine the lower inner fence and the upper inner fence
Lines are drawn from each hinge (the edges of the box, at Q1 and Q3) to the inner fence boundaries
5) determine the lower outer fence and the upper outer fence
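The cards do not state how the fences are computed. The sketch below assumes the common convention of inner fences at 1.5 × IQR and outer fences at 3 × IQR beyond the quartiles; check the lecture's own definition before relying on it. The function names and the wording of the labels are made up for illustration.

```python
import statistics


def fences(values):
    """Inner and outer fences, ASSUMING the 1.5*IQR and 3*IQR convention."""
    q1, _q2, q3 = statistics.quantiles(values, n=4, method="exclusive")
    iqr = q3 - q1
    return {
        "lower_outer": q1 - 3 * iqr,
        "lower_inner": q1 - 1.5 * iqr,
        "upper_inner": q3 + 1.5 * iqr,
        "upper_outer": q3 + 3 * iqr,
    }


def classify(x, f):
    """Say where an observation falls relative to the fences."""
    if x < f["lower_outer"] or x > f["upper_outer"]:
        return "beyond outer fences"
    if x < f["lower_inner"] or x > f["upper_inner"]:
        return "between inner and outer fences"
    return "inside inner fences"


data = [2, 4, 6, 8, 10, 12, 14, 16, 40]   # Q1 = 5, Q3 = 15, IQR = 10
f = fences(data)
print(f["lower_inner"], f["upper_inner"])  # -10.0 30.0
print(f["lower_outer"], f["upper_outer"])  # -25.0 45.0
print(classify(40, f))                     # between inner and outer fences
```

Under this convention, observations outside the inner fences are the usual candidates for outliers, and those beyond the outer fences are the most extreme.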