Relative Positions of Data and Boxplots (a.k.a. Quartiles (2)!) Flashcards
It is the value that cuts the ordered data into two parts so that
- approximately P% of the data lie below it and
- approximately (100-P)% of the data lie above it
The three of these that cut the data into fourths (the 25th, 50th, and 75th) are called quartiles.
Percentile
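As a rough check of the percentile definition, the share of observations falling below a candidate value can be counted directly. This is only a sketch: the helper name `percent_below` and the sample data are made up for illustration, and textbooks differ on whether values equal to the cut point are counted.

```python
def percent_below(data, value):
    """Return the percentage of observations strictly below `value`."""
    below = sum(1 for x in data if x < value)
    return 100 * below / len(data)

# Hypothetical data set: the integers 1 through 20
scores = list(range(1, 21))

# 15 of the 20 scores are below 16, so 16 sits near the 75th percentile
share = percent_below(scores, 16)
print(share)
```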
How do you split the data set when the number of data points is EVEN? What about when it is ODD?
EVEN: Split the ordered data exactly in half
ODD: Include the median in both halves
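The splitting rule on this card can be sketched in a few lines. The function name `split_halves` and the sample lists are hypothetical; note that some textbooks and software instead leave the median out of both halves when the count is odd, so results may differ from other tools.

```python
from statistics import median

def split_halves(data):
    """Split the ordered data into a lower and an upper half.

    Even count: cut exactly in half.
    Odd count: the median is included in both halves.
    """
    values = sorted(data)
    n = len(values)
    mid = n // 2
    if n % 2 == 0:
        return values[:mid], values[mid:]
    return values[:mid + 1], values[mid:]

even_lower, even_upper = split_halves([1, 2, 3, 4, 5, 6])
odd_lower, odd_upper = split_halves([1, 2, 3, 4, 5])

# Q1 and Q3 are the medians of the lower and upper halves
q1_odd = median(odd_lower)
q3_odd = median(odd_upper)
```

With the odd-length list, the median 3 appears in both halves, giving Q1 = 2 and Q3 = 4.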
What is the relationship between the boxplot and the five-number summary?
The five-number summary (minimum, Q1, median, Q3, maximum) is the set of values needed to draw the boxplot that represents the data
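A minimal sketch of how the five values behind a boxplot might be computed, assuming the "include the median in both halves" rule for odd counts. The function name `five_number_summary` and the data are illustrative only.

```python
from statistics import median

def five_number_summary(data):
    """Return (minimum, Q1, median, Q3, maximum) for the data."""
    values = sorted(data)
    n = len(values)
    mid = n // 2
    # Odd count: the median is kept in both halves (assumed convention)
    lower = values[:mid] if n % 2 == 0 else values[:mid + 1]
    upper = values[mid:]
    return (values[0], median(lower), median(values), median(upper), values[-1])

summary = five_number_summary([2, 4, 4, 5, 7, 8, 9, 10, 12])
print(summary)
```

In a boxplot, Q1 and Q3 would form the edges of the box, the median the line inside it, and the minimum and maximum the ends of the whiskers (before any outlier check).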
It is the length of the interval that contains the middle half of a numerically ordered data set (Q3 - Q1)
Interquartile range (IQR)
Observations that are unusually far from the bulk of the data
Outliers
What are the formulas for the two types of outliers?
- Low outliers: less than Xmin = Q1 - (1.5 × IQR)
- High outliers: greater than Xmax = Q3 + (1.5 × IQR)
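The two cutoffs can be applied as in the sketch below. It assumes IQR = Q3 - Q1 and the "median in both halves" rule for odd counts; the function name `outlier_fences` and the data are hypothetical.

```python
from statistics import median

def outlier_fences(data):
    """Return (Xmin, Xmax), the cutoffs below and above which values are outliers."""
    values = sorted(data)
    n = len(values)
    mid = n // 2
    # Odd count: the median is kept in both halves (assumed convention)
    lower = values[:mid] if n % 2 == 0 else values[:mid + 1]
    upper = values[mid:]
    q1, q3 = median(lower), median(upper)
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr

data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 30]
x_min, x_max = outlier_fences(data)

# Keep only the values that fall outside the two cutoffs
outliers = [x for x in data if x < x_min or x > x_max]
print(x_min, x_max, outliers)
```

Here Q1 = 3.5 and Q3 = 8.5, so IQR = 5 and the cutoffs are -4 and 16; only 30 lies outside them.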
It shows the distance of a value (x) from the mean of the data set in standard deviation units
Z-score
What do negative and positive z-scores represent?
Negative - the observation is below the mean
Positive - the observation is above the mean
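A small sketch of the z-score calculation, z = (x - mean) / standard deviation, showing how the sign follows from the position of x relative to the mean. The cards do not say whether the population or sample standard deviation is meant; this example assumes the population version (`statistics.pstdev`), and the function name `z_score` and the data are made up.

```python
from statistics import mean, pstdev

def z_score(x, data):
    """Distance of x from the mean of data, in standard deviation units."""
    # Assumption: population standard deviation; swap in stdev() for a sample
    return (x - mean(data)) / pstdev(data)

# Hypothetical data with mean 5 and population standard deviation 2
data = [2, 4, 4, 4, 5, 5, 7, 9]

z_high = z_score(9, data)  # above the mean, so positive
z_low = z_score(2, data)   # below the mean, so negative
print(z_high, z_low)
```

A value equal to the mean would get a z-score of 0.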