2.5 Measure of Location and Outliers Flashcards
Measure of Location
tells us about an observation relative standing in the distribution, common measures are percentiles and quartiles
Percentiles
a measure of location that divides the data into 100 groups with roughly 1% of the values in each group
How to find the persentile
number of values less than x divided by the total number of values then multiply by 100
Finding the kth percentile
sort data, use location function L=(k/100)*n where k=percentile and n=total number, if L is a whole number count in your data set to L, if L is not a whole number, round up then count to find your value
Quartiles
special cases of percentiles, Q1=25th, Q2=50th (median), Q3=75th
Median
halfway point, if n is odd then its the middle ordered data value, if n is even then its the average of the two middle ordered data values
Location Function to find the mediam
L(M)= n+1/2
How to find Q1
look at the lower half of the data, to the left of the median and find the median for the lower half
How to find Q3
look at the upper half of the data values, to the right of the median, and find the median of the upper half
IQR
Q3-Q1, only takes into account 50% of the data
Fence Rule
Checking for outliers using quartiles, value less than the lower fence or greater than the upper fence could be considered an outlier
Lower Fence
Q1-(1.5*IQR)
Upper fence
Q3+(1.5*IQR)
Causes of outliers
measurement errors, data entry error, sampling error, chance occurrence