STAT MOD 2: Chapter 3 Flashcards
What is the measure of center?
where the data distribution is located along the number line
provides information about what is TYPICAL
What is the appropriate measure of center if distribution is symmetric?
mean/average = symmetric
What is the appropriate measure of center if distribution is skewed?
median = skewed or has outliers
What is measure of spread?
how much variability is in a data distribution
provides information about how much individual values tend to DIFFER from one another
What is the appropriate measure of spread if distribution is symmetric?
standard deviation = symmetric
if the distribution is also roughly bell-shaped, use the empirical rule
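A minimal sketch of the empirical rule, assuming a roughly bell-shaped distribution: about 68%, 95%, and 99.7% of values fall within 1, 2, and 3 standard deviations of the mean. The function name and the example mean and standard deviation are made up for illustration.

```python
def empirical_rule_intervals(mean, sd):
    """Return the approximate 68 / 95 / 99.7 percent intervals around the mean."""
    intervals = {}
    for k, percent in ((1, 68), (2, 95), (3, 99.7)):
        # Each interval is mean minus/plus k standard deviations.
        intervals[percent] = (mean - k * sd, mean + k * sd)
    return intervals

# Hypothetical example: mean 100, standard deviation 15.
intervals = empirical_rule_intervals(100, 15)
for percent, (low, high) in intervals.items():
    print(f"about {percent}% of values fall between {low} and {high}")
```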
What is the appropriate measure of spread if distribution is skewed?
Interquartile range = skewed or has outliers
What is notation (n)?
the number of observations in the data set
What is mean? How do you compute mean?
numerical average
Excel command: =AVERAGE(dataset)
Add up all values then divide by number of values
Hint: the data might be given as a list of numbers, a stem-and-leaf plot, or a dotplot
What is the median? How do you compute the median?
Middle data value for an odd number of observations, or the average of the two middle values for an even number of observations
Excel command: =MEDIAN(dataset)
Order the values, then find the middle value or the average of the two middle values
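The mean and median steps above can be sketched in Python as a cross-check on the Excel commands. The function names and the small data sets are made up for illustration.

```python
from statistics import mean, median

def mean_by_hand(values):
    # Add up all values, then divide by the number of values (n).
    return sum(values) / len(values)

def median_by_hand(values):
    # Order the values first.
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd n: the single middle value.
        return ordered[mid]
    # Even n: the average of the two middle values.
    return (ordered[mid - 1] + ordered[mid]) / 2

odd_data = [7, 3, 9, 5, 1]   # n = 5
even_data = [7, 3, 9, 5]     # n = 4

print(mean_by_hand(odd_data), median_by_hand(odd_data))
print(mean_by_hand(even_data), median_by_hand(even_data))
```

The standard library functions `statistics.mean` and `statistics.median` give the same results on these lists.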
Is the mean resistant to outliers?
No, mean is a non-resistant measure
Outliers affect the mean because it takes every value into account, so an extreme value pulls the mean toward it
Is the median resistant to outliers?
Yes, median is a resistant measure
Outliers have little or no effect on the median, as it’s just the middle value
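A quick illustration of resistance, using made-up numbers: replacing the largest value with an outlier moves the mean a lot but leaves the median where it was.

```python
from statistics import mean, median

original = [10, 12, 13, 14, 16]
with_outlier = [10, 12, 13, 14, 100]  # 16 replaced by an outlier

mean_before, median_before = mean(original), median(original)
mean_after, median_after = mean(with_outlier), median(with_outlier)

print("mean:  ", mean_before, "->", mean_after)      # mean is pulled toward the outlier
print("median:", median_before, "->", median_after)  # median stays at the middle value
```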
What is the mean/median on the histogram?
Mean: the point where the histogram would balance
Median: the point where half the area falls to the left and half to the right (SPLITS in the middle)
Where does mean/median fall relative to one another in a SYMMETRIC distribution?
mean and median are both in the middle or approximately the same
Where does mean/median fall relative to one another in a RIGHT SKEWED distribution?
mean > median (the mean is pulled farther right, toward the tail)
Where does mean/median fall relative to one another in a LEFT SKEWED distribution?
mean < median (the mean is pulled farther left, toward the tail)
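The three cases above can be checked on small made-up data sets. This is only a sketch of the typical pattern; the data values are invented for illustration.

```python
from statistics import mean, median

symmetric = [1, 2, 3, 4, 5]
right_skewed = [1, 2, 2, 3, 3, 3, 4, 20]         # long tail to the right
left_skewed = [1, 17, 18, 18, 18, 19, 19, 20]    # long tail to the left

for name, data in (("symmetric", symmetric),
                   ("right skewed", right_skewed),
                   ("left skewed", left_skewed)):
    print(f"{name}: mean = {mean(data)}, median = {median(data)}")
```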
How do you compute range?
Range = maximum - minimum
What is interquartile range? How do you compute interquartile range?
How spread out the middle half of the data is
IQR = Q3 - Q1
How do you find third quartile?
Take the median of the upper half of the ordered data values (the values above the median of the data set)
How do you find first quartile of the data set?
Take the median of the lower half of the ordered data values (the values below the median of the data set)
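A minimal sketch of the range, quartiles, and IQR using the median-of-halves method described in these cards. It assumes at least two values, and when n is odd it leaves the overall median out of both halves. Software (including Excel) may use a slightly different quartile rule, so results can differ a little. The function name and data are made up for illustration.

```python
from statistics import median

def quartiles(values):
    """Return (Q1, Q3) as the medians of the lower and upper halves."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]
    # For odd n, skip the middle value so it is in neither half.
    upper = ordered[half + 1:] if n % 2 == 1 else ordered[half:]
    return median(lower), median(upper)

data = [2, 4, 5, 7, 8, 10, 12, 15, 18]

q1, q3 = quartiles(data)
iqr = q3 - q1                        # IQR = Q3 - Q1
data_range = max(data) - min(data)   # Range = maximum - minimum

print("Q1 =", q1, "Q3 =", q3, "IQR =", iqr, "Range =", data_range)
```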
What is standard deviation?
Roughly the average distance of the observations from the mean, measured in the same units as the data
What is the sample standard deviation?
How far, on average, the observations in a sample are from the sample mean
- if given several data sets, know which has the largest or smallest standard deviation
What kind of data sets would have larger standard deviations?
Data sets with more spread (variability) have larger standard deviations
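To compare spreads, here is a sketch that computes the sample standard deviation by hand for two made-up data sets with the same mean. It assumes the usual sample formula, which divides the sum of squared deviations by n - 1; the function name is made up for illustration.

```python
from math import sqrt
from statistics import stdev

def sample_sd(values):
    n = len(values)
    xbar = sum(values) / n                           # sample mean
    squared_devs = [(x - xbar) ** 2 for x in values] # squared distances from the mean
    return sqrt(sum(squared_devs) / (n - 1))         # divide by n - 1, then take the root

tight = [9, 10, 10, 10, 11]   # values close to the mean of 10
wide = [2, 6, 10, 14, 18]     # values far from the mean of 10

print("tight:", sample_sd(tight))
print("wide: ", sample_sd(wide))
```

The more spread-out data set has the larger standard deviation, and both results agree with `statistics.stdev`.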
Notation for mean of population and sample?
µ - mean of population
x̄ (x-bar) - mean of sample
Notation for standard deviation of population and sample?
σ (sigma) - population
s - sample
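The notation maps onto two different functions in Python's `statistics` module: `pstdev` treats the data as the whole population (σ, divides by N), while `stdev` treats it as a sample (s, divides by n - 1). The data values below are made up for illustration.

```python
from statistics import mean, pstdev, stdev

data = [2, 4, 4, 4, 5, 5, 7, 9]

average = mean(data)     # same arithmetic for mu and x-bar; only the notation differs
sigma = pstdev(data)     # population standard deviation (divides by N)
s = stdev(data)          # sample standard deviation (divides by n - 1)

print("mean =", average)
print("sigma =", sigma)
print("s =", s)
```

For the same data, s comes out slightly larger than σ because it divides by a smaller number.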