Chapter 3 Numerically Summarizing Data Flashcards
Measures of Central Tendency
Give a feel for where the center of gravity of the data set is: Mean, Median, Mode
Arithmetic Mean
The average
Median
the value that lies in the middle of the data set when arranged in ascending order
Mode
The most frequent observation of the variable that occurs a the data set
A data set can have no mode, one mode, or more than one mode
The symbol for the mean of a population
The Greek letter mu (μ)
The symbol for the mean of a sample
(X ̅) “X-Hat”
The symbol for the median of a population
M
The symbol for the median of a sample
m
Relation between the mean, median, and a distribution shape that is skewed to the left
Mean is substantially smaller than the median
Relation between the mean, median, and a distribution shape that is symmetric
Mean roughly equal to median
Relation between the mean, median, and a distribution shape that is skewed to the right
Mean substantially larger than median
What does it mean when it is said that a data set is resistant?
Extreme values (very large or small) relative to the data do not affect its value substantially
Outlier
A data point that differs significantly from other observations. Results in a skewed distribution
What is the better measure of central tendency when the distribution is skewed?
The median
Measures of Dispersion
Show the degree to which the data in a population or sample is spread out: range, standard deviation, variance, interquartile range
Range
The difference between the largest data value and the smallest data value in a data set. Denoted as R
Range is not resistant to outlier values
Population Variance
The sum of the squared deviations from the population mean divided by the number of observations in the population, N. Denoted by the greek letter sigma squared (σ^2)
Population Standard Deviation
The positive square root of the population variance. Denoted by the Greek letter sigma (σ).
The population variance and standard deviation are
Parameters
Sample Variance
The sum of the squared deviations from the population mean divided by the size of the sample MINUS 1 (n - 1). Denoted by s squared (s^2).