2.4-2.8 Flashcards
What does knowledge of a data set’s variability and center help us to do?
It can help us visualize the shape of the data set as well as its extreme values.
What is the range?
The range of a quantitative data set is equal to the largest measurement minus the smallest measurement.
What is a drawback of the range, and why?
The range is a rather insensitive measure of data variation when the data sets are large, because two data sets can have the same range and still be vastly different with respect to data variation.
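A minimal sketch of why the range can hide differences in variability. The two data sets below are made up for illustration (they are not from the textbook), and `data_range` is a hypothetical helper name. Both sets have the same range, yet their sample variances differ.

```python
from statistics import variance


def data_range(values):
    """Largest measurement minus smallest measurement."""
    return max(values) - min(values)


# Illustrative data only: same minimum and maximum, different spread.
clustered = [0.0, 5.0, 5.0, 5.0, 5.0, 5.0, 10.0]
spread_out = [0.0, 0.0, 0.0, 5.0, 10.0, 10.0, 10.0]

print("ranges:", data_range(clustered), data_range(spread_out))
print("sample variances:", variance(clustered), variance(spread_out))
```

The range only looks at the two extreme values, so it cannot tell these sets apart, while the variance uses every measurement.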
What is sample variance?
The sample variance for a sample of n measurements is equal to the sum of the squared deviations from the mean, divided by (n - 1). The symbol s^2 is used to represent the sample variance.
What does it mean if the deviations are mostly small?
The data are clustered around the mean, x̄, and therefore do not exhibit much variability.
What is the sample standard deviation?
The sample standard deviation, s, is defined as the positive square root of the sample variance, s^2.
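The two definitions above can be sketched in a few lines. This is an illustrative sketch, not the textbook's code: `sample_variance` and `sample_std` are hypothetical names, and the data are made up. The results are compared against Python's standard `statistics.variance` and `statistics.stdev`, which also divide by n - 1.

```python
import math
from statistics import mean, stdev, variance


def sample_variance(values):
    """Sum of squared deviations from the mean, divided by n - 1."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two measurements")
    x_bar = mean(values)
    return sum((x - x_bar) ** 2 for x in values) / (n - 1)


def sample_std(values):
    """Positive square root of the sample variance."""
    return math.sqrt(sample_variance(values))


data = [1.0, 2.0, 3.0, 4.0, 5.0]  # illustrative data only
print("s^2:", sample_variance(data), "library:", variance(data))
print("s:  ", sample_std(data), "library:", stdev(data))
```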
What symbol represents a population variance?
sigma^2 (sigma by itself represents the population standard deviation)
Why do we use n-1 when calculating sample variance instead of just n?
Dividing by n tends to produce an underestimate of the population variance, sigma^2. Dividing by n-1 corrects for this.
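One way to see the underestimate, sketched under stated assumptions: take a tiny made-up population, list every possible sample of size 2 drawn with replacement, and average each version of the variance over all of those samples. The population, the sample size, and the helper name `sum_sq_dev` are all illustrative choices. Exact fractions are used so rounding does not blur the comparison.

```python
from fractions import Fraction
from itertools import product

population = [1, 2, 3, 4, 5, 6]  # illustrative population
N = len(population)
mu = Fraction(sum(population), N)
# Population variance: average squared deviation from the population mean.
sigma_sq = sum((x - mu) ** 2 for x in population) / N


def sum_sq_dev(sample):
    """Sum of squared deviations from the sample mean."""
    x_bar = Fraction(sum(sample), len(sample))
    return sum((x - x_bar) ** 2 for x in sample)


n = 2
# Every equally likely ordered sample of size n, drawn with replacement.
samples = list(product(population, repeat=n))

avg_div_n = sum(sum_sq_dev(s) / n for s in samples) / len(samples)
avg_div_n_minus_1 = sum(sum_sq_dev(s) / (n - 1) for s in samples) / len(samples)

print("sigma^2:               ", sigma_sq)
print("average, divide by n:  ", avg_div_n)
print("average, divide by n-1:", avg_div_n_minus_1)
```

In this setup, the divide-by-n average comes out below sigma^2, while the divide-by-(n-1) average matches it.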
What does the value of standard deviation indicate?
The larger the standard deviation, the more variable the data are. The smaller the standard deviation, the less variation there is in the data.
What is Chebyshev’s rule?
a. It is possible that very few of the measurements will fall within one standard deviation of the mean
b. At least 3/4 of the measurements will fall within two standard deviations of the mean
c. At least 8/9 of the measurements will fall within three standard deviations of the mean
d. Generally, for any number k greater than 1, at least (1-1/k^2) of the measurements will fall within k standard deviations of the mean, that is, within the interval (x̄ - ks, x̄ + ks)
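Chebyshev's rule can be checked on any data set, including a heavily skewed one. The sketch below is illustrative: the data are made up, and `chebyshev_bound` and `fraction_within` are hypothetical helper names. It compares the guaranteed minimum fraction with the fraction actually observed inside k sample standard deviations of the mean.

```python
from statistics import mean, stdev


def chebyshev_bound(k):
    """Minimum fraction of measurements within k standard deviations (k > 1)."""
    if k <= 1:
        raise ValueError("the bound 1 - 1/k^2 is only informative for k > 1")
    return 1 - 1 / k ** 2


def fraction_within(values, k):
    """Observed fraction of measurements inside (x_bar - k*s, x_bar + k*s)."""
    x_bar = mean(values)
    s = stdev(values)
    low, high = x_bar - k * s, x_bar + k * s
    return sum(low <= x <= high for x in values) / len(values)


skewed = [1, 1, 1, 2, 2, 3, 4, 6, 10, 30]  # illustrative, strongly skewed data
for k in (2, 3):
    print(k, "bound:", chebyshev_bound(k), "observed:", fraction_within(skewed, k))
```

The observed fraction is usually well above the bound, since the rule is a worst-case guarantee.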
What is the empirical rule?
a. Approximately 68% of the measurements will fall within one standard deviation of the mean
b. Approximately 95% of the measurements will fall within two standard deviations of the mean
c. Approximately 99.7% (essentially all) of the measurements will fall within three standard deviations of the mean
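The three percentages in the empirical rule match what the normal distribution gives, and the normal curve is one common model of a mound-shaped, symmetric distribution. As a hedged sketch (the choice of a standard normal model is an assumption made here, and `normal_fraction_within` is a hypothetical name), Python's `statistics.NormalDist` can reproduce them:

```python
from statistics import NormalDist

# Assumption: model the mound-shaped data with a standard normal curve.
standard_normal = NormalDist(mu=0.0, sigma=1.0)


def normal_fraction_within(k):
    """Fraction of a normal distribution within k standard deviations of its mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)


for k in (1, 2, 3):
    print(k, round(normal_fraction_within(k), 4))
```

Real data sets that are only roughly mound-shaped will give fractions near these values, not exactly equal to them.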
What sets of data do Chebyshev’s rule and the empirical rule apply to?
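Chebyshev's rule: any data set, regardless of the shape of its distribution
Empirical rule: data sets with a mound-shaped, roughly symmetric distribution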
What are measures of relative standing?
Descriptive measures of the relationship of a measurement to the rest of the data
What is the pth percentile?
For any set of n measurements (arranged in ascending or descending order), the pth percentile is a number such that p% of the measurements fall below that number and (100-p)% fall above it.
Explain the quartiles.
The lower quartile (QL) is the 25th percentile of a data set. The middle quartile (M) is the median or 50th percentile. The upper quartile (QU) is the 75th percentile.
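A short sketch of computing the three quartiles with Python's standard library. The data are made up for illustration. Note the assumption: `statistics.quantiles` uses its own interpolation convention (the default "exclusive" method), so for small data sets its QL and QU may differ slightly from values found with a textbook's hand method.

```python
from statistics import median, quantiles

# Illustrative data only.
data = [2, 3, 4, 5, 7, 8, 9, 10, 12, 13, 15]

# n=4 splits the data into four groups and returns the three cut points:
# the 25th, 50th, and 75th percentiles.
q_l, m, q_u = quantiles(data, n=4)

print("QL:", q_l)
print("M: ", m, "(median:", median(data), ")")
print("QU:", q_u)
```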