Chpt 3- Numerically Summarizing Data Flashcards
arithmetic mean
adding all the values of a variable in a data set, then dividing by the number of observations.
(average)
median
The value that lies in the middle of the data when arranged in ascending order. (M)
If the number of observation is even, then the median is the data set value that falls at the mean of the observations between n/2 and n/2 +1 positions. (The average of the two middle values on the ascending list)
If the number of observations is odd, then the median is the data set value that falls at (n+1)/2. (The middle value on the ascending list)
resistant statistic
a numerical data summary is described as resistant if extreme values do not affect its value substantially.
In a given data set, the mean may be resistant but not the median, or visa versa.
mode
the mode of a variable is the most frequent observation(s) that occurs in the data set.
Pareto chart
bar graph that organizes data by frequency, or relative frequency.
Either ascending or descending.
Pareto chart
bar graph that organizes data by ascending or descending frequency, or relative frequency.
Side-by-side bar graph
compares data for more than one variable.
Ogive graph
a graph representing cumulative frequency or relative frequency of the data.
frequency polygon
connects data points with a line.
Range
The range (R) of a variable is the difference between the largest and smallest data values.
R = largest data value (minus) smallest data value
dispersion
the degree to which the data are spread out
population standard deviation around a variable
square root of the sum of squared deviations about the population mean, divided by
The Empirical Rule
If a distribution is roughly bell-shaped, then…
99.7% fall within 3 standard deviations of the mean.
95% fall within 2 standard deviations of the mean.
68% fall within 1 standard deviation of the mean.
modal class
The class of data that has the highest frequency
z-score
the distance that a data value is from the mean in terms of the number of standard deviations (the number of standard deviations from the mean).
Unitless
Has a mean (center) of 0 and a standard deviation (spread) of 1.