Physics and Math Chapter 12: Data-based and Statistical Reasoning Flashcards
Measurements of central tendency
provide a single value representation for the middle of a group of data.
Arithmetic mean or average
measure of central tendency that equally weighs all values; most affected by outliers.
Median
Value that lies in the middle of the data set. 50% of points are above and below the median
Mode
the data point that appears most often; there may be multiple (or zero) modes in a data set
Distributions
have characteristic features that are exemplified by their shape. Distributions can be classified by measures of central tendency and measures of distribution.
Normal distribution
symmetrical - mean, median and mode are all the same in the normal distribution.
- 68% of points occur within one standard deciation of the mean, 95% within 2 and 99% within 3.
Standard normal distribution
normal distribution with a mean of 0 and a standard deviation of 1; used for most calculations
Skewed distributions
have differences in their mean, median and mode; the skew direction is the direction of the tail of the distribution
Bimodal distributions
have multiple peaks, although not necessarily multiple modes, strictly speaking. It may be useful to perform data analysis on the two groups separately.
Range
the difference between the larges and smallest values in a data set
Interquartile range
the difference between the value of the third quartile and first quartile 0 interquartile range can be used to determine outliers
Standard deviation
a measurement of variability about the mean; can also be used to determine outliers.
Outliers may be a result of:
true population variability, measurement error or a non-normal distribution
Prob of Independent Events
Does not change based on the outcomes of other events
Mutually exclusive outcomes
cannot occur simultaneously