Data-Based and Statistical Reasoning Flashcards
Measurements of Central Tendency
Provide a single value representation for the middle of a group of data (mean, median, mode)
Measurements of Central Tendency
Provide a single value representation for the middle of a group of data (mean, median, mode)
Measures of Central Tendency
Provide a single value representation for the middle of a group of data (mean, median, mode)
Measures of Central Tendency
Provide a single value representation for the middle of a group of data
Arithmetic Mean (Average)
A measure that equally weighs all values; it’s most affected by outliers
Median
The value that lies in the middle of the data set; 50% of the data points are above and below the median
Median
The value that lies in the middle of the data set; 50% of the data points are above and below the median
Mode
The data point that appears most often; there may be multiple, or zero, modes in a data set
Mode
The data point that appears most often; there may be multiple, or zero, modes in a data set
Range
The difference between the smallest and largest number in a sample
Distribution
Classified by measures of central tendency and measures of distribution; it has a particular shape
Distribution
Classified by measures of central tendency and measures of distribution; it has a particular shape
Normal Distribution
Symmetrical, mean, median and mode are the same
Standard Normal Distribution
Normal distribution with a mean of zero and standard deviation of 1
Skewed Distribution
Have differences in their mean, median and mode; skew is in the direction of the tail
Bimodal Distribution
Multiple peaks; it may be useful to perform data analysis on the two groups separately
Bimodal Distribution
Multiple peaks; it may be useful to perform data analysis on the two groups separately
Measures of Distribution
Range
Interquartile Range
Standard Deviation
Outliers
Interquartile Range
Difference between the value of the third quartile and the first quartile
Interquartile Range
Difference between the value of the third quartile and the first quartile
Standard Deviation
Measurement of the variability about the mean; it is used to calculate how much the data varies
Standard Deviation
Measurement of the variability about the mean; it is used to calculate how much the data varies
Outliers
A result of true population variability, measurement error or non-normal distribution
Outliers
A result of true population variability, measurement error or non-normal distribution
Probability of Independent Events
Does not change based on the outcomes of other events
Probability of Dependent Events
Does change depending on the outcomes of other events
Mutually Exclusive Outcomes
Cannot occur simultaneously
Mutually Exclusive Outcomes
Cannot occur simultaneously
Hypothesis Tests
Use a known distribution to determine whether a hypothesis of difference (the null hypothesis) can be rejected
Hypothesis Tests
Use a known distribution to determine whether a hypothesis of difference (the null hypothesis) can be rejected
Null Hypothesis
States that two populations are equal or that a single population can be described by parameter equal to a given value
Correlation
Expresses a relationship between two sets of data using a single number
Positive Correlation
Indicates a positive association between the two variables; that is, when one variable increases the other also tends to increase as well
Negative Correlation
Indicates a negative association between the two variables; that is, when one increases the other tends to decrease or vice versa
Correlation does not imply causation
Conclusions cannot be drawn about behavioral problems based on correlation