Descriptive Statistics Flashcards
Collapsing data
Take a large data set and condense it into what you need.
Central Tendency Measures
- Mean
- Median
- Mode
Variability
Spread in the data
Distribution =
total set of scores (n)
Frequency distribution
- Rank ordering that shows the number of times each value occurred (its frequency)
Allows you to examine the distribution of scores
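As a minimal sketch of a frequency distribution, the snippet below counts how often each value occurs and lists the values in rank order. The `scores` list is made-up data for illustration only.

```python
from collections import Counter

# Made-up scores, for illustration only
scores = [3, 5, 5, 7, 7, 7, 9]

# Count how many times each value occurs
freq = Counter(scores)

# List each value in rank order with its frequency
for value in sorted(freq):
    print(value, freq[value])
```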
Ideal frequency distribution shape is a
bell curve
Cumulative Percentage
Percentage of people who fall at or below a score
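One way to compute a cumulative percentage is to keep a running total of frequencies and divide by n. This sketch assumes the common convention of counting scores at or below each value; the `scores` list is made-up data.

```python
from collections import Counter

# Made-up scores, for illustration only
scores = [3, 5, 5, 7, 7, 7, 9, 9, 10, 10]
n = len(scores)
freq = Counter(scores)

# Running total of frequencies, converted to a percentage of n
cumulative_pct = {}
running = 0
for value in sorted(freq):
    running += freq[value]
    cumulative_pct[value] = 100 * running / n

for value, pct in cumulative_pct.items():
    print(value, pct)
```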
What are 4 ways to describe the shape of a distribution?
- Symmetrical
- Uniform
- Normal
- Skewed (doesn’t follow normal curve)
Positive vs Negative Skew
- Negative skew: tail extends toward the negative (low) side
- Positive skew: tail extends toward the positive (high) side
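The direction of skew can be checked numerically. The sketch below uses one common definition, the population moment coefficient of skewness (the mean of the cubed z-scores); the helper name `skewness` and both data sets are assumptions made for illustration. A positive result means the tail is on the high side, a negative result means it is on the low side.

```python
import statistics

def skewness(data):
    """Population moment coefficient of skewness: mean of cubed z-scores."""
    m = statistics.fmean(data)
    sd = statistics.pstdev(data)
    return sum(((x - m) / sd) ** 3 for x in data) / len(data)

# Made-up data: long tail toward high values
positive = [1, 2, 2, 3, 3, 3, 4, 20]
# Mirror image: long tail toward low values
negative = [-x for x in positive]

print(skewness(positive) > 0)  # tail on the positive side
print(skewness(negative) < 0)  # tail on the negative side
```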
Mode
- Most frequent number
- Not useful for continuous data
- Bimodal = 2 modes; multimodal = more than 2 modes
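A short sketch using `statistics.multimode` from the Python standard library (Python 3.8+), which returns every value tied for most frequent. The data are made up; the last list hints at why the mode is not useful for continuous data, where values rarely repeat.

```python
import statistics

# Made-up data, for illustration only
unimodal = [1, 2, 2, 3]
bimodal = [1, 1, 2, 3, 3]
continuous = [1.2, 2.7, 3.14]  # no value repeats

one_mode = statistics.multimode(unimodal)     # a single mode
two_modes = statistics.multimode(bimodal)     # two modes
no_help = statistics.multimode(continuous)    # every value ties

print(one_mode, two_modes, no_help)
```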
Median
- Middle Score
Advantages:
* Unaffected by extreme scores
* Average position in the distribution, not amount
* Useful for skewed data with extreme scores
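To see why the median is unaffected by extreme scores, the sketch below swaps one score for an extreme value and compares the median with the mean. Both lists are made-up data.

```python
import statistics

# Made-up scores, for illustration only
scores = [2, 3, 4, 5, 6]
with_extreme = [2, 3, 4, 5, 60]  # last score replaced by an extreme value

median_before = statistics.median(scores)
median_after = statistics.median(with_extreme)
mean_before = statistics.mean(scores)
mean_after = statistics.mean(with_extreme)

print(median_before, median_after)  # the middle position does not move
print(mean_before, mean_after)      # the mean is pulled toward the extreme
```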
Mean
- Average
Sum of scores/n
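The formula can be written out directly. The `scores` list is made-up data.

```python
# Made-up scores, for illustration only
scores = [4, 8, 6, 2]

# Mean = sum of scores divided by the number of scores (n)
n = len(scores)
mean = sum(scores) / n

print(mean)
```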
Most appropriate measure of central tendency for each level of measurement
- Interval/Ratio = Mean
- Ordinal = median or mode
- Nominal = Mode
- Mean is the most stable measure but is strongly affected by skew
- Median/Mode are less affected by skew
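The pairing of measurement level and measure can be sketched as follows. All three variables and their values are hypothetical examples, and the ordinal ratings are assumed to be coded 1 to 5.

```python
import statistics

# Hypothetical data, for illustration only
eye_color = ["brown", "blue", "brown", "green"]   # nominal: categories only
satisfaction = [1, 2, 2, 3, 5]                    # ordinal: ranked ratings 1-5
temperature_c = [20.5, 21.0, 19.5, 23.0]          # interval: equal-sized units

nominal_center = statistics.mode(eye_color)        # mode for nominal data
ordinal_center = statistics.median(satisfaction)   # median for ordinal data
interval_center = statistics.mean(temperature_c)   # mean for interval/ratio data

print(nominal_center, ordinal_center, interval_center)
```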
Measures of variability
- Variability = dispersion of scores
Measures of variability:
* Range (max - min)
* Percentiles and quartiles
* Variance (spread)
* Standard Deviation
* Coefficient of variation
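The measures listed above can be computed with the Python standard library. This sketch assumes population formulas (`pvariance`, `pstdev`) and takes the coefficient of variation as standard deviation divided by the mean; use `variance` and `stdev` instead for sample estimates. The `scores` list is made-up data.

```python
import statistics

# Made-up scores, for illustration only
scores = [2, 4, 4, 4, 5, 5, 7, 9]

data_range = max(scores) - min(scores)         # range = max - min
quartiles = statistics.quantiles(scores, n=4)  # three cut points: Q1, Q2, Q3
variance = statistics.pvariance(scores)        # population variance
std_dev = statistics.pstdev(scores)            # population standard deviation
cv = std_dev / statistics.mean(scores)         # coefficient of variation

print(data_range, quartiles, variance, std_dev, cv)
```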
Range
- Range = maximum - minimum
- Least useful
- Greatly affected by outliers
- Hard to compare different sample sizes
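A small sketch of how a single outlier changes the range. Both lists are made-up data.

```python
# Made-up scores, for illustration only
scores = [10, 12, 13, 15, 16]
with_outlier = scores + [90]  # one extreme score added

range_before = max(scores) - min(scores)
range_after = max(with_outlier) - min(with_outlier)

print(range_before, range_after)  # one outlier dominates the range
```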