DJM Topic 4 - Visualizing Distribution Flashcards
Distribution
Examining sets of quantitative values to see how the values are distributed from lowest to highest
mean, median, and mode
Describe how parts of the data are distributed, but do not show the full distribution
Spread
The lowest value, highest value, and distance between them in a distribution
Center
An estimate of the middle set of values or the most typical value in a distribution
Shape
Where the values are located in a distribution, such as curved or flat, upward or downward, single or multiple peaked, symmetrical or skewed, concentrations, and gaps
Outliers
Values that are unusually high or low compared to the rest of the distribution
3-value summary
minimum, median, maximum
5-value summary
minimum, 25th percentile (Q1), median, 75th percentile (Q3), maximum
Distribution patterns
The shape, concentrations, gaps, and outliers in a distribution
Shape patterns
Curved or flat, upward or downward, single or multiple peaked, symmetrical or skewed
Single distribution display examples
1) Histogram
2) Frequency Polygons
3) Strip Plots
4) Stem-and-Leaf Plots
Multiple distribution display examples
1) Box plots
2) Multiple strip plots
3) Distribution deviation graphs
Distribution Analysis Best Practices
1) Keeping intervals consistent
2) Selecting interval size
3) Using resistant measures (Median & Percentiles instead of Mean & Standard Deviation)
Bar Chart vs Histogram
Histograms display distributions of variables and plot binned quantitative data, while bar charts are used for comparing variables and plot categorical data. Unlike in bar charts, bars cannot be reordered in histograms.