DJM Topic 3 - Visualizing Proportions Flashcards
Proportion data
Data grouped by categories and subcategories, with the maximum, minimum, and overall distribution being important
ranking analysis for proportions
Sorting the expenses to perform ranking analysis
part-to-whole analysis for proportions
Expressing expenses as percentages to do part-to-whole analysis
key questions for analyzing proportion data
Which department has the highest or lowest expenses? how much expenses has a specific department incurred for the whole company?
patterns to look for in proportion data
Unusual differences from one value to the next, grouped values, significant breaks in a pattern, obvious exceptions to the norm
visualizations for proportion data
Pie chart, donut chart, stacked area chart, stacked bar chart, treemap, pareto chart
pie chart characteristics
Circle represents the whole, wedge size represents part or percentage of the whole
when to use pie/donut charts
Parts make up a meaningful whole, parts are mutually exclusive, comparing parts to each other or to the whole, few parts (3 or less)
stacked area chart use
Showing changes over time for several variables, percentages add up to 100% vertically, showing raw counts and peaks/valleys
stacked bar chart characteristics
Similar to stacked area charts but with vertical bars, avoids angle perception problem of pie charts
treemap characteristics
Rectangles of varying sizes show relative proportions, works well for hierarchical data, can use color/gradient as second measure
pareto chart use
Examining cumulative contribution of parts to the whole, constructed with bar and line chart, based on pareto principle/80-20 rule
cautions with pareto charts
Fluctuating systems can produce inconsistent rankings, repeated analyses needed to confirm rankings, use control chart first to ensure system stability
when to use pareto charts
Data can be arranged into categories, rank of each category is important, analyzing frequency of problems/causes in a process
techniques and best practices
1) Grouping categorical items ad hoc
using pareto charts with percentile
scales
2) re-expressing values for
quantitative scaling
3) using line graphs for ranking
changes over time