Visualisation Flashcards
Data visualisation
Graphical representation of data to convey complex information, patterns, or insights in a clear and easily understandable way
Visual encodings
How we map data to graphical elements - position, colour, shape, etc.
When we design a data visualisation, we want to use as few visual encodings as possible
Perceptual hierarchy of visual encodings
Tells us which ones are the easiest for humans to read
When choosing the right chart, the primary goal is to prioritise visual encodings that are higher on the perceptual hierarchy
Primary goal when choosing the right chart
Prioritise visual encodings that are higher on the perceptual hierarchy e.g. length as opposed to area, volume or colour hue
Key components to any good story
Setup, conflict, resolution
Dashboards
Tools that enable users to monitor metrics and explore data interactively
Exploratory vs. Explanatory visualisations
Exploratory visualisations are for us to explore data and find patterns
Explanatory visualisations are for others to communicate and explain the patterns we have found through exploring the data
Best visualisations to capture a trend over time
Line chart: to capture how a numeric value changes over time
Multi-line chart: to capture how multiple numeric values change over time
Area chart: shows how a numeric value progresses by shading the area between the axis
Stacked area chart: to track the breakdown of a numeric value by subgroups
Spline chart: line chart in which data points are connected with smoothed curves to account for missing values
Best visualisations to visualise a single value
Card: show and track KPIs in dashboards or presentations
Table chart: best for small datasets, displays tabular data in a table
Gauge chart: often used in dashboard reports to show relevant KPIs
Best visualisations to capture distributions
Histogram: shows the distribution of a variable. Converts numerical data into bins as columns. X-axis shows the range and y-axis the frequency
Box plot: shows the distribution of a variable using 5 key summary statistics - minimum, first quartile, median, third quartile and maximum
Violin plot: variation of the box plot
Density plot: visualise a distribution by using smoothing to better capture the distribution shape of the data
Best visualisations to visualise relationships
Bar chart: one of the easiest charts to read which helps in quick comparison of categorical data. One axis contains categories and the other represents values
Column chart (vertical bar chart): categories are placed on the x-axis. Preferred over bar charts for short labels, date ranges or negatives in values
Scatter plot: most commonly used when observing relationships between two variables. Useful to spot potential correlations between data points
Connected scatter plot: hybrid between scatter plot and line chart, the scatter dots are connected with a line
Bubble chart: to visualise data points with 3 dimensions (namely visualised in the x and y axis and with the size of a bubble)
Word cloud chart: for visualising the most prevalent words in a text
Best visualisations to visualise a flow
Sankey chart: for representing flows in systems of any measurable quantity
Chord chart: for presenting weighted relationships or flows between nodes and highlighting the dominant or important flows
Network chart: consists of nodes and interconnected edges and illustrates how different items have relationships with each other
Best visualisations to visualise part-to-whole data
Pie chart: one of the most common, commonly used with percentages (useful with a small number of categories)
Donut pie chart: variant of the pie chart with a hole in the centre for readability
Heat maps: two-dimensional charts that use colour shading to represent data trends
Stacked column chart: best to compare subcategories within categorical data and to compare percentages
Treemap charts: 2D rectangles whose size is proportional to the value being measured and used to display hierarchically structured data
Line plot
Show changes in numeric values over time
Bar plot
Visualises numeric values by categories (ranked or unranked)