Week 3 - Visualisation Flashcards
relative frequency distribution
the fraction of occurrences of each value of variable.
bar graph
Bar graph: height of rectangular bars to display the frequency distribution of categorical variables.
histogram
Histogram: uses the area of rectangular bars to display the frequency distribution of a numerical variable.
Mode is highest peak
Symmetric of skewed
Outliers
contingency table
frequency of occurrence of all combinations of two or more categorical variables.
grouped bar chart
frequency distributions of two or more categorical variables.
mosaic plot
stacked bar chart that displays two or more categorical variables.
scatter plot
graphical display of two numerical variables in which each observation is represented as a point on a graph with two axes.
strip plot
graphic display of a numerical variable and a categorical variable in which each observation is represented as a dot.
violin plot
shows an approximation of the frequency distribution of a numerical variable in each group and its mirror image.
line graph
uses dots connected by line segments to display trends over time in a summary measurement such as the mean or an ordered series.
map
spatial equivalent to a line graph using colour gradient to display the numerical response variable at multiple locations.
how to graph?
Show the data - help you see potential problems and not hide anything
Make patterns easy to see - highlight trends, make comparisons easy, colour and shade, don’t use 3D
Represent magnitudes honestly - must start at zero with bar graphs but can use a different plotting method
Draw elements clearly