5 Named Graph Info & Uses Flashcards
When would you use a Scatterplot?
To visualise the relationship between two numeric values
Name 5 graphs
Scatterplot, Linegraph, Boxplot, Histogram, Barplot
What is the name for the phenomena of a confusing cluster of points on a scatterplot?
Overplotting
When overplotting on a scatterplot occurs, what are two ways to address this using R?
Alpha = (0:1) for transparency,
geom_jitter (width = x, length = y) for a ‘nudge’
When would you use a linegraph?
To explore the relationship between two numeric values, when the explanatory value on the X-axis is of a sequential nature.
What is the X-axis of a linegraph also known as?
The explanatory value, it is sequential. eg, hours, days, weeks.
What would you call a linegraph that represents some notion of time?
A time series
When using the ggplot() package, what sign can we use to add another grammatical component or layer to the code?
+
How would you see all 657 colours in R?
colours()
What is a histogram
a distribution visualisation of the values in a numerical variable
Why might you use a histogram instead of a horizontal line to visualise the distribution of a numerical variable?
A histogram makes it easier to visualise a large volume of values.
In a histogram, what are the bars?
The bars are called bins. They represent the division of the x axis by a determined factor. The height of the bar shows the number of observations for any given value in the variable.
In R, you can improve the visualisation of a Histogram using a colour border. How would you write that?
geom_histogram(colour = “white”)
Why might you change the bin width in a histogram?
The default value might not be easy to a useful range to interpret.
What are two ways to improve the distribution visualisation if the bin defaults are not useful?
- change the bin width: bins = 40
.2. Increase or decrease the number of bins: binwidth = 10