2. Data Visualisations Flashcards
How do you draw a boxplot?
- vertical axis represents data values
- Draw a box from lower quartile to median
- draw box from median to upper quartile
- decide which points are circles and plot with cicles (LQ - 1.5IQR and UP + 1.5IQR)
- draw whiskers to the minimum and maximum non-outlier datapoints
How is data organised in histograms?
In bins, intervals of data
What is the frequency f of bin on a histogram?
the number of observations in that bin
what is the relative frequency of a bin on a histogram?
f/n, the proportion of observations of the observations in that bin
what is the frequency density of bins on a histogram?
the relative frequency divided by the size of the bin
what is the area of a bar on a histogram?
relative frequency
can you work out mean and variance of data on a histogram?
no as it not exact, but you can approximate mean and variance
how do you approximate the mean on a histogram?
x(hat) = 1/n sum(frequency * midpoint, number of bins)
how do you approximate the variance on a histogram?
Var = 1/n-1 sum(frequency + (midpoint - mean)^2 , number of bins)
when would you use a scatterplot?
when more than one piece of data is collected from each subject, and you want to compare the data to find a relationship between variables.
how would you work out correlation between variables?
Sxy=SxSy
where Sxy is the sample covariance