Data visualisation Flashcards
Why do we use figures and tables?
To analyse the data to draw a conclusion
Why do we use graphs?
Data visualisation
What makes a good graph?
- Show the data without distortions
- Present many numbers with minimum ink
- Make large data sets coherent
- Induce reader to think about and compare data
- Always include title, label axis, good scale, use colour-blind friendly colours, y-axis should start at 0
What is a histogram?
Presents frequency distribution
X axis: scores from small to high
Y axis: frequency density
Some have a density curve
Shape of distribution determines the type of analysis to perform
What is a normal distribution?
Gaussian distribution or bell curve
Mode = median = mean
What is the symmetry distribution? (Skewness)
Negative skew = left foot
Positive skew = right foot
What is the shape of the distribution? (Kurtosis)
Leptokurtic = spiked (most data in the middle)
Platikurtic = flat (data widely distributed around the mean)
What is the bimodal distribution?
When there are two modes, thus two normal distributions in the same data set
Why might data not be normally distributed? (4 reasons)
1) Outliers (data point significantly differing from other observations)
2) Small sample size - insufficient data
3) Multiple distributions (bimodal or multimodal)
4) Measurement issues
What is the central limit theorem?
The sampling distribution of the mean approaches a normal distribution as the sample size increases
What is a box plot/ box and whiskers diagram composed of?
- Min and max score
- First and third quartiles
- Median
- Outliers (data point located outside the whiskers of the box plot)
How can you identify outliers in a box plot?
Over 1.5 times the IQR above the upper quartile or below the lower quartile:
1.5 x IQR = ans
Q1 - ans
Q3 + ans
What is a violin plot a combination of?
Box plot and histogram
What figures/graphs are used for a categorical design?
Looking at frequency of occurrence across two categories
- Bar chart
What figures/graphs are used for an experimental design?
Looking at differences between conditions after manipulating variables
- Nominal - bar chart
- Ordinal - line graph
(Include error bars)
What figures/graphs are used for a correlational design?
Looking at relationships between 2 continuous variables
- Scatter plot or line graph
How are figures reported by APA standards?
- Figure - any visual display other than a table
- All figures should be numbered (fig 1)
- Should have a title
- Images are clear
- Lines smooth and sharp and reasonable size
- Font is legible and simple (TNR)
- Units of measurement are provided
- Axes are clearly labelled
- Elements within the figure are clearly labelled or explained
- Left alignment
- Any notes (optional) go underneath the figure (e.g. to define the variables)
- Double line spacing
- Can be pasted/embedded within/among the text, or placed at the end of the text
How are tables presented by APA standards?
- Should be numbered independently from figures (table 1)
- Below, clear and informative title in italics
- Left alignment
- Double spaced
- All on one page
- Can be pasted/embedded within/among the text, or placed at the end of the text
When should a figure be used?
- When showing trends, patterns, relationships across data
- Summary of research results
- e.g. mean differences between groups
When should a table be used?
- When presenting a lot of numerical info
- Showing precise values
- To compare and contrast data values or characteristics among related items
- e.g. demographics table - age or gender of participants
When should text be used to present data?
- When there is not extensive or complicated data to present
- When putting data into a table would have 2 or fewer columns
e.g. when reporting few values or comparing one variable against another