Data Visualisation Flashcards
What is used to help me understand the data?
- Stem and leaf plots
- box plots
- Histograms
- Looking for skewness
- Comparing data sets visually
What is used to help others understand data?
- Bar Graphs
- Scatterplots
What do histograms do?
Good at showing the distribution of the data.
What do stem and leap plots do?
And how do they work?
Show distribution of the data
Easy to do, Stem makes up the tens and the leafs make up the units
Why are boxplots useful?
-useful for showing medians, ranges, IQ range, skewness etc.
Describe the structure of a box plot?
Upper adjacent value is the line at the top
Lower adjacent value is the line at the bottom(difference between two provides range)
The top and bottom line are known as upper and lower hinge( Interquartile range is difference between the two)
Line in the middle is the median
- Outliers are displayed as dots further away from adjacent values
- extreme outliers are displayed as stars
What do whiskers on a box plot indicate about data?
The more similar in length the more likely that the data is normally distributed vice versa
What is a frequency distribution?
- Histograms
- made of touching bars
- each bar is the frequency that the given value occurs
- We can count how many people have a healthy heart rate
What is a probability distribution?
- Bell curves
- Smoothed over to flow nicely
- Area under curve is the probability that the value occurs
- We can work out the likelihood of a person producing a certain outcome with a normal distribution.
Why is it ideal to have normally distributed data?
-It allows us to do better an more accurate statistical tests.
What are two ways in which a distribution can deviate away from normality?
- Lack of symmetry (skewness)
- Pointyness (Kurtosis)
What is skewness?
Deviation from symmetry
It also means some extreme scores are affecting the mean
What is kurtosis?
Refers to the extent to which scores cluster at the ends (tails) of the distribution which tends to change how pointy they look.
What is a distribution with positive kurtosis known as ?
Leptokurtic distribution
-more pointy
What is a distribution with negative kurtosis known as?
Platykurtic
-Tends to be flatter than normal