summarising numerical data Flashcards
what is the difference between discrete and continuous data?
discrete = data takes on whole unit values only - i.e) number of children, number of episodes of angina
continuous= data can take on any value i.e) blood pressure/weight
what are the two things that can go wrong when we ‘summarize’ data?
- we present aspects of the data that lead to the wrong conclusion
- we leave out important aspects of the data, leading to the reader drawing the wrong conclusion
when is a numerical representation useful? When is it not useful?
When you need exact information = useful
when you want people to appreciate a pattern/significant = awful
When should we ‘summarise’ data?
when there are too many individual measurements to effectively display on a graph/visual
what is a ‘quantile’?
a figure that cuts off a specified percentage of the data values -
ex) the 25th percentile or below, the 50th percentile or below etc.
what is the ‘median’?
the half way point of the data values -
half of the values lie at or below the median , it is the 50th percentile
how does the median differ from the mean?
the mean = the average
Will an outlier effect the median or the mean?
an outlier will effect the mean, but not the median
why is the range not a good idea of variability?
b/c it is based on only two pieces of data - the biggest and smallest value which are likely to be atypical cases or even errors!
what is the interquartile range?
divide the values into ‘quartiles’ and then give the 25%- 75% range as an indication apart from outliers
what is the difference between a quartile and quantile?
quartile = values that cut data into four equal groups
quintiles = cut data into 5 equal groups
how can quantiles help with understanding?
ex) if you get a test score back of 58% you’ll be pretty bummed until you learn that you’re in the 90th percentile for the class- in other words, you did 9/10ths better than the class!
how how to interpret/visualise the median and interquartile range
What does a boxplot show?
how does a boxplot show quartiles?
it builds a box around the 25th and 75th percentiles