data analysis: descriptive statistics Flashcards
what are the measures of central tendency
these are “averages” which give us information about the most typical values in a set of data. There are three of these to consider:
- mean
- median
- mode
what is an advantage of the mean
the mean, also known as the average, is the most sensitive of the measures of central tendency as it includes all of the scores/values in the data within the calculation
This means it is representative of the data as a whole
what is a limitation of the mean
the mean can be easily distorted by extreme values
e.g. in the list 5, 7, 7, 8, 9, 11, 12, 14, 15, 17 the mean is 10.5. If we replace 17 with 98, the mean becomes 18.6, which does not represent the data overall
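The effect of one extreme value on the mean can be checked with a short sketch. It uses the list from the example; the variable names are only illustrative.

```python
from statistics import mean

scores = [5, 7, 7, 8, 9, 11, 12, 14, 15, 17]
# Same list, but with the last score (17) replaced by the extreme value 98.
with_outlier = scores[:-1] + [98]

original_mean = mean(scores)
distorted_mean = mean(with_outlier)

# One changed score out of ten is enough to pull the mean well above
# nine of the ten values in the list.
print(original_mean, distorted_mean)
```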
what is the median
the median is the middle number in a data set when scores are arranged from lowest to highest
in a list with an even number of scores, we take the mean of the two middle numbers (add them together and halve the total)
what are some advantages of the median
extreme values do not affect it, so even if we replaced 17 with 98 (as seen with the list in the mean example), the median would stay the same
It is easy to calculate once the values are in order
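A minimal sketch of the same comparison for the middle value (the median), reusing the list from the mean example. The variable names are assumptions made for illustration.

```python
from statistics import median

scores = [5, 7, 7, 8, 9, 11, 12, 14, 15, 17]
with_outlier = scores[:-1] + [98]  # 17 replaced by 98

# There are ten scores (an even number), so the median is the mean of
# the two middle values, 9 and 11, in both lists.
original_median = median(scores)
outlier_median = median(with_outlier)

print(original_median, outlier_median)  # 10.0 10.0
```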
what is a disadvantage of the median
the median is less sensitive than the mean as not all scores are included in the final calculation
what is the mode
the mode is the most frequently occurring score/value within a data set
In some data sets there may be two modes (bimodal) or no mode if all the scores are different
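The three cases above (one mode, two modes, no mode) can be sketched as follows. The helper `modes` is a hypothetical function written for this card, not a library call, and the example lists are made up.

```python
from collections import Counter

def modes(values):
    """Return the most frequent value(s), or [] if every value occurs once."""
    if not values:
        return []
    counts = Counter(values)
    highest = max(counts.values())
    if highest == 1:
        # All scores are different, so there is no mode.
        return []
    return [value for value, count in counts.items() if count == highest]

one_mode = modes([5, 7, 7, 8, 9])
two_modes = modes([1, 1, 2, 2, 3])
no_mode = modes([1, 2, 3])

print(one_mode, two_modes, no_mode)  # [7] [1, 2] []
```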
what is a strength of the mode
for some data (data in categories) the mode is the only measure you can use
e.g. if you asked your class to list their favourite desserts, the only way to identify the most “typical” or the average value would be selecting the modal group
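As a rough illustration, the modal group for category data can be found by counting. The dessert responses below are hypothetical and made up for this sketch.

```python
from collections import Counter

# Hypothetical class responses, invented for illustration only.
favourite_desserts = ["cake", "ice cream", "cake", "brownie", "ice cream", "cake"]

# Count how often each category appears and take the most frequent one.
# No arithmetic on the values themselves is needed, only counting.
modal_group, frequency = Counter(favourite_desserts).most_common(1)[0]

print(modal_group, frequency)  # cake 3
```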
what is a limitation of the mode
although the mode is easy to calculate, it is still a very crude measure, as it ignores every score other than the most frequent one
what are measures of dispersion
measures of dispersion are based on the spread of scores
Basically how far scores vary and differ from one another
what are the different measures of dispersion
Range
Standard Deviation
how do we calculate the range
the range is calculated by this equation: highest value - lowest value + 1
we add 1 as a mathematical correction that allows for the fact that raw scores are often rounded up (or down) when they are recorded within research. Essentially, the addition of 1 accounts for the margin of error
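A minimal sketch of the equation above, using the list from the mean example. The function name `range_with_correction` is hypothetical.

```python
def range_with_correction(scores):
    """Range as given on this card: highest value - lowest value + 1."""
    return max(scores) - min(scores) + 1

scores = [5, 7, 7, 8, 9, 11, 12, 14, 15, 17]
score_range = range_with_correction(scores)

# 17 - 5 + 1
print(score_range)  # 13
```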
what is a strength of the range
it is easy to calculate
what is a limitation of the range
it only takes into account the two most extreme values, and this may be unrepresentative of the data set as a whole
what is standard deviation
standard deviation is a more sophisticated measure of dispersion
This is a single value that tells us how far scores “deviate” from the mean
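The card does not give a formula, so the sketch below is an assumption: it uses the population form of standard deviation (dividing by the number of scores). Some courses divide by one less than the number of scores instead, which gives a slightly larger value.

```python
import math
from statistics import mean, pstdev

scores = [5, 7, 7, 8, 9, 11, 12, 14, 15, 17]

# Step 1: find the mean of the scores.
average = mean(scores)
# Step 2: square how far each score deviates from the mean.
squared_deviations = [(score - average) ** 2 for score in scores]
# Step 3: average the squared deviations, then take the square root.
standard_deviation = math.sqrt(sum(squared_deviations) / len(scores))

# The standard library's population standard deviation, for comparison.
library_value = pstdev(scores)
print(round(standard_deviation, 2), round(library_value, 2))
```

A larger value means the scores are more spread out around the mean; a smaller value means they are clustered close to it.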