16 - Averages and the normal distribution Flashcards
grouped data definition
where the frequency is shown in terms of a range
ungrouped data definition
individual data points
how to find mode from grouped data (most commonly in a histogram)
identify highest frequency class, draw diagonal line from top of block to either side of the highest class
intercept = estimated modal value readable from x axis
how to find median from an ogive
median rank = cumulative total of the variable / 2
then look across y axis at this rank
advantages and disadvantages of mean
+
- used frequently
- understood
- uses all data
- may not be value in distribution
- distorted by extreme high low values
- ignores dispersion
advantages and disadvantages of mode
+
- not distorted by high/low values
- corresponds to actual value in data
- ignores dispersion
- doesnt take all data into account
advantages and disadvantages of median
+
- not distorted by high/low values
- corresponds to actual value in distribution
- ignores dispersion
- limited use
what is dispersion
method of determining location or central point of distribution
shows the spread of a variable about its average
standard deviation is a measure of dispersion, larger the SD, more dispersed the data is
what is x in grouped data
the mid point
how to work out fx2 (squared)
frequency multiplied by x2
what is the coefficient of the variance
standard deviation as a % of the mean - higher %, higher dispersion
properties of standard deviation
- based on all values in distribution
- suitable for further statistical analysis
- more difficult to understand
what is variance
square of standard deviation
what is range and quartiles
RANGE = measure of spread between highest and lowest values
QUARTILES = divide distribution into quarters
interquartile range
Q3 - Q1