Chapter 3 Flashcards
what is the mode?
the value that occurs most often
what is the median?
the middle value in a list after the values have been put in order
how to figure out the median of a data set with an odd number of values?
the 1/2(n+1)th value in the ordered list
how to figure out the median of a data set with an even number of values?
the mean of the (n/2)th and (n/2 + 1)th values in the ordered list
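A minimal Python sketch of the two median rules (odd and even number of values). The function name `median` and the sample lists are only illustrations, not part of the chapter:

```python
def median(values):
    """Return the median of a list using the odd/even position rules."""
    ordered = sorted(values)  # the values must be put in order first
    n = len(ordered)
    if n == 0:
        raise ValueError("median of an empty list is undefined")
    if n % 2 == 1:
        # odd n: take the 1/2(n+1)th value (1-based position)
        return ordered[(n + 1) // 2 - 1]
    # even n: take the mean of the (n/2)th and (n/2 + 1)th values
    return (ordered[n // 2 - 1] + ordered[n // 2]) / 2


print(median([7, 3, 9, 1, 5]))  # odd number of values
print(median([7, 3, 9, 1]))     # even number of values
```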
what is the mode of data in a frequency table?
the category or class with the highest frequency
what is the median of data in a frequency table?
the 1/2(n+1)th value, where n is the total frequency
how to figure out the mean from a frequency table for discrete data?
total of fx / total of f (the sum of value x frequency, divided by the total frequency)
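A short Python sketch of the mean from a frequency table, assuming the table is given as (value, frequency) pairs. The name `frequency_mean` and the sample table are made up for illustration:

```python
def frequency_mean(table):
    """Mean of discrete data given as (value, frequency) pairs."""
    total_fx = sum(x * f for x, f in table)  # sum of value * frequency
    total_f = sum(f for _, f in table)       # total frequency
    if total_f == 0:
        raise ValueError("total frequency must be greater than zero")
    return total_fx / total_f


# hypothetical table: value 0 occurs 3 times, 1 occurs 5 times, 2 occurs twice
table = [(0, 3), (1, 5), (2, 2)]
print(frequency_mean(table))
```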
what is the median for grouped continuous data?
the (n/2)th value
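A possible Python sketch for locating the (n/2)th value in grouped continuous data. It assumes the value is estimated by linear interpolation inside the class that contains it, which the card itself does not state. The function name and the (lower bound, upper bound, frequency) layout are assumptions for illustration:

```python
def grouped_median(classes):
    """Estimate the median of grouped data given as
    (lower_bound, upper_bound, frequency) tuples in ascending order.
    Assumes values are spread evenly within each class."""
    n = sum(f for _, _, f in classes)
    target = n / 2          # position of the median: the (n/2)th value
    cumulative = 0          # cumulative frequency before the current class
    for lower, upper, f in classes:
        if f > 0 and cumulative + f >= target:
            # fraction of the way through this class, by linear interpolation
            return lower + (target - cumulative) / f * (upper - lower)
        cumulative += f
    raise ValueError("no data in the table")


# hypothetical classes: 0-10 (4 values), 10-20 (10 values), 20-30 (6 values)
print(grouped_median([(0, 10, 4), (10, 20, 10), (20, 30, 6)]))
```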
how to figure out the upper quartile from discrete data?
the 3/4(n+1)th value
what are percentiles?
the values that divide an ordered set of data into 100 equal parts
how to figure out the median from discrete data?
the 1/2(n+1)th value
how to figure out the lower quartile from discrete data?
the 1/4(n+1)th value
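A rough Python sketch of the lower quartile, median and upper quartile positions for discrete data, using the 1/4(n+1), 1/2(n+1) and 3/4(n+1) rules. When a position is not a whole number, this sketch interpolates between the two neighbouring values; that is an assumption, and some courses round the position instead. The helper names are hypothetical:

```python
def value_at_position(ordered, position):
    """Value at a 1-based position in an ordered list.
    Fractional positions are linearly interpolated (an assumption)."""
    position = min(max(position, 1), len(ordered))  # keep within the list
    index = int(position) - 1                       # 0-based index below
    fraction = position - int(position)
    if fraction == 0:
        return ordered[index]
    return ordered[index] + fraction * (ordered[index + 1] - ordered[index])


def quartiles(values):
    """Return (LQ, median, UQ) using the k/4 * (n + 1) position rule."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("quartiles of an empty list are undefined")
    return tuple(value_at_position(ordered, k * (n + 1) / 4) for k in (1, 2, 3))


# n = 7, so the positions are the 2nd, 4th and 6th values
print(quartiles([13, 1, 7, 3, 11, 5, 9]))
```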
what are deciles?
the values that divide an ordered set of data into 10 equal parts
what is standard deviation a measure of?
how much the values deviate from the mean, i.e. how spread out they are
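A small Python sketch of standard deviation as a measure of spread about the mean. It assumes the version that divides by n (the card does not say which version the chapter uses); the function name and data are illustrative:

```python
from math import sqrt


def standard_deviation(values):
    """Square root of the mean squared deviation from the mean
    (divides by n; this choice is an assumption)."""
    n = len(values)
    if n == 0:
        raise ValueError("standard deviation of an empty list is undefined")
    mean = sum(values) / n
    return sqrt(sum((x - mean) ** 2 for x in values) / n)


# hypothetical data with mean 5
print(standard_deviation([2, 4, 4, 4, 5, 5, 7, 9]))
```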
what does a box plot represent?
- maximum & minimum values
- median & upper and lower quartiles
what is an outlier?
- any value that is more than 1.5 times the IQR below the LQ (less than LQ - 1.5 x IQR)
- any value that is more than 1.5 times the IQR above the UQ (greater than UQ + 1.5 x IQR)
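A minimal Python sketch of the 1.5 x IQR outlier rule. To keep it short, the LQ and UQ are passed in rather than calculated; the function name and the sample numbers are made up:

```python
def outliers_by_iqr(values, lq, uq):
    """Return the values lying more than 1.5 * IQR below the LQ
    or more than 1.5 * IQR above the UQ."""
    iqr = uq - lq
    lower_fence = lq - 1.5 * iqr
    upper_fence = uq + 1.5 * iqr
    return [v for v in values if v < lower_fence or v > upper_fence]


# hypothetical data with LQ = 10 and UQ = 16, so the fences are 1 and 25
data = [-2, 10, 12, 13, 15, 16, 40]
print(outliers_by_iqr(data, lq=10, uq=16))
```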
what is an outlier (in terms of standard deviation and mean)?
a value more than 3 standard deviations from the mean
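The same idea as a short Python sketch, using the standard library's `statistics.mean` and `statistics.pstdev`. Using the divide-by-n standard deviation is an assumption, and the function name and data are illustrative:

```python
from statistics import mean, pstdev


def outliers_by_sd(values, limit=3):
    """Return the values more than `limit` standard deviations from the mean."""
    m = mean(values)
    sd = pstdev(values)  # divide-by-n standard deviation (an assumption)
    return [v for v in values if abs(v - m) > limit * sd]


# hypothetical data: twenty values of 10 and one value of 100
data = [10] * 20 + [100]
print(outliers_by_sd(data))
```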
positive skew in box plots?
median closer to the LQ than to the UQ
negative skew in box plots?
median closer to the UQ than to the LQ
positive skew- mean, median and mode?
mean > median > mode
negative skew- mean, median and mode?
mode > median > mean
a positive value for the measure of skew means what?
positive skew
a negative value for the measure of skew means what?
negative skew
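A hedged Python sketch of how the sign of a skew value can be read. The cards do not name the formula, so this assumes the common measure 3 x (mean - median) / standard deviation; the function name and data are hypothetical:

```python
from statistics import mean, median, pstdev


def skew(values):
    """Assumed measure of skew: 3 * (mean - median) / standard deviation."""
    sd = pstdev(values)  # divide-by-n standard deviation (an assumption)
    if sd == 0:
        raise ValueError("skew is undefined when all values are equal")
    return 3 * (mean(values) - median(values)) / sd


right_tail = [1, 2, 3, 4, 10]      # mean is pulled above the median
left_tail = [-10, -4, -3, -2, -1]  # mean is pulled below the median
print(skew(right_tail) > 0)  # positive value, so positive skew
print(skew(left_tail) < 0)   # negative value, so negative skew
```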
advantages of the mode?
- easy to find
- can be used with any type of data
- unaffected by open-ended or extreme values
disadvantages of the mode?
- there may be no mode, or there may be more than one
- can't be used to calculate a measure of spread
advantages of the median?
- easy to calculate
- unaffected by extreme values
- best to use when data is skewed
- can be used to calculate quartiles, IQR and skew
disadvantages of the median?
- may not be a data value
advantages of the mean?
- uses all the data
- can be used to calculate standard deviation & skew
disadvantages of the mean?
- always affected by extreme values
- can be distorted by open-ended classes
what do you need to compare 2 or more data sets?
- need to compare at least an average and a measure of spread
- or you can compare the skew of the distribution