Statistics Y10 Summarising Data, Scatter Graphs and Correlation EoT test Flashcards
what are avergaes
measures of central tendency or location
what is the mode for discrete data
the value that occurs most often
what is bimodal data
data with two modes
what is the median for discrete data
the middle value in an ordered list
calculate the median value for discrete data
1/2(number of values+1)
what is the mean for discrete data
the sum of all the values divided by the number of values
calculate mean for discrete data
Σ values / number of values
what is the mode for grouped data
the group with the highest frequency
calculate mean for grouped discrete data
Σ frequency*value / Σ frequency
calculate the median value for continuous data
1/2(number of values)
estimate median using linear interpolation
L + ((n/2 - F) / f) * w
L = lower boundary or median class
n = number of values
F = cumulative frequency up to median class
f = frequency of median class
w = width of median class
estimate mean for continuous data
Σ frequency*midpoint / Σ frequency
calculate geometric mean
nth root (value1 * value2 … * value n)
what happens to the averages when all the data values are increased by the same amount/percentage
the averages are increased by the same amount/percentage
calculate weighted mean
Σ value*weight / Σ weight
calculate range
largest - smallest value
calculate upper and lower quartiles for discrete data
Q1 = 1/4 (number of values + 1)th value
Q3 = 3/4 (number of values + 1)th value
calculate upper and lower quartiles for continuous data
Q1 = 1/4 (number of values)th value
Q3 = 3/4 (number of values)th value
calculate interquartile range
upper quartile - lower quartile
calculate standard deviation for discrete data
square root( (Σx² / n) - (Σx / n)² )
calculate standard deviation for continuous data
square root( (Σfx² / Σf) - (Σfx / Σf)² )
calculate boundaries for outliers
lowest < (Q1 - 1.5 * IQR)
highest > (Q3 + 1.5 * IQR)
calculate skew
3(mean - median) / standard deviation
what variable is on the x-axis
independent/explanatory
what variable is on the y-axis
dependent/response
what is a correlation
an association between variables
what is interpolation
estimating the value of a variable within the range of the data set
what is exterpolation
estimating the value of a variable outside the range of the data set
calculate gradient
change in y / change in x
calculate y-intercept
meanY - gradient * meanX
calculate spearman’s rank correlation coefficient
1 - (6 * Σd²) / (n * (n² - 1) )
d = difference between ranks (lowest > highest) of each variable
n = number of data values
what is perason’s product moment correlation coefficient suitable for
random samples from normal distribution