Descriptive and Inferential Statistics Flashcards
What is descriptive statistics?
Deal with an entire dataset e.g. population, goal is to summarise raw data and represent graphically and not extend conclusions beyond the observed dataset.
What is inferential statistics?
Goal is to make inferences beyond your data. To infer something about a population based on a smaller model, or sample. Make an estimation of a population parameter from a statistic or test a hypothesis.
What is the arithmetic mean?
Add all values and divide between number of values. Sensitive to outliers.
What is the geometric mean?
Multiply values and take nth root. Can reduce the effect of outliers.
What is the weighted mean?
Times each value but its ‘weight’, add together and divide by value of all weights.
What measure of centrality is best for normally distributed data?
Mean, median or mode
What measure of centrality is best for negatively or positively skewed data?
Mode (3 measures of centrality will not coincide)
What measure of central tendency would we use for categorical data?
Mode
Define variation
Average distance an observation is from the mean
How do you calculate variation?
Subtract each value from the mean, then square the result. Then work out arithmetic mean of these numbers.
‘sum of squared differences from the mean’
What is standard deviation?
Square root of variance.
Larger sd= wider spread of data
Why and how to we adjust variance equation?
Divide by n-1 instead of n.
This brings variance estimation closer to true population variance.
Can we use variance and sd for all types of data?
Only for normally distributed data.
What measures of variation can we use for skewed data?
Quartiles or box plots
What is the empirical rule?
States that 68.26% data values lie within +/- 1 sd
- 45% within +/- 2sd
- 74% within +/-3 sd