Lecture 3 Flashcards

Measures of Dispersion

1
Q

Why is knowing the dispersion of data important?

A

Two different data sets may share the same mean, but the dispersion (spread) of that data may be entirely different

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Measures of Dispersion

A

Range (IQR or Median), Variance and Standard Deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Range

A

Calculated by subtracting the smallest value from the largest value. Strongly affected by outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

IQR x 1.5 rule

A

We call an observation a suspected outlier if it falls more than 1.5 x IQR above the third quartile or below the first quartile. Can be used to justify excluding outlier data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Variance

A

The overall degree to which data points differ from the mean and each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How to calculate variance?

A
  1. Find the difference between each value and the mean.
  2. Square these differences
  3. Add these differences together
  4. Divide by the number of data points minus 1 (if working with a sample – if working with the population simply divide by the number of data points)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why do we calculate standard deviation rather than variance?

A

Variance: we square the differences between the individual data points and the mean, which means the units of measurement are less intuitive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Standard Deviation

A

A very useful measure of dispersion IF our data is normally distributed. Nearly all values/observations fall within 2 standard deviations either side of the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The 68-95-99.7 Rule

A

68% of data will lie 1SD away from the mean. 95% of data will lie 2DS’s away from the mean. 99.7% of data will lie 3SD’s away from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly