Lesson 2- Data Description Flashcards

0
Q

Measures of center

A
  • mean: X resistant to outliers
  • median: resistant
  • mode
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

Quantitative description of data

A
  • center: mean, median, mode

- spread (variability): range, quartiles (IQR), variance, standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Measures of spread

A
  • range (max. - min.)
  • quartile (Q1, M, Q3)
  • IQR: middle 50%
  • outlier criterion: if observation falls 1.5(IQR) below Q1 or above Q3
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Boxplots

A
  • show right or left-skew
  • 5 number summary (min, Q1, M, Q3, max)
  • central box spans quartiles
  • line in box is median
  • conceals outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Modified boxplot

A

-outliers plotted individually

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Standard deviation

A

s= [ 1/(n-1)*sum(xi -xmean)^2 ] ^ (1/2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Measures of spread 2

A
  • standard deviation
  • degrees of freedom: the sum of deviations = 0, the last deviation can be found once we know the other n-1 deviations, only n-1 can vary freely therefore we divide by n-1
  • variance = s^2
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Effect of changing units of measure

A
  • linear transformation, x -> a + bx
    • effect of ‘a’: adds a to center and quartiles, but X effect on spread
    • effect of ‘b’: multiplies both center and spread (StD, IQR) by b
  • skew, median, mean
    • right-skew: mean is greater than median. CFP goes up quickly
    • left-skew: mean is less than median. CFP goes up slowly
How well did you know this?
1
Not at all
2
3
4
5
Perfectly