Data Distribution Flashcards

1
Q

what is the purpose of standard deviation?

A

quantified the spread of data around the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

how would you calculate standard deviation?

A
  1. find mean
  2. find square of each data point’s distance to the mean
  3. sum values so far
  4. divide by number of data points
  5. square root
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is the standard error?

A

spread of sample means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what are the differences between standard deviation and standard error?

A

SD - measures spread around mean for this population
SE - measures spread of several means from various populations and estimates how far away from sample mean the true mean is

SD - describes spread around mean
SE - estimates real mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

normal distribution:

how much of the data is within 1 standard deviation of the mean?

A

68%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

normal distribution:

how much of the data is within 2 standard deviations of the mean?

A

95%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

normal distribution:

how much of the data is within 3 standard deviations of the mean?

A

99.7%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is a negative skew of data?

A

peak to the right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is a positive skew of data?

A

peak to the left

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is a vertical skew of data?

A

peak in the middle, but not normally distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is the IQR?

A

measure of spread

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

how would you calculate the IQR?

A

UQ - LQ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

when is the IQR used?

A
  • when data is not normally distributed

- alongside median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what are confidence intervals?

A

range of values our population mean is likely to lie in

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what does a 95% confidence interval mean?

A

there is a 95% chance the mean lies within the interval

- confidently limits the parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

how would you calculate a 95% confidence interval?

A

mean +/- (1.96 x SE)

mean +/- (1.96 x SD/root n)

17
Q

what are histograms?

A
  • continuous box plots

- representing bars cover a range

18
Q

what are box and whisker plots?

A

compare continuous variable between multiple groups

19
Q

what are scatterplots?

A

display 2 continuous variables