week 3 -- Summarizing data & Comparing distributions Flashcards

1
Q

descriptive vs inferential statistics

A
d = what we can do to DESCRIBE the population
i = what we can do to MAKE A GENERALIZED STATEMENT about the entire population based on the observed sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

shape (descriptive)

A

symmetry?
modes (peaks)?
skew?
extreme values?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

center (descriptive)

A

MODE

MEDIAN (splits dataset down the middle – robust for extreme values)

MEAN “balances” the data (i.e., takes into account frequency), can be misleading with datasets with extreme values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

spread (descriptive)

A

RANGE (max-min)

INTERQUARTILE range (middle half of data

Median separates halves of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

5-number summary

A
Q1-Q3 = middle half of data
max = unusual data point? or just highest point in data?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

outlier vs extreme value

A

“outlier” is a DEFINED term (in terms of data depiction), “extreme value” is a general term

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

If you draw a card at random from a well shuffled
deck, is getting an ace independent of the suit?
Explain.

A

Yes. There is the same number of aces in each suit, so no matter which suit you draw, the
probability of getting an ace does not change.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When are descriptive stats the same as inferential?

A

When population = sample (known as a census)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does a bimodal pattern reflect?

A

e.g., two separate populations being lumped together – recheck trials/participants (is there anything that can explain the data?)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Draw a boxplot

A

1) Draw small lines at the median, IQ1 and IQ3 – make a box.
2) “Fence in the data” at 1.5xIQR above and below the IQs (just for reference, not part of boxplot)
3) draw “whisker” to the most extreme values found within the fences
4) add outliers outside the fences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What do boxplots tell us?

A

They help us compare the distribution of groups or categories we wish to compare

small IQR shows consistent performance

asymmetrical whiskers or box shows skew

outliers always derserve our attention! examine them in context of the data – is it an error or just an extreme value?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What to compare between boxplots

A

shapes (symmetric or skewed)
medians (which is higher? any pattern?)
IQRs (which group is more spread out? any pattern to IQR change?
outliers)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly