DECK 2 Flashcards

1
Q

What is a contingency table

A

shows distributions across 2 variables like gender acros music pref (male/female across hip hop/country/ classical). AKA 2-way table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How can you tell if variables in a contingency table are independent?

A

If the distributions are the same across the variables.. Then it doesn’t DEPEND on which category it came from, it still has same likelihood as others….. it’s INDEPENDENT.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

marginal distribution

A

overall distributions of a single variable in contingency table (out in margins)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

conditional distribution?

A

A distribution within the table, along only one row or column on the inside of the table… NOT IN THE MARGINS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Association and Independence?. How are they related?

A

Variables are either ASSOCIATED or INDEPENDENT. If they are associated, then they are not independent, if they are independent then they are not associated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

mean/SD/median/IQR? what to use?

A

when unimodal and symmetric, us MEAN and SD. If skewed or outliers use Median and IQR. If BIMODAL Talk about the MODES and use range or IQR,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do you describe distributions (histograms)?

A

Shape-Cener-Spread-Outlierg-Gaps —- GSOCS? where’s yo GSOCS?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

If asked to compare distributions, what should you write about?

A

GSOCS.. Write a sentence comparing shapes, and then one comparing centers, then comparing spreads and finally gaps and outliers…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Center description?

A

mean (balance), median (splits area in half), mode (peaks? if bimodal, talk about both modes) or ?. “centered around ____”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Shape description?

A

unimodal, bimodal, multimodal, uniform, symmetric, skewed,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Spread description?

A

We have many measures of spread: range, IQR, stand dev, variance, or simply say. From here to about here.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what happens if you ADD a constant to each value in a data set?

A

it is SHIFTED only. This effects all of the data values and measures of center (mean, med) and quartiles, deciles, etc… IT DOES NOT CHANGE THE SPREAD!
(IQR, St Dev, Range all stay the SAME).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what happens if you multiply all of a data set by a constant?

A

it is scaled.. Everything is effected. Mean/ median/ stand dev/ iqr/ quartiles all multiplied by that constant. Center, spread and all individual values are changed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the five number summary?

A

min, Q1, Q2(median), Q3 and max

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do you find Q1 and Q3?

A

Q1 is the median of the bottom half (25th %ile) and Q3 is the median of the upper half (75th %ile)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How can you match boxplots to histograms?

A

USE THE FISH TANK METHOD!

17
Q

For information purposes, which gives most? stem-leaf, histogram or box-whisker?

A

Stem leaf gives the actual values and the shape, histogram just the shape and box-whisker the least amt, but are great for comparing multiple distributions.

18
Q

What percent of the data is above Q3?

A

25%

19
Q

What percent of the data is below the median?

A

50%

20
Q

What percent of the data is between Q1 and Q3?

A

50%

21
Q

What is the IQR?

A

Interquartile range. a measure of spread.. Q3-Q1.. The distance from Q1 to Q3.

22
Q

What are the percentiles for Q1, med, and Q3?

A

25, 50 and 75

23
Q

where are the “outlier fences?”

A

1.5 IQR above Q3 and below Q1. Just a rule of thumb..

24
Q

How can you think of mean, median and mode to help understand?

A

Mean is balancing point of histogram, median splits area in half, mode is the peaks of histogram…

25
Q

What if mean median are way different?

A

There is evidence that the data is skewed or there is an outlier. (the mean chases the tail)

26
Q

Who is the best?

A

YOU ARE!! Because you are studying these here cards!!! Way to go. I am proud of you.