Distributions and Histograms Flashcards

1
Q

Visually representing continuous variables

A

Histogram (predictor variable is cont.); column chart with bins and no categories; count # of values greater than min. up to but not including max.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Bar height

A

of observations in bin

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Bar width

A

Range of values of the continuous variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Patterns of distributions

A

Bell-shaped/normal, U-shaped, Right-skewed/positive, Left-skewed/negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Bell-Shaped/Normal:

A

Symmetric/unimodal
Skewness value of 0
Many naturally occurring characteristics are normally distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

U-Shaped:

A

Symmetric, bimodal (two distributions)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Right-skewed/Positive:

A

Long right tail
Skew=running into a max. or min. possible value; could have min. and no max. or opposite
Asymmetry

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Left-skewed/Negative:

A

Long left tail
Skew=outliers
Mean is influenced by outliers
Asymmetry

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Describing distributions with descriptive statistics

A

Central tendency, variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Central Tendency

A

mean (average/normal), median (middle value/skewed), mode (most frequent value/bimodal)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Variability

A

Standard deviation (normal), interquartile range (skewed), range (high-low)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Standard Deviation

A

Shows how closely points cluster around the mean; “average” amount the points deviate, or differ, from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Median

A

Found by calculating middle of minimum and maximum values then averaging those values; less sensitive to outliers or skew

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

IQR

A

Quartiles are values that cut the distribution in quarters; IQR= Q3-Q1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Box and Whisker Plot

A

Box plots represent the min, Q1, median, Q3, and max; outliers are black dots on whiskers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Main Effects

A

1 outcome and certain # of predictors

17
Q

Associated (related):

A

Outcome is diff. by levels of predictor

18
Q

Independent (not related):

A

Outcome is same by levels of predictor

19
Q

Interaction:

A

Effect of one predictor is different depending on level of other predictors

20
Q

Additive:

A

Effect of one predictor is same regardless of level of other predictors

21
Q

Effect Size

A

The magnitude (strength) of the association between predictor and outcome; based on difference between means and variability

22
Q

Effect Size and Mean Differences

A

If variability is same, but one pair of means are farther apart then there is less overlap, bigger effect size, and easier to detect

23
Q

Effect Size and Variability

A

If mean differences are the same, but one pair has smaller variability then there is less overlap, bigger effect size, and easier to detect

24
Q

Ceiling effect/Floor effect

A

Runs into upper/lower limit

25
Q

Converting continuous values into categories

A

Count frequency of each category and create column chart