Distribution and Plots Flashcards
Describing distribution
Shape, center, spread, outliers, what does the data mean in this context
One clear peak
Unimodal
Two clear peaks
Bimodal
All have same frequency
Uniform
Area under the curve is 1
Normal (bell curve)
Left mirrors the right and mean is median
Symmetric
Mean is to the left of median and tail is longer to the left
Left skewed
Mean is to the right of median and tail is longer to the right
Right skewed
Center for symmetric
Mean
Center for skewed
Median
Center for mode
Bimodal may have bimode
Spread for skewed
IQR
Spread for symmetric
Stat, list, math, std
Higher frequencies away from the median means _ standard deviation.
Higher
Outliers
Q1-1.5(IQR)
Q3+1.5(IQR)
_ _
- Min
- Q1
- Med
- Q2
- Max
- Outliers can be as points in modified _ _
Box plot
Inner quartile range
Q3-Q1
Marginal distribution
Margin/total
Conditional distribution
Condition/margin
P(x|y) means the _ of x condition given y condition.
Probability
Always assume _ until proven otherwise.
Independent
Conditional distribution of x in y
X values in the y row divided by the y total
Stem and leaf plot
Make sure to put key
Histogram
- X is _
- Stat, edit, stat plot, plot 1, on, histogram, x list: L1, Freq: 1, No values in y=, Zoom 9
Frequency
Quantitative
- Average
- Discrete
- Whole number
- Continuous
- Measured to an arbitrary amount
Numbers
Categorical
- Proportions
- Percent
Categories
If standard deviation is 0 then all of the observations are the _.
Same