Main Vocab Flashcards
when describing a distribution of data, mention..
- shape
- center
- variability (spread)
- unusual features (outliers or gaps)
Always include context!
A five-number summary includes..
Minimum, Q1, median, Q3, maximum
a z-score tells us..
The number of standard deviations above or below the mean.
the empirical rule
in a normal distribution:
about 68% of the values lie within 1 standard deviation of the mean
about 95% of the values lie within 2 standard deviations of the mean
about 99.7% of the values lie within 3 standard deviations of the mean
when describing a bivariate distribution (scatterplot), mention..
- direction (positive or negative)
- strength (strong or weak)
- form (linear or non-linear)
- unusual features
Always include context!
residual
Observed value - predicted value. y − ŷ
When asked to interpret the slope of a regression line in context, say..
“On average, there is a predicted [increase or decrease] of [slope][units] in [the dependent variable] for every
increase of one [unit] in [the independent variable].”
When asked to interpret the y-intercept of a regression line in context, say..
“On average, when the value of [the independent variable] is zero [units], [the dependent variable] is predicted
to be [y-intercept] [units].”
When asked to interpret the coefficient of determination of a regression line, r2, in context, say..
“[r^2] percent of the variation in [the dependent variable] can be explained by the linear relationship between
[the dependent variable] and [the independent variable].”
When writing about bias in sampling methods..
- Identify the population and sample.
- Explain how the sampled individuals differ from the general population
- Explain how this leads to an overestimate or underestimate
Two events A and B are independent if
P(A|B) = P(A) and P(B|A) = P(B)
Two events A and B are mutually exclusive if
P(A ∩ B) = 0
The probability of the union of two events A and B can be found by
P(A ∪ B) = P(A) + P(B) − P(A ∩ B)
The probability of the intersection or “joint probability” of two events A and B can be found by
P(A ∩ B) = P(A) ∙ P(B|A)
Conditions for a one-sample z interval for a population proportion
- the data are collected using a random sample
- when sampling without replacement, the sample size is less than 10% of the population
- np̂≥ 10 and n(1 − p̂) ≥ 10