AP Stats CH 3 Flashcards
Interpret a scatter plot (describe a relationship)
Direction(positive/negative/none)
Unusual features (outliers, clusters)
Form (linear or non-linear)
Strength (how close to form)
Interpret correlation coefficient r
The linear relationship between x and y is “strength” and “direction”
Interpret slope
For every “x context”, the
predicted number of “y context”
increases/decreases by “slope”
Interpret a residual
The actual “y context” was
“residual” above/below the
predicted value for “x =”
Interpret coefficient of determination (r squared)
“Percent” of the variation in “y context” is explained by the linear relationship with “x context”
interpret the y intercept
When “x context” is zero, the “y context” is equal to “y intercept”
Interpret the standard deviation of the residuals (s)
The actual “y context is
typically about “s” away
from the number predicted by the
LSRL.
Desiree is interested to see if students who consume more caffeine tend to study more as
well. She randomly selects 20 students at her school and records their caffeine intake (mg), 𝑥, and the number of hours spent studying, 𝑦. If you were to interpret the scatter plot, what features would you need to include?
Context, direction, form, strength, unusual features
Desiree randomly selects 20 students at her school and records their caffeine intake (mg), 𝑥, and the number of hours spent studying, 𝑦. A scatterplot of the data showed a linear relationship with a slope of 0.164. Interpret the slope.
For every mg of caffeine, the
predicted number of study hours
increases by 0.164 hrs.
Desiree randomly selects 20 students at her school and records their caffeine intake (mg), 𝑥, and the number of hours spent studying, 𝑦. A scatterplot of the data showed a linear relationship with a y int. of 2.544 hrs. Interpret the y intercept.
When a student does not have any
caffeine, the predicted number of
hours spent studying is 2.544 hrs.
Desiree randomly selects 20 students at her school and records their caffeine intake (mg), 𝑥, and the number of hours spent studying, 𝑦. A scatterplot of the data showed a linear relationship with an R-squared value of 60.032. Interpret R -squared, the coefficient of determination.
About 60.032% of the variation in
study hours is explained by the
linear relationship with caffeine.
Desiree randomly selects 20 students at her school and records their caffeine intake (mg), 𝑥, and the number of hours spent studying, 𝑦. A scatterplot of the data showed a linear relationship with an r-squared value of 60.032. Find and interpret r, the correlation coefficient.
𝑟 is 0.7767, indicating that the linear relationship between study hours and caffeine intake is fairly strong and positive.
Note: r and/or an equation of a line
of best fit does not tell you if a linear
model is appropriate. You must see
the original scatterplot and or
residual plot.
State 3 ways to decide if a data set is linear.
- Residual plot does not show a pattern
- The scatter plot looks linear.
- r is close to 1 or -1 (or r-squared is close to 1)
Desiree randomly selects 20 students at her school and records their caffeine intake (mg), 𝑥, and the number of hours spent studying, 𝑦. A scatterplot of the data showed a linear relationship. The residual for a student whose caffeine in take was 80 mg is −2.72. Interpret the residual.
The actual number of study hours is
2.72 hours below the predicted
value for a student whose caffeine
intake is 80 mg
Desiree randomly selects 20 students at her school and records their caffeine intake (mg), 𝑥, and the number of hours spent studying, 𝑦. A scatterplot of the data showed a linear relationship with s = 1.532. Interpret s.
The actual hours spent studying is
typically about 1.532 hours away
from the number predicted by the
LSRL.