Unit 2 Flashcards
What is a scatterplot used for
Immediate visual impression of a possible relationship between 2 variables.
What is the correlation coefficient
Used as a quantitative value of strength and direction of a linear relationship.
Inequality of value of correlation coefficient and symbol
-1 < r < 1
What is r^2
Coefficient of determination - % of variation in the resp var that is explained by the explanatory var
How is r^2 represented
As a %
what does 100% r^2 mean
Perfect fit
all variation in x
How to describe a scatter plot and eg
Direction
Form
Strength
Strong positive linear association
What are the boundaries of r (what value of r is strong, etc)
r > 0.8 = strong
0.7 < r < 0.8 = moderately strong
r < 0.7 = weak
What is a residual
Difference between an observed and predicted value
= actual - predicted
Sum and mean of residuals is always equal to
0
Line of regression eqn and what each variable is
y hat = a + bx
a = y intercept b = slope
How to calculate B
what does each variable mean
r * sy/sx
r = correlation coeff sy = SD of y sx = SD of x
How to check if linear model is a good fit
r^2 should be high
Residual plot shouldn’t show any pattern
How and whyto transform a graph
Use log or ln on one of the variables, may allow for a linear model to be used
How to assess the effectiveness of transformation
Checking if randomness in residual plot has increased
Checking if r^2 value has increased