Unit 2 Flashcards
bivariate data
data that records two variables for each individual so the relationship between them can be studied
explanatory variable
the independent variable: the variable used to explain or predict changes in the other variable; plotted on the x-axis
response variable
the dependent variable: the variable that responds to changes in the other variable; plotted on the y-axis
correlation coefficient
a number from -1 to 1 that tells us the strength and direction of a linear relationship. It can be computed as r = Σ(z_x * z_y) / (n - 1), the sum of the products of the z-scores divided by n - 1
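The z-score formula for r can be checked by hand with a short script. This is a minimal sketch: the function name `correlation` and the `hours`/`scores` data are made up for illustration.

```python
from statistics import mean, stdev

def correlation(xs, ys):
    """Correlation coefficient r = sum(z_x * z_y) / (n - 1)."""
    n = len(xs)
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)  # sample standard deviations
    # Product of the two z-scores for each individual
    z_products = [((x - x_bar) / s_x) * ((y - y_bar) / s_y)
                  for x, y in zip(xs, ys)]
    return sum(z_products) / (n - 1)

# Hypothetical data: hours studied and quiz score
hours = [1, 2, 3, 4, 5]
scores = [2, 4, 5, 4, 5]
r = correlation(hours, scores)
print(round(r, 3))
```

A value near 1 or -1 means a strong linear relationship; the sign gives the direction.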
Least Squares Regression Line
the best-fit line for a scatterplot: the line that makes the sum of squared residuals as small as possible; written ŷ = mx + b, where ŷ is the predicted y
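The slope and intercept of a least squares line can be computed directly from the data. A minimal sketch, assuming the standard formulas m = Σ(x - x̄)(y - ȳ) / Σ(x - x̄)² and b = ȳ - m·x̄; the function name and data are hypothetical.

```python
from statistics import mean

def least_squares_line(xs, ys):
    """Return (slope, intercept) of the least squares regression line."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar  # the line passes through (x_bar, y_bar)
    return slope, intercept

# Hypothetical data
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
slope, intercept = least_squares_line(xs, ys)
print(slope, intercept)
```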
sum of squared errors
the total of the squared residuals; it measures how far the points fall from the line. The smaller the sum, the better the line fits
residual or error
observed y - predicted y (the vertical distance from a point to the line)
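Residuals and the sum of squared errors go together, so one small example covers both cards. A minimal sketch: the data and the line (slope 0.6, intercept 2.2) are assumed values for illustration.

```python
# Hypothetical data and an assumed fitted line
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
slope, intercept = 0.6, 2.2

# Predicted y for each x, from the line
predicted = [intercept + slope * x for x in xs]

# Residual = observed - predicted
residuals = [y - y_hat for y, y_hat in zip(ys, predicted)]

# Sum of squared errors: square each residual, then add them up
sse = sum(e ** 2 for e in residuals)
print(residuals, sse)
```

A positive residual means the point sits above the line; a negative one means it sits below.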
extrapolation
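using the regression line to predict y for an x-value outside the range of the data used to fit the line; these predictions are often unreliable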
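One way to catch extrapolation is to compare the x-value with the range of the observed data before trusting a prediction. A minimal sketch: the function `predict`, the line, and the x-range are all hypothetical.

```python
def predict(x, slope, intercept, x_min, x_max):
    """Return (predicted y, True if x is outside the observed x-range)."""
    is_extrapolation = not (x_min <= x <= x_max)
    return intercept + slope * x, is_extrapolation

# Assumed line fitted on x-values from 1 to 5
inside = predict(3, 0.6, 2.2, 1, 5)    # x is within the data range
outside = predict(10, 0.6, 2.2, 1, 5)  # x is beyond the data range
print(inside, outside)
```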
lurking variable
a hidden variable that could be impacting both the explanatory and response variables
leverage
a point has high leverage when its x-value is far from the mean of the x-values
influential point
an outlier or high-leverage point that noticeably changes the slope or intercept of the LSRL when it is removed
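To see whether a point is influential, fit the line with and without it and compare the slopes. A minimal sketch with made-up data, where the last point (x = 20) has high leverage; the helper `fit_slope` is hypothetical.

```python
from statistics import mean

def fit_slope(xs, ys):
    """Slope of the least squares line through the points."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    return sxy / sxx

# Hypothetical data; the point (20, 2) is far from the other x-values
xs = [1, 2, 3, 4, 5, 20]
ys = [2, 4, 5, 4, 5, 2]

slope_with = fit_slope(xs, ys)
slope_without = fit_slope(xs[:-1], ys[:-1])  # drop the last point
print(slope_with, slope_without)
```

Here the slope changes sign once the point is removed, so the point is influential.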
coefficient of determination (R^2)
the percent of the variability in the y variable that can be explained by the linear relationship with the x variable
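R^2 can be computed by comparing the squared error around the regression line with the squared error around the mean of y. A minimal sketch, assuming the formula R^2 = 1 - SSE/SST; the function name and data are hypothetical.

```python
from statistics import mean

def r_squared(xs, ys):
    """Coefficient of determination for the least squares line."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sxx
    intercept = y_bar - slope * x_bar
    # SSE: squared distance of each point from the regression line
    sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    # SST: squared distance of each point from the mean of y
    sst = sum((y - y_bar) ** 2 for y in ys)
    return 1 - sse / sst

# Hypothetical data
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
r2 = r_squared(xs, ys)
print(round(r2, 3))
```

For a line with one explanatory variable, this value equals the correlation coefficient squared.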
regression to the mean
the tendency for extreme occurrences to be followed by ones closer to the average
scatterplot
a graph that shows the relationship between two quantitative variables, with one point plotted for each individual
DOFS
acronym for the four things to address when describing a scatterplot: Direction, Outliers, Form, Strength