Module 4 Flashcards
What are the two primary purposes for using regression?
- Studying the magnitude and structure of a relationship between two variables.
- Forecasting a variable based on its relationship with another variable.
What is the structure of the single variable linear regression line?
y^=a+bx.
What is the expected value of y, the dependent variable, for a given value of x.?
y^
What is the independent variable, the variable we are using to help us predict or better understand the dependent
variable.?
x
What is the y-intercept, the point at which the regression line intersects the vertical axis. This is the value of y^
when the independent variable, x, is set equal to 0.?
a
What is the slope, the average change in the dependent variable y as the independent variable x increases by
one?
b
The true relationship between two variables is described by the equation y=α+βx+E, where E is the ______ y–y^.
error term
The idealized equation that describes the _________ is y^=α+βx.
true regression line
We determine a _________ by entering the desired value of x into the regression equation.
point forecast
We must be extremely cautious about using regression to forecast for values outside of the historically observed range of the _________ variable (x-values).
independent
Instead of predicting a single point, we can construct a __________, an interval around the point forecast that is likely to contain, for example, the actual selling price of a house of a given size.
prediction interval
The ______ of a prediction interval varies based on the standard deviation of the regression (the standard error of the regression), the desired level of confidence, and the location of the x-value of interest in relation to the historical values of the independent variable.
width
It is important to evaluate several metrics in order to determine whether a __________________ model is a good fit for a data set, rather than looking at single metrics in isolation.
single variable linear regression
_______ measures the percent of total variation in the dependent variable, y, that is explained by the regression line.
R2
R2 = Regression Sum of Squares/____________
Total Sum of Squares