BA 4 - Single Variable Linear Regression Flashcards
Regression, linear regression, single variable linear regression
Regression - examines the relationship among variables
Linear regression - examines linear relationships among variables
Single variable linear regression - seeks to identify a linear relationship between two variables
Uses of regression
Primary uses:
i. Studying the magnitude and structure of a relationship between two variables; and
ii. Forecasting a variable based on its relationship with another variable.
Regression equation
y^ = a + bx
y is the dependent variable, x is the independent variable, a is the y-intercept, b is the slope
true eq: y = alpha + (beta)x + epsilon
Point forecast
The value of y determined by entering the particular value of x into the regression equation
Prediction interval
Interval around the point forecast likely to contain the actual value of y.
The width of the prediction interval is based on the:
i. sd of the regression;
ii. the desired level of confidence; and
iii. the location of the x-value in relation to the historical values of the independent variable.
R^2
R^2 = (Regression sum of squares)/(Total sum of squares)
i.e. variation explained by the regression line divided by the total variation.
0 =< R^2 =< 1
p-value
result of the hypothesis test on whether there is a significant linear relationship - i.e. whether the slope of the regression line is zero
H0: beta = 0; Ha: beta =/= 0
If p<= 0.5 and if the confidence interval doesn’t contain zero we fail to accept H0.
Residual, regression and total sum of squares
Residual - variation unexplained by the regression equation
Regression - variation explained by the regression equation
Total - variation around the mean line
To check in residual plot
Residuals should be randomly spread above and below the x-axis.
Pattern implies other factors are at play.
To know if SVLR is the right fit
R^2
p-value
residual plot
Regression analysis with categorical variables
Use a dummy variable
[EXCEL] Value of y
=CELL1+CELL2*x
for y=a + bx
[EXCEL] To calculate point forecasts
=SUMPRODUCT(array1,[array2],[array3]…)
[EXCEL] To forecast value
=FORECAST(x, known_ys, known_xs)
dependent array, independent array
[EXCEL] Regression models with dummy variables
=IF(logical_test,[value_if_true],[value_if_false])