BA 5 - Multiple Regression Flashcards

1
Q

Use of multiple regression

A

To identify linear relationships between three or more variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Equation

A

y^ = a + b1x1 + b2x2 + … + bkxk + e

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Tools used for analyzing multiple regression

A

Because graphing is complicated or impossible, we rely on numerical values and residual plots (for one variable at a time).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Adjusted R^2

A

Adjusted R^2 = adjustment factor X R^2

R^2 never decreases when independent variables are added to a regression model, so the adjustment factor compensates for the increase that was due solely to the addition of a new variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Residual plot analysis

A

To check if the relationship between independent and dependent variables is linear and significant.

i. Separate scatter plot for each independent variable;
ii. Look for patterns of heteroskedasticity and nonlinearity; and
iii. Also examine p-values of independent variables,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Multicollinearity

A

Multicollinearity occurs when there is a strong linear relationship among two or more of the independent variables.

If a variable is significant in a single variable model and becomes insignificant in a multiple regression model, it’s likely that there is a multicollinearity between two or more variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When multicollinearity is detected

A

i. Check if dropping one of the collinear variables increases the Adjusted R^2; or
ii. Increase sample size.

If using the model for forecasting, multicollinearity is not an issue; if using the model to understand the net effects of independent variables, it’s an issue.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Dummy variables

A
For categorical (rather than quantitative) data.
The number of dummy variables should be one fewer than the options in the category. The option that is not included will have the value of '0', and is known as the 'base case'
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Lagged variables

A

Used to capture the ongoing effects of a given variable.
The lag period is based on managerial judgement.

Drawbacks:

i. each lagged variable reduces the sample size by one; and
ii. if it doesn’t increase the model’s explanatory power, it decreases the Adjusted R^2.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Gross and net relationships between variables

A

Gross - affected by any variable related to the independent variable;

Net - controls for other factors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

EXCEL lag tip

A

Don’t check labels for lagged!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly