Module 5 Flashcards

1
Q

We use ___________ to investigate the relationship between a dependent variable and multiple
independent variables.

A

multiple regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The structure of the multiple regression equation is…

A

y^=a+b1x1+b2x2+…+bkxk.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The true relationship between multiple variables is described by y=α+β1x1+β2x2+…+βkxk+ε, where ε is the ________.

A

error term

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

_________ in multiple regression characterize relationships that are net with respect to the independent variables included in the model but gross with respect to all omitted independent variables.

A

Coefficients

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Forecasting with a multiple regression equation is similar to forecasting with a single variable linear model. However, instead of entering only one value for a single independent variable, we input a value for ______ of the independent variables.

A

each

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

As with single variable linear regression, it is important to evaluate several metrics to determine whether a multiple variable linear regression model is a ______ for our data.

A

good fit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

For multiple regression we rely less on scatter plots and more on __________ and _________ because visualizing three or more variables can be difficult.

A

numerical values; residual plots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Because R2 never ________ when independent variables are added to a regression, it is important to multiply it by
an ____________ when assessing and comparing the fit of a multiple regression model.

A

decreases; adjustment factor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

This adjustment factor compensates for the ________ in R2 that results solely from increasing the number of independent variables.

A

increase

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

It is particularly important to look at ________, rather than R2, when comparing regression models with different numbers of independent variables.

A

Adjusted R2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

In addition to analyzing Adjusted R2, we must test whether the relationship between the independent and dependent variables is ______ and _______. We do this by analyzing the regression’s residual plots and the _______
associated with each independent variable’s coefficient.

A

linear; significant; p-values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

For multiple regression models, because it is difficult to view the data in a simple scatter plot, ________ are an
indispensable tool for detecting whether the linear model is a good fit.

A

residual plots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

There is a residual plot for each _______ variable included in the regression model.

A

independent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

We can graph a residual plot for each independent variable to help detect patterns such as __________ and __________.

A

heteroskedasticity; nonlinearity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

As with single variable regression models, if the underlying multiple relationship is linear, each of the residuals
follows a normal distribution with a mean of _____ and _____ variance.

A

zero; fixed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

We should also analyze the _______ of the independent variables to determine whether there is a significant
relationship between the variables in the model.

A

p-values

17
Q

If the p-value of each of the independent variables is less than _____, we conclude that there is sufficient evidence to say that we are 95% confident that there is a significant linear relationship between the independent and dependent variables.

A

0.05

18
Q

Multiple regression requires us to be aware of the possibility of __________ among the independent variables.

A

multicollinearity

19
Q

Multicollinearity occurs when there is a ______ linear relationship among two or more of the independent
variables.

A

strong

20
Q

Indications of multicollinearity include seeing an independent variable’s p-value ________ when one or more other independent variables are added to a regression model.

A

increase

21
Q

We may be able to _______ multicollinearity by either increasing the ________ size or removing one (or more) of
the collinear variables.

A

reduce; sample

22
Q

Multiple regression models allow us to include multiple __________ for categorical data—day of week,
for example.

A

dummy variables

23
Q

A dummy variable is equal to ___ when the variable of interest fits a certain criterion. For example, a dummy
variable for “Saturday” would equal ___ for observations relating to Saturdays and ___ for observations related
to all other days.

A

1; 1; 0

24
Q

The number of dummy variables we include must always be ___ fewer than the number of options in a category.

A

1

25
Q

_________ are used to capture the ongoing effects of a given variable.

A

Lagged values

26
Q

The lag period is based on managerial _____ and data _______.

A

insight; availability

27
Q

If the lagged variable does not _______ the model’s explanatory power, the addition of the variable
decreases Adjusted R2.

A

increase

28
Q

How do you create a regression output table in excel?

A

Using the Data Analysis tool

29
Q

How do you create a regression model using dummy variables?

A

=IF(logical_test,[value_if_true],[value_if_false])

→ Returns value_if_true if the specified condition is met, and returns value_if_false if the condition is not met.