Chapter 9: Multiple & Logistic Regression Flashcards

1
Q

What is the form of a multiple regression model?

A

ŷ = b₀ + b₁x₁ + b₂x₂ + … + bₖxₖ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does each coefficient in a multiple regression represent?

A

The effect of that variable on the response, holding other variables constant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the purpose of adjusted R²?

A

To account for the number of predictors and prevent overfitting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the formula for adjusted R²?

A

1 - [(SSE/SST) × (n-1)/(n-p-1)]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When is a variable considered significant in regression?

A

When its p-value is less than 0.05.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does a large p-value suggest about a predictor?

A

That it may not contribute meaningfully to the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is collinearity?

A

Correlation between two or more predictors that complicates model estimation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why is collinearity a problem?

A

It makes it difficult to estimate individual effects of predictors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is backward elimination?

A

A model selection method that removes predictors one at a time based on adjusted R² or p-values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is forward selection?

A

A model selection method that adds predictors one at a time based on adjusted R² or p-values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does the intercept represent in multiple regression?

A

The predicted response when all predictors are 0 (may not be meaningful).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the interpretation of a binary categorical predictor’s coefficient?

A

The difference in the response between that category and the reference group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does R² tell us in multiple regression?

A

The proportion of variance in the response explained by the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Why use adjusted R² over regular R²?

A

Adjusted R² accounts for the number of predictors, avoiding inflation from adding irrelevant variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does a p-value less than 0.05 indicate for a coefficient?

A

That the predictor is statistically significant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the goal of model selection?

A

To find the simplest model that adequately explains the data.

17
Q

How is prediction made using a regression model?

A

By plugging in values of predictors into the model equation.

18
Q

What does it mean to control for other variables?

A

To isolate the effect of one variable while holding others constant.

19
Q

Why might the intercept in a model be meaningless?

A

Because it represents the prediction when all variables are zero, which may not be realistic.

20
Q

What is the main advantage of multiple regression?

A

It allows for controlling multiple variables and assessing their individual contributions.