CK020 - Linear Regression Flashcards

1
Q

What are the assumptions of a linear regression model?

A
  • Error terms are homoscedastic
  • Error terms are uncorrelated
  • Error terms are normally distributed
  • Linearity
  • No (multi)colinearity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are ‘standardized residuals’ ?

A

Residuals where the heteroscedasticity is removed (so they are homoscedastric residuals)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are ‘standardized residuals’ used for?

A
  • Assessing homoscedasticity
  • assessing normality of the residuals
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are ‘studentized residuals’ ?

A

Residuals with a known (t-) distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are ‘studentized residuals’ used for?

A

Identifying outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How to check the ‘normality assumption’ ?

A

Plot standardized residuals (y) against the theoretical residuals (x) and check if this is a linear line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is ‘homoscedasticity’ ?

A

Standardized/studentized residuals are randomly spread around 0 with constant variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How to check the ‘homoscedasticity assumption’ ?

A

By plotting the standardized/studentized residuals (y) against the fitted values/covariates (x)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What to do with ‘heteroscedasticity’ ?

A
  • Variable transformation (not recommended)
  • Weighted Least Squares
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is an ‘outlier’ ?

A

Any observation that does not ‘fit the model’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a ‘high leverage point’ ?

A

Observation with extreme predictor value(s)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are ‘influential values’ ?

A

Observations that have excessive influence on the model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why should you do model diagnostics?

A

To evaluate the assumptions are reasonable for the data at hand

How well did you know this?
1
Not at all
2
3
4
5
Perfectly