Lecture 4 Flashcards

1
Q

4 things to mention when interpreting a coefficient

A
  • Significance
  • Sign
  • Size
  • Ceteris paribus
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Stepwise regression

A

Step by step include/ exclude variables. If there are a lot of IVs, you can use stepwise regression to find the good models and to find what variables to use.

Forward - Add variables in every model
Backward - Start with all variables and take them out step by step.

However, this is a bad practice, since you have to start from theory and not randomly look for variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Multicollinearity

A

High correlation between 2 IVs. It makes it hard to identify the effect of x on y.
It also influences the result of the estimation and the assumption of the model is that all the IVs have a separate influence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How can we check wether we have multicollinearity?

A
  • Check the data before estimation
  • Correlation coefficients matrix, correlation coefficients around or above 0,7-0,8 signal multicollinearity
  • Variance inflation factor (VIF)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How to measure Variance inflation factor (VIF)

A

VIF for the IV(xk)

VIFk = 1/ (1-R2k)

Usually values >10 indicate multicollinearity
Conservative threshold is >5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How can you solve multicollinearity?

A
  1. Increase sample size
  2. Drop one of the variables (Robustness check: first estimate the model with one variable, then with the other. Do you get the same results?)
  3. Transform the highly correlated IVs (e.g. create a composite variable, combine collinear IVs)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Homoscedacity

A
  • OLS assumption
  • Variance of the error term is constant over various values of the IV
  • Dispersion of the error remains the same over the range of observations
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Heteroscedacity

A
  • OLS assumption does not hold
  • Error term does not have a constant variance
  • Dispersion of the error changes over the range of observations
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why do you have heteroscadacity?

A

Groups of observations are different, follow different processes, so have different error terms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What tests to use to test for heteroscadacity

A
  • Breusch- Pagan test

- White test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Graphical inspection for heteroscadacity

A
  • Make a scatterplot of the IV and residuals of regression

- You want to see most scores concentrated around the centre

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How can we solve heteroscadacity

A
  1. Weighted least squares
    - Each observation is weighted
    - Observations with a higher variance get a lower weight in determining the coefficients
  2. Calculate robust standard errors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does the mean represent in the case of dummy variables?

A

Percentage split between the categories, average in the proportion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly