8 Flashcards

1
Q

What are the issues with distributed series models?

A
  • The various lagged values of 𝑋 are likely to be severely multicollinear.
  • No guarantee that the estimates 𝛽𝑠 will follow the smoothly declining pattern that economic theory would suggest.
  • The degrees of freedom tend to decrease leading to more variability and less precision.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How we could solve the problem with distributed series models?

A

Instead of including lags of the independent variables include a lag of the dependent variable. This might create issues for a model where you are trying to predict how independent variables affect the dependent one. Hence, in cases when you want to predict the dependent variable with independent variables, it is better to simply limit the number of lags you use based on theoretical or other considerations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is LOESS?

A

Locally estimated scatterplot smoothing regression (LOESS) – an analysis method that created a smooth line through time plots and scatter plots. It divides regression analysis into a few lines

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

When and how to use LOESS

A

You use LOESS to:
See if there is an interesting relationship in a noisy data set.
Evaluating if there is a non-linear relationship (i.e., e.g. non-monotone trend).
Data exploration (i.e., to get a better feel of the data without any particular aim).

You do NOT use LOESS to calculate how one variable affects another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Problems with time series models

A

Things change as time passes, so we need to be mindful of this and try to remove such “trends” as much as possible to truly evaluate how variables affect one another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is spurious relationship?

A

Spurious correlation – a strong relationship between variables that is not caused by a real underlying causal relationship, this relationship is “accidental”.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is correlogram?

A

Correlogram – a data visualisation technique that shows how a variable correlates with its lags.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How correlograms are drawn?

A

Autocorrelation Function (ACF) indicates how a variable correlates with different lags of itself. Partial Autocorrelation Function (PACF) shows the same thing but it also controls for previous 𝑘 lags when calculating the correlation between the current value 𝑡 and a lagged value 𝑘+1 where 𝑘<𝑡.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How we can find out if it is stationarity or non-stationarity in ACF?

A

If autocorrelation is slowly decreasing - non-stationarity
If autocorrelation remains the same for a long period of lag - stationarity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is unit root?

A

It is a measure which is better than correlograms and it is a stable difference between 𝑌_𝑡−𝑌_(𝑡−1) implying there is a trend, which also implies we have a non-stationary variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How to check a unit root?

A

Make sure that your numeric variables do not have a unit root

To do that you use two, or more if needed, tests:
Augmented Dickey-Fuller (ADF) test – tests if a variable is stationary using a combination of different lags. It predominantly focuses on stationarity in means, and with it, you want to reject the null hypothesis.
Kwiatkowski–Phillips–Schmidt–Shin (KPSS) test – similar to ADF, but it focuses more on the stationarity of the variance. With this test, you do not want to reject the null hypothesis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How we can fix non-stationarity?

A

First, you can try to build a regression model and see if it is cointegrated – do the variables in the model flow to some sort of equilibrium and hence non-stationarity of variables is not a problem.
Secondly, transform your non-stationary data by taking the first difference of a variable. If you still have a non-stationarity issue, you can take the first difference (in raw terms, percentages, logs, etc.) of the difference, and so on, until the variables become stationary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How we can check for cointegration?

A
  1. Build a regression equation
  2. Extract the residuals from it
  3. Import the residuals into a statistical software of your choice
  4. Check the unit root of the residuals. If you have a unit root (i.e. non-stationary), you do not have cointegration, if you do not have a unit root, you have cointegration.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How the proffesor would solve non-stationarity?

A
  1. Check if your variables are stationary.
  2. If some of them are not, transform them using the first difference (or transform all numeric variables for simplicity’s sake).
    3.If the first difference fixes the issue, just build a regression equation and interpret it as if you would any regression.

If the first difference did not fix this issue, try different types of differences (e.g. unit difference, percentage difference, log difference). If this does not help, build your regression with the “best” differences and check for cointegration. If you have cointegration, just continue, but if you have an issue, only then try to use the first difference of the first difference.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are time-series assumptions?

A
  1. The regression model is linear, is correctly specified, and has an additive error term
  2. The error term has a zero population mean.
  3. All explanatory variables are uncorrelated with the error term.
  4. The error term has a constant variance (homoscedasticity).
  5. No perfect multicollinearity
  6. Observations of the error term are uncorrelated with each other (no serial correlation).
  7. Residuals are normally distributed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How we can check for structural changes?

A

Chow test – tests if two regressions split based on some variable (e.g., a crisis dummy variable) or at a particular time moment would produce better results than one regression.

17
Q

How to use Chow’s test accurately?

A
  1. Make sure to know that there was a structural break and check if your model captures it
  2. Minimum issues with heteroscedasticity, autocorrelation, or non-normality of residuals (robust standard errors are not good enough).