QE 7/8 - time series Flashcards

1
Q

Difference between a predicted value and a forecast?

A
  1. Predicted value – refers to value of Y predicted (using regression) for observation WITHIN SAMPLE used to estimate regression
  2. Forecast – refers to value of Y forecasted for observation OUT OF SAMPLE used to estimate regression
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Difference between a forecast error and OLS residual?

A
  1. OLS residual = within sample (difference between predicted value and actual value)
  2. Forecast error = same concept but out of sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does RMSFE measure?

A
  1. Measures spread of forecast error distribution

2. Measures magnitude of typical forecasting ‘mistake’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sources of error in the RMSFE

A

(1) future values of u unknown

2) error in estimating coefficients (B0 & B1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When is RMSFE not an appropriate measure of the magnitude of a typical forecasting mistake? Example?

A
  1. If forecasting mistakes asymmetric
  2. E.g. when forecasting time I’ll arrive at train station, under-forecast (being late) much worse than over-forecast (being early)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How to test the hypothesis that, say, regressors Yt-2, Yt-3,…,Yt-p don’t further help forecast (beyond Yt-1)?

A
  1. F-test that coefficients all jointly zero
  2. Information criterion (BIC or AIC)
    (i) E.g. Bayes information criterion (BIC) determines how large the increase in R-squared must be to justify including the additional lag
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the Granger causality test?

A
  1. Test of joint hypothesis that none of X’s a useful predictor, above and beyond lagged values of Y
  2. i.e. F-statistic testing hypothesis that coefficients on all values of 1 of variables are zero (implying regressors have no predictive content for Yt beyond that contained in other regressors)
  3. N.B. NOT a test of causality (causality here just refers to predictive content)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the trade-off of using additional lagged values as predictors?

A
  1. Too few lags decreases forecast accuracy because valuable information is lost
  2. Too many lags increases estimation uncertainty
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Generally, an AR(…..) in 1st difference = AR(…..) in level

A

Generally, an AR(p) in 1st difference = AR(p+1) in level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q
  1. What does it mean for Yt to have very strong autocorrelation?
  2. What is the consequence of this?
  3. What happens in the extreme case when autocorrelation = 1?
  4. Possible solution?
A
  1. Very persistent process
  2. OLS estimator of the AR coefficient is biased towards zero
  3. In the extreme case, Yt no longer stationary
  4. Take 1st differences
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q
  1. What does Granger causality mean?

2. Granger non-causality?

A
  1. Granger causality - at least 1 of the coefficients of the lags of X is not zero
  2. Granger non-causality - all the coefficients of the lags on X are zero
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the only way to remove a stochastic trend? Exception?

A

Only way to remove a stochastic trend is by differencing, unless there’s co-integration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Problems caused by stochastic trends/unit root?

A
  1. Autoregressive coefficients biased downwards towards zero
  2. Distribution of OLS estimator and t-statistic not normal, even in large samples
  3. Spurious regression
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Explain how ‘stochastic trend’ and ‘unit root’ can be used interchangeably?

A
  1. If Yt has a unit root, then Yt contains a stochastic trend (and so is non-stationary)
  2. If Yt is stationary (and hence doesn’t have a unit root), then Yt doesn’t contain a stochastic trend
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Main methods for dealing with problem of spurious regression?

A
  1. Test for co-integration

2. Difference the data so it becomes stationary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q
  1. Benefit of co-integration (rather than differencing data) if possible, when dealing with problem of spurious regression?
  2. How do we do this?
A

1a. Co-integration allows us to see long-run relationship between X and Y
1b. Regressing on differences only allows short-run relationship

  1. Use error correction model
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Initial (informal) indication of a stochastic trend?

A
  1. Fit a mean line through the data and see how often the series crosses the line
  2. If it doesn’t cross the line very often, this indicates data with stochastic trend
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the implication for the standard Dickey-Fuller test if Yt is trend stationary?

A
  1. Test biased in favour of a unit root (only way model can fit trend is with unit root)
  2. High probability of type 1 error (rejecting null when it is true)
19
Q

Under what assumption are the Dickey-Fuller critical values correct?

A

Errors serially uncorrelated

20
Q
  1. If errors not serially uncorrelated, how can we ‘augment’ the Dickey-Fuller test?
  2. Explain how this works
A
  1. Augment DF test by adding lagged values of Y

2a. Want to ensure that any lagged differences with predictive power included in regression and not left in error term
2b. Need sufficient lags to ensure residuals are serially uncorrelated

21
Q

When augmenting the DF test, what is the trade-off between using more/fewer lags?

A
  1. Too few lags - errors may be serially uncorrelated, meaning critical values are wrong
  2. Too many lags - larger standard error (less precise estimates) because observations and degrees of freedom lost when adding lags
22
Q

How to decide how many lags to use in a DF test?

A
  1. Do sequence of F-tests,

2. Choose regression with the lowest information criterion

23
Q

Why must we be cautious about accepting the null hypothesis in a DF unit root test?

A
  1. ADF test null hypothesis = series is non-stationary
  2. Accepting null hypothesis of unit root can be due to type 2 error (failing to reject the null when it is false)
  3. ADF test has low power to distinguish between unit roots and persistent but stationary alternatives
24
Q

ADF test has low power to distinguish between ….. and …..

A

ADF test has low power to distinguish between unit roots and persistent but stationary alternatives

25
Q

What can we NOT conclude if we fail to reject the null hypothesis in a unit root test?

A

Failure to reject null doesn’t mean the series does have a unit root (just insufficient evidence to conclude that it doesn’t)

26
Q

What is a break?

A

Change in probability distribution of data (e.g. change in mean, variance etc)

27
Q

Problems caused by breaks

A
  1. Destroy external validity of time series models
  2. Cause biased in-sample estimates of coefficients (OLS estimates ‘average value’, which won’t correspond to true effect in any period if there’s a break)
28
Q

What is the F-test called when testing for a break?

A

Chow test

29
Q

It may be difficult to distinguish multiple breaks from …..

A

It may be difficult to distinguish multiple breaks from stochastic trends

30
Q

Remedies for breaks?

A
  1. If know/assume break date, estimate separate models in different sub-periods

2a. But break could be due to effect of outliers (e.g. outliers due to economic crisis in 1980, but in fact no break from 70s-90s)
2b. Could remove observations from sample, or model period w/outliers separately

  1. If break due to regime shift (e.g. change in inflation target from 1985), can model using dummy variables
31
Q
  1. Examples of 2 cointegrated variables

2. Explain

A
  1. Short and long-run interest rates

(i) Economic theory - long rate = expected weighted sum of short rates over same time horizon
(ii) If short rate has stochastic trend, long rates must inherit that same trend (if this theory is correct)
(iii) i.e. long and short rates share common stochastic trend (they are co-integrated)

  1. If the law of one price holds, then exchange rates and price differentials must be cointegrated of order 1
32
Q

What assumption is required for the identification of dynamic causal effects?

A

Random assignment of X (independent of potential outcomes), AKA EXOGENEITY

33
Q

Under what assumption does OLS estimate the dynamic causal effect on Y of change in X?

A

X is EXOGENOUS

34
Q

Distributed lag model assumptions

A

(1) X (regressors) exogenous

(2a) Y and X have stationary distributions
(2b) Observations (Yt, Xt) and (Yt-j, Xt-j) become independent as j gets large

(3) Y and X have non-zero finite 8th moments (E[Y^8]=0)
(4) No perfect multi-collinearity

35
Q

Time series counter-part of ‘identically distributed’ part of usual iid assumption

A

X and Y have stationary distributions

36
Q

Time series counter-part of ‘independently distributed’ part of usual iid assumption

A

(Yt, Xt) and (Yt-j, Xt-j) become independent as j gets large

37
Q

What are the implications of the assumption for distributed lag models that X and Y have stationary distributions?

A
  1. Coefficients don’t change within sample (internal validity)
  2. Results can be extrapolated outside sample (external validity)
38
Q

Intuition behind the assumption for distributed lag models that (Yt, Xt) and (Yt-j, Xt-j) become independent as j gets large

A
  1. We have separate experiments for different time periods (that are widely separated)
  2. Each new observations provides additional information (wouldn’t be case, for example, in extreme where data perfectly dependent over time)
39
Q

Consequences of distributed lag model assumptions?

A
  1. OLS gives consistent estimators of B1, B2,… (dynamic multipliers)
  2. Sampling distribution of estimators of coefficients approximately normal in large samples
  3. But if errors serially correlated, then formula for variance and standard errors not the same as normal case of i.i.d data (instead use HAC SEs)
40
Q
  1. Limitation of distributed lag model

2. Possible solutions

A
  1. Assumes all dynamic multipliers beyond lag r equal zero

2a. Solution - could add more lags
2b. But this means losing more observations at the beginning + degrees of freedom (which reduces precision of estimates)

  1. Solution - use ADL model
41
Q

Verbally, explain what the following mean:

  1. Impact effect of change in X
  2. 1-period dynamic multiplier
A
  1. Effect of change in Xt on Yt, holding past Xt constant

2. Effect of change in Xt-1 on Yt, holding constant Xt, Xt-2, Xt-3,…

42
Q

What does a 1-period dynamic multiplier mean?

A

Effect of change in Xt-1 on Yt, holding constant Xt, Xt-2, Xt-3,…

43
Q

What does a 2-period dynamic multiplier mean?

A

Effect of change in Xt-2 on Yt, holding constant Xt, Xt-1, Xt-3,…

44
Q

Example of a break

A

Change in inflation target