Time Series Flashcards
Time Series forecasting steps
Time Series Models:
- Linear
- Log-linear
- Autoregressive (AR)
Linear Trend
yt = b0 + b1 (t) + et
- predicted change in y is b1 and t = 1, 2, …, T
- uses time as independent variable (model assumes it explains dependent variable which limits it)
- DW check for SC.
- Appropriate for data points equallly distributed above/below line w/ constant mean. GDP and inflation good candidates for model.
Log-linear
yt = eb0 + b1 (t)
ln( yt ) = ln(eb0 + b1 (t)) => ln( yt ) = b0 + b1 (t)
- best when data residuals are correlated/predictable, or mean is non-constant. Investment and seasonality data good candidates. Expotential growth data needs it.
- increases predictive ability of time series and minimizes impact of SC in error terms.
Autoregressive (AR)
xt = b0 + b1 xt-1 + b2xt-2 + …. + bpxt-p + et
- above shows AR model of the order p
- AR(p) model is correctly specified if autocorrelations of residuals from model are not statistically significant at any lag.
- No longer a distinction b/w dependent and independent variables ( x is the only variable)
- Don’t use Durbin-Watson here.
-
Do use t-test to find if residuals at any lag are significant. If yes, model incorrect and lagged variable at indicated lag should be added.
*
Time Series terms (1st set)
Chain rule of forecasting
Covariance stationary
Mean Reversion
Unit Root
Random Walk
Chain rule of forecasting - calculation of successive forecasts one period at a time, where risk increases w/ each forecast bc based on previously forecasted values
^xt+1 = ^b0 + ^b1xt is one-period ahead for an AR(1). xt+2 is two-periods.
Covariance stationary - infrences based on AR models may be invalid unless model is covariance stationary, meaning the following three conditions are met:
Constant and finite
- Mean
- Variance
- Covariance w/ leading or lagged values
Most models are not stationary.
How to determine?
- plot data to see if mean and variance remain constant
- (ryan) Dickey-Fuller test (test for unit root, or if b1 - 1 = 0)
Mean Reversion mean reverting level for an AR(1) is:
b0 / (1 - b1 )
Unit Root present when lag coefficient = 1.
- Not covariance stationary
- Undefined mean reversion (i.e. b1 = 1)
- period’s value = last period’s value + random error term
Random Walk
Without Drift: yt = xt-1 + Et
- (Same descripters as unit root) when the value in one period is equal to the value in another period, plus a random (unpredictable) error.
- First Differencing can transform data to covariance stationary
First Differencing is subtracting value of immediately preceding period from current period value to define a new variable, y. Models change in the value of the variable rather than the value of the variable.
yt = xt – xt – 1 ⇒ yt = εt
Stating y in form of AR(1) model:
yt = b0 + b1yt – 1 + εt
where:
b0 = b1 = 0
Time Series terms (2nd set)
Seasonality
Root Mean Squared Error (RMSE)
Autoregressive Conditional Heteroskedasticity (ARCH)
Seasonality
Root Mean Squared Error (RMSE)
Autoregressive Conditional Heteroskedasticity (ARCH)
- What is it? variance of the residuals in one time period within a time series is dependent on the variance of the residuals in another period
- Effect? standard errors of regression coefficients in AR models and the hypothesis tests of these coefficients are invalid.
- Correct? use methods that correct for heteroskedasticity, such as generalized least sqaures.
- Alternatively, using an ARCH modelcan be used to predict variance in t+1