Chapter 6: Statistical Models for Time Series Flashcards
How do linear models for time series differ from models applied to cross-sectional data?
They account for the correlations that arises between data points in the same time series, in contrast to the standard methods applied to cross-sectional data, in which point is assumed to be independent.
What assumptions would allow ordinary least squares to be applied to time series data?
With respect to time series behaviour:
- Time series has a linear response to its predictors.
- No input variable is constant over time or perfectly correlated with another input variable.
With respect to error:
- For each point in time, the expected value of the error, given all explanatory variables for all time periods is 0.
- The error at any given time period is uncorrelated with the inputs at any time period in the past or future.
- Variance of the error is independent of time.
What is an autoregressive model (AR)?
AR models rely on the intuition that the past predicts the future and so posits a time series process in which the value at a point in time t is a function of the series’s values at earlier points in time.
What is the equation for the simplest AR model, AR(1)?
y(t) = b0 + b1 * y(t-1) + e(t)
Does a time series need to be stationary to be modelled using AR?
Yes, the time series does need to be stationary.
What is the difference between strong and weak stationarity?
Weak stationarity only requires the mean and the variance of a process to be invariant.
Strong stationarity requires the distribution of the random variables output by a process to remain over time. It demands the statistical distribution of y1, y2, y3, y4 to be the same as y101, y102, y103.
What is the definition of a distribution?
A distribution is a statistical function describing the probabilities for all possible values that a particular value will be generated by a process.
What is Akaike Information Criterion (AIC) for a model?
AIC = 2k - 2lnL, where k is the number of parameters and L is the maximum likelihood value for a that function. In general we want to lessen the complexity of the model (lessen k) but increase the goodness of fit (i.e L), so we favour models with lower AIC’s than higher.
What is a likelihood function?
A likelihood function is a measure of how likely a particular set of parameters for a function is in relation to the other parameters for that function given the data.
For example, when fitting a linear model to y = [1,2,3], x = [1,2,3] using y = b * x, your likelihood function would tell you that an estimate of b = 1 was far more likely than an estimate b = 0.
REFACTOR THIS CARD
What check should we do after fitting an AR(p) model to assess goodness of fit with respect to the models errors?
Plot an ACF of the residuals (error) at each lag to see if any cross the significance threshold.
If we see significant autocorrelation between the errors in a model, what should we do?
Return to the model and consider additional terms to add complexity to account for the significant autocorrelation of the residuals.
What is the Lijung-Box test?
The Lijung-Box test is an overall test of the randomness of a time series. It poses the following:
- H0: The data does not exhibit serial correlation.
- H1: The data does exhibit serial correlation.
When is the Lijung-Box test applied?
The test is commonly applied to AR and ARIMA models, more specifically to their errors rather than the models themselves.
When using an AR model to predict, how should we assess the performance of the predictions?
We should like a the correlation between the raw time series and predictions, but more importantly the correlation between the difference time series and predicted time series.
What are Moving Average (MA) models?
A moving average model is similar to an autoregressive model except that the terms included in the linear equation refer to present and past error terms rather than present and past values of the process itself.