Time series Flashcards

1
Q

define a time series {r_t}

A

A sequence of random variables over time

2
Q

what do the AR, MA, and ARMA models essentially “do”?

A

They make use of the information available prior to some point in time, and the goal is to produce a prediction by exploiting the linear relationship between that prior information and the value being predicted.

3
Q

what is the ultimate focus of linear time series analysis?

A

the aspect of correlation. In time series, it is referred to as autocorrelation.

4
Q

What is the foundation of time series analysis?

A

A concept called stationarity.

5
Q

Define stationarity

A

The book distinguishes between strict and weak stationarity.

Strict stationarity: A time series {r_t} is said to be strictly stationary if the joint distribution of (r_{t1}, r_{t2}, …, r_{tk}) and the joint distribution of the same variables shifted by an arbitrary amount t are identical, for every choice of k and of time points. Same distribution, same mean, same variance, all identical.

Weak stationarity: A time series {r_t} is said to be weakly stationary if both the mean of r_t and the covariance of r_t and r_{t-l} are time invariant. This means that E[r_t] = mu, and cov(r_t, r_{t-l}) = gamma_l, which depends only on l.
Make no mistake: this notation entails that each r_t in the time series {r_t} has the same mean. This makes sure that the trend is flat around this mean value.

Also take time to consider the covariance requirement. It says that the covariance between r_t, which is just a random variable in the time series, and its lag-l counterpart, is the same regardless of which r_t we pick. All that matters is the lag l. Thus, the covariance between each pair of consecutive random variables in the time series must be the same, and the same goes for each pair two steps apart, and so on. This is basically saying that the same relationship among the variables must hold through the entire time series. Note that it is still very plausible that the covariance is close to 0 in many cases, which is totally fine. This is ultimately part of what we want to figure out with time series analysis.

From the requirement that cov() is constant, we also get that the variance of each random variable in the time series must be the same.

Note as well what the covariance requirement does to seasonality. It enforces that there is no seasonality: for instance, the lag-1 covariance being constant for all consecutive pairs of random variables means that we can't get those seasonal swings.
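
In symbols, weak stationarity requires

E[r_t] = \mu \quad \text{for all } t, \qquad \mathrm{Cov}(r_t,\, r_{t-l}) = \gamma_l \quad \text{for all } t \text{ and each lag } l,

and the constant-variance point above is just the special case l = 0: Var(r_t) = Cov(r_t, r_t) = \gamma_0 for every t.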

6
Q

what is lag-l autocovariance

A

Fancy word for the covariance between two random variables separated by a lag of l in a time series.

7
Q

what are we actually considering with {r_t}?

A

I believe a “true” time series with infinitely many random variables. I believe this is the case because of how the book describes samples of a time series as {r_t}^T_{t=0}.

8
Q

how do we denote the lag-l autocorrelation of r_t?

A

rho_l (ρ_l)

9
Q

What are the portmanteau and Ljung-Box tests used for?

A

Testing whether multiple different lag-l autocorrelations are jointly 0 or not.
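
For reference, the Ljung-Box version of the portmanteau statistic for the first m lags is

Q(m) = T(T+2) \sum_{l=1}^{m} \frac{\hat{\rho}_l^2}{T-l},

which is compared against a chi-squared distribution with m degrees of freedom under the null hypothesis that all m autocorrelations are zero.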

10
Q

define white noise

A

White noise is a time series consisting of iid random variables with finite mean and variance.

It is typical to use white noise with mean zero and variance sigma^2. If the variables are in addition normally distributed, it is called Gaussian white noise.

11
Q

what can we say about the ACFs of a white noise series?

A

All of them (at non-zero lags) are zero, because the variables are iid.

12
Q

what is actually being defined here?

A

Here we define what it would entail for the UNDERLYING PROCESS that governs the time series data generation to be linear. It is not related to a linear model.

It simply says that IF the time series is inherently linear, its data generation process will follow that formula.

This means that if we assume that some time series is linear, we basically assume that its data generation process can be described by that formula.

On the specifics, the assumption is that a linear time series follows a weighted sum of white noise terms. This indicates that the next data point is a function of all the previous shocks, each multiplied by some weight.
Also, do not neglect the mu. This is the mean. The white noise series is more about representing a relationship of the form “what happens next if the shock was this”. We know that the mean is mu in aggregate, but our goal is to relate the shocks to future movement in order to make our predictions. We're essentially looking to analyze how the time series reacts to certain shocks.

The more interesting part is how past values are not included. We're only using the mean and how the time series reacts to shocks.
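
For reference, the formula in question (it is restated on a later card) is the linear time series representation

r_t = \mu + \sum_{i=0}^{\infty} w_i a_{t-i},

where {a_t} is a white noise series and the w_i are the weights (with w_0 conventionally equal to 1).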

13
Q

AR models depend on r_{t-i} and not just the white noise. Can they then be defined as linear?

A

Yes, because we can rewrite the model (using the expected value for mu etc.) to get:

r_t - mu = ø_1 (r_{t-1} - mu) + a_t

and if we repeat the substitutions, we get that the result is linear, where the weights are increasing powers of ø_1. Since the ø's must be less than 1 in absolute value, this sum will diminish.
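
Writing the repeated substitution out for the AR(1) case makes the linear form explicit (assuming |ø_1| < 1, so the remainder term dies out):

r_t - \mu = \phi_1 (r_{t-1} - \mu) + a_t = \phi_1^2 (r_{t-2} - \mu) + \phi_1 a_{t-1} + a_t = \dots = \sum_{i=0}^{\infty} \phi_1^i a_{t-i},

i.e. the weights are w_i = ø_1^i, which decay geometrically.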

14
Q

in the book, they do this for all models. They say that “for a weakly stationary AR(1) model….”. what exactly are they referring to? What is weakly stationary?

A

The model in its abstract form represents a data generating process. The idea is that if the underlying process we wish to describe with a model is stationary, we need our model to exhibit the same stationarity features. Because of this, it is important to us to know that our model does this.

Stationarity is a baseline for time series because without stationarity in the data process that generates the time series in the first place, there is nothing to predict. There will be no way to use a time series model to forecast. Because of this, we place great emphasis on the fact that our time series are at least weakly stationary.
And due to the fact that our time series are weakly stationary, we need a model that captures this. And the model should capture it with the same values as well, but this enters into the world of estimation.

15
Q

what is meant by “autocovariance”

A

Autocovariance describes how a random variable in the series relates to its own previous values. It is the covariance of the series with a lagged version of itself.

16
Q

what is the interpretation of autocovariance/covariance?

A

Nothing much, really; its magnitude depends on the scale/units of the variables. This is why we use correlation.

17
Q

express correlation using only gamma

A

Correlation is defined as the covariance divided by the product of the standard deviations. Since the standard deviation is the same regardless of which random variable we look at in the time series, that product is just the variance. Thus we get “covariance/variance”, and for some specific lag we simply take the (auto)covariance and divide by the variance: gamma_l / gamma_0.
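
In symbols:

\rho_l = \frac{\gamma_l}{\sqrt{\gamma_0}\,\sqrt{\gamma_0}} = \frac{\gamma_l}{\gamma_0}, \qquad \rho_0 = 1.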

18
Q

what is the autocorrelation function?

A

The graph we get from plotting the autocorrelation value for each lag.

19
Q

what is the key regarding white noise series

A

the covariance (autocovariance) between different variables in the white noise series is always 0 unless we take the covariance with itself, which is the variance, which is sigma^2.

20
Q

what does the autocorrelation function of white noise look like?

A

A single peak of 1 at the lag 0, all 0 otherwise

21
Q

what is the distribution of a white noise autocorrelation function?

A

Note that this applies only to the sample autocorrelation function of a Gaussian white noise series.

Normal with mean 0 and variance 1/T.

The variance approaches zero as the time series gets longer.

This means that while we should expect a value of 0 for the autocorrelation, it will swing/fluctuate with a variance of 1/T.

We use this in hypothesis testing to see if something is basically white noise or not. If we use a t-ratio test, we're checking the hypothesis that the mean is indeed 0. If it turns out that the observed values are CRITICALLY EXTREME for the t-ratio statistic, we know that the observed autocorrelation is likely not the result of a white noise series, because it indicates that there is significant autocorrelation in our sample for the specific lag we're checking (autocorrelation is specified on a per-lag basis).

22
Q

how can we easily test if something is white noise? Say we have an autocorrelation function, and we want to establish bands indicating significance.

A

Easiest is probably to set up a confidence interval like this:

+- 1.96 x 1/sqrt(T)

this gives us the limits for 95% confidence.
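
A minimal sketch of this check in Python, assuming plain NumPy (the series y, the seed, and the lag count are placeholders of my own):

```python
import numpy as np

def sample_acf(y, max_lag):
    """Sample autocorrelations rho_hat_1 .. rho_hat_max_lag."""
    y = np.asarray(y, dtype=float) - np.mean(y)
    denom = np.sum(y ** 2)
    return np.array([np.sum(y[l:] * y[:-l]) / denom for l in range(1, max_lag + 1)])

T = 500
rng = np.random.default_rng(0)
y = rng.normal(size=T)             # simulated Gaussian white noise
band = 1.96 / np.sqrt(T)           # 95% band under the white-noise null
rho = sample_acf(y, max_lag=20)
print(np.sum(np.abs(rho) > band))  # for true white noise, expect roughly 1 of the 20 lags outside
```

If a real series has many lags outside the band, we would reject the white-noise hypothesis for those lags.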

23
Q

for MA models, we typically neglect the constant mu term. Why?

A

It considerably eases computations, and at no loss of generality. This is because we can obtain a zero-mean series by simply subtracting the mean, shifting all values/observations.

24
Q

what does a q'th order MA model look like?
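
No answer is written out on this card. The standard MA(q) form, consistent with the variance formula two cards below (the sign convention on the thetas varies between textbooks), is

r_t = \mu + a_t + \theta_1 a_{t-1} + \theta_2 a_{t-2} + \dots + \theta_q a_{t-q},

where {a_t} is a white noise series.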

25
Q

what is the mean of an MA(q) series?

26
Q

what is the variance of an MA(q) series?

A

The variance of mu is zero, so that term drops out.
So we have the variance of a sum of white noise variables, each multiplied by its own theta value.
Each theta becomes a squared contribution, and the variance of the white noise random variables is sigma^2. So we get:

var(MA(q)) = (1 + theta_1^2 + theta_2^2 + … + theta_q^2) sigma^2
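
Written out, using the fact that the white-noise terms are independent (so all cross-covariances vanish):

\mathrm{Var}(r_t) = \mathrm{Var}\Big(a_t + \sum_{i=1}^{q} \theta_i a_{t-i}\Big) = \sigma^2 + \sum_{i=1}^{q} \theta_i^2 \sigma^2 = \big(1 + \theta_1^2 + \dots + \theta_q^2\big)\sigma^2.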

27
Q

what can we say about the covariance of MA(q)?

A

For lags s > q, the autocovariance is 0.

For lags up to q, the autocovariance may or may not be zero.

28
Q

An MA process produces its value based on shocks only. But the shocks are random and considered white noise, so how can we predict anything based on this?

A

First of all, the model doesn't rely only on the white noise. We also have the mean component. As a result, a simple MA model is basically trying to see if there is a systematic way the time series moves around its mean based on the specific shocks.

For a true MA process, the time series will fluctuate around a mean, and these fluctuations would be completely determined by the shocks. The shocks are considered random, but the reaction to them is not.

The key is that the assumption when using MA models is that the variable we're interested in will behave in a certain way as a reaction to the shocks. By using MA we're trying to learn that reaction.

Also, it helps to consider the MA as a process: a process where a sequence of white noise shocks together influence the reaction of the time series variable of interest. If a time series is a true MA process, we can capture it fully with an MA model.

29
Q

why is the ACF 0 for lags s > q when the MA is MA(q)?

A

Here we are considering a TRUE MA process. A true MA process uses exactly the q latest white noise variables to produce the output.

The reason why the autocorrelation function of MA(q) models cuts off at lags s > q is that an MA(q) process uses, by definition, exactly the q latest white noise shocks to produce the reaction/output variable value. If the underlying process only uses the latest q white noise shocks to determine the next step in our time series, then there is obviously no correlation with earlier lags. If there happened to be correlation with earlier lags, then it would simply not be a true MA(q) process, and we'd have to increase q to compensate for this.

30
Q

define an AR(p) process

A

An autoregressive process where the time series variable is driven by the p latest variable values, together with a new shock.

31
Q

define an AR(p) model

A

A model that builds on the assumption that the past p variable values determine the next variable value, along with a single white noise term.

32
Q

Does an AR(1) model depend on more shocks than a_t?

A

If we consider an AR(1) process: it can be written as r_t = ø_0 + ø_1 r_{t-1} + a_t.
Then we can apply this recursively and get: r_t = ø_0 + ø_1(ø_0 + ø_1 r_{t-2} + a_{t-1}) + a_t, and so on.
This creates a scenario where modeling r_t as a process that only uses 1 lag entails that we can consider it as a function of the shocks and the coefficients ø_0 and ø_1, which creates an infinite sum that ultimately converges, given that certain requirements on ø_0 and ø_1 are met.

So yes, all AR models depend on all previous shocks.

The key is really “convergence” of the sums that emerge.

33
Q

elaborate on the back operator

A

L, the lag operator. L represents a single lag.

L^x represents x lags. L is used in conjunction with r_t or y_t.
For instance: Ly_t is actually just y_{t-1}, one lag.

L^10 y_t is simply y_{t-10}.

34
Q

elaborate on this formula

A

We have the constant/mean term, which is also typically written as ø_0.

Then we have a sum of p terms, one per order of the AR(p) model we are considering.

Each term in the sum consists of the corresponding parameter ø_i and the lagged variable. The lagged variable is simply represented as L^k y_t, which is y_{t-k}.

The benefit of using the lag operator is that we can separate y_t out of the equation. We are then left with the L^k terms, which form a specific function (a polynomial in L).
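
The formula is not shown on this card, but based on the description it is presumably the AR(p) model written with the lag operator:

y_t = \mu + \sum_{i=1}^{p} \phi_i L^i y_t + u_t, \qquad L^i y_t = y_{t-i}.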

35
Q

elaborate on this formula

A

This formula is the simple AR(p) model, but in heavily compacted form.

We have defined a function ø(L), which is a function of the ø_i params and the back/lag operators.

ø(L) = (1 - ø_1 L - ø_2 L^2 - … - ø_p L^p)

The formula has simply moved the original RHS sum over to the left-hand side and extracted the common y_t using the lag operators.

Of course, separating out y_t allows us to isolate ø(L); setting this polynomial to zero gives what we refer to as the characteristic equation.

36
Q

elaborate on finding the characteristic equation

A

We start from the compact form of the AR(p) model:

ø(L) y_t = mu + u_t

Setting mu = 0 by performing a shift in all the variables we observe (so that we don't lose generality), we get:

ø(L) y_t = u_t

this means we also have:

y_t = (ø(L))^(-1) u_t

y_t is equal to the inverse of the function ø(L) applied to u_t. For this to describe a stationary process, the inverse of ø(L) must exist as a convergent expansion, i.e. its coefficients must die out towards zero.
This will happen if the impact that earlier shocks have on the current variable diminishes as we go further and further back.
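
For the AR(1) case this is easy to see explicitly. Assuming |ø| < 1,

(1 - \phi L)^{-1} = 1 + \phi L + \phi^2 L^2 + \phi^3 L^3 + \dots,

so y_t = u_t + ø u_{t-1} + ø^2 u_{t-2} + …, a convergent MA(∞) representation. If |ø| ≥ 1 the coefficients never die out and the expansion does not converge.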

37
Q

important card.

elaborate on values for phi in AR models

A

Again, it is important to understand the process. If some process is AR(1) and phi_1 is larger than 1 in absolute value, then it will simply explode: the process will generate numbers that get bigger and bigger and basically just grow towards infinity.

The idea is that if we let the time series continue, what can we say about where it is heading? For stationarity, we want it to remain flat around some mean.

38
Q

what is the mean of an AR model?

A

Not simply the first phi value (the intercept), at least; we need to consider the convergence.

The key is to understand that the mean is constant for a stationary process, which means that we can use the property that the mean of y_t is the same as the mean of Ly_t, which is the same as the mean of L^2 y_t, and so on. This allows us to separate/isolate an expression for the expectation. For an AR(1) model this gives E[y_t] = ø_0 / (1 - ø_1).

39
Q

why can't we just check the individual phi values of the AR(p) model to see whether it will converge or not?

A

Because for orders higher than 1, it is a recurrence relation that involves more than one phi, and then the relationship is not that straightforward. The recurrence can converge without every phi being less than 1 in absolute value. Stationarity depends on the combination.

40
Q

how do we check whether an AR model is stationary or not?

A

Solve the characteristic polynomial. The roots must lie outside the unit circle.

Behind this is math surrounding the concepts of:
- Recurrence relations
- Eigenvalues
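
A minimal sketch of the root check in Python, assuming a hypothetical AR(2) model with ø_1 = 0.5 and ø_2 = 0.3:

```python
import numpy as np

# Characteristic polynomial of the AR(2): 1 - 0.5 L - 0.3 L^2 = 0.
# np.roots expects coefficients ordered from the highest power of L downwards.
phi1, phi2 = 0.5, 0.3
coeffs = [-phi2, -phi1, 1.0]        # -0.3 L^2 - 0.5 L + 1
roots = np.roots(coeffs)

print(roots)                        # two roots, roughly -2.84 and 1.17
print(np.all(np.abs(roots) > 1.0))  # True -> all roots outside the unit circle -> stationary
```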

41
Q

is the process y_t = y_{t-1} + u_t stationary?

A

No, it is actually known as a random walk.

formally, we can check it:

(1 - øL) y_t = u_t, with ø = 1

1/(1 - øL) is the inverse
1 - øL = 0 is the characteristic eq.

1 = øL
L = 1/ø
L = 1/1 = 1

The root is 1. 1 is not outside the unit circle, which means that the process is non-stationary.

42
Q

what is yule walker?

A

Yule-Walker is a set of equations that can be solved as a system of equations. It links the AR parameters to the autocorrelations, so solving it basically gives all the autocorrelations.
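
For reference, the Yule-Walker equations for an AR(p) process are

\rho_l = \phi_1 \rho_{l-1} + \phi_2 \rho_{l-2} + \dots + \phi_p \rho_{l-p}, \qquad l = 1, 2, \dots

with ρ_0 = 1. Stacking the first p of these gives a linear system connecting the ø's and the ρ's, which can be solved in either direction.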

43
Q

define ARMA process

A

An ARMA process is a time series where the next value is determined by a combination of some past values and a linear combination of past shocks. It combines AR with MA.

We basically just add them together.
Since we add them together, ARMA will have a geometrically declining ACF. Therefore, the first step of identifying ARMA is probably to notice the declining ACF.

The same happens for the MA part and the PACF.

ARMA has a geometrically declining ACF and PACF; the effects are additive.

44
Q

elaborate on intuition behind ACF of MA process

A

We're asking “is there a correlation between the current y_t and y_{t-k}?”. For a true MA process of order q with k ≤ q, we know that y_{t-k} was determined based on whatever the noise at that time was. And since we also include this noise in the determination of the current point, there is a correlation between them.

At the same time, this highlights why the ACF for moving averages suddenly drops. The true MA process simply doesn't rely on any common information once we go further back than the order.

45
Q

Define PACF

A

The PACF at lag k is the correlation between observations k lags apart after accounting for the effects of the intermediate lags.

46
Q

what can cause alternating ACF/PACF

A

negative values in the MA

47
Q

what do we mean by information criteria?

A

A two-term construct. It will have one term related to the RSS, and one term that penalises having more parameters.

When we add a parameter to a model, the RSS will likely drop (it can never increase), but the penalty term will increase. These are competing effects that determine whether we consider the extra parameter a good choice or not.

When we use information criteria as the basis for selection, we want to minimize them.

48
Q

name the most popular information criteria, and give their forms

A

AIC, SBIC, HQIC.

They are based on the same ingredients:
sigma^2 is the residual variance (the RSS divided by T), k is the number of parameters, and T is the sample size.

AIC = ln(sigma^2) + 2k/T

SBIC = ln(sigma^2) + (k/T) lnT

HQIC = ln(sigma^2) + (2k/T)ln(ln(T))
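
A minimal sketch implementing these three formulas directly (the function name and example numbers are my own, not from the source; resid_var is the residual variance, k the parameter count, T the sample size):

```python
import math

def info_criteria(resid_var: float, k: int, T: int) -> dict:
    """AIC, SBIC and HQIC as given above (resid_var = RSS / T)."""
    log_var = math.log(resid_var)
    return {
        "AIC": log_var + 2 * k / T,
        "SBIC": log_var + (k / T) * math.log(T),
        "HQIC": log_var + (2 * k / T) * math.log(math.log(T)),
    }

# Hypothetical comparison: ARMA(1,1) fit (k = 3) vs ARMA(2,2) fit (k = 5) on T = 250 points
print(info_criteria(resid_var=1.40, k=3, T=250))
print(info_criteria(resid_var=1.37, k=5, T=250))  # pick the model with the smaller criterion values
```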

49
Q

Recall the expression that any true linear time series process must follow.

Why does it not include any past values of the random variable we’re looking at?

A

r_t = \mu + \sum_{i=0}^{\infty} w_i a_{t-i}

It doesn't include any past variables r_{t-l} because these can be recursively expanded. The recursion ultimately bottoms out (in the infinite limit) and we are left with only shocks and weights.

50
Q

why do we need the mean to be constant for stationarity?

A

If it is not constant, the values we use in the autocorrelation function will be different depending on where we are along the time series, which makes our predictions bad.

51
Q

we know that white noise autocorrelation is typically assumed to be normally distributed. The sample distribution is normal with mean 0 and variance 1/T. why do we care?

A

We can use this to test whether a time series is white noise (just random noise) or not. If we have a large time series sample, the variance is extremely low. This means that if we use the t-ratio test to check whether the correlation for a certain lag is zero or not, the test is quite sharp: even modest sample autocorrelations become distinguishable from zero.

52
Q

why does the MA model definition not multiply a parameter theta with the a_t noise term?

A

Because of how we defined linear time series: that term would be index 0 in the sum, which corresponds to a weight of 1 (a power of 0 gives 1). It helps to consider the collapsed AR version, which creates increasing powers of the weights.

53
Q

for MA models, what is key in computing ACF etc?

A

We need to remember that the cross products are 0 in expectation, because the white noise series is iid.

This is in the context of taking the expected value of a cross product. Since independent variables satisfy E[XY] = E[X]E[Y], we can make use of the fact that the expected value of each random variable in the white noise series is zero. This is why we can neglect the cross products.