Econometrics Flashcards

Question

Why are dummy variables useful in time series?

Answer 1

A dummy variable will represent whether, in each time period, a certain event has occurred.

Answer 2

Can be done by regressing that variable on a linear time trend and computing its residuals. Its fundamental use is to create a new variable with the trend removed, thereby removing the part of the given independent variable(s) that trends with time.

Answer 3

Regressing detrended variables yields the same slope estimates on x as one would obtain when a time trend is added. Generally, if you have a variable that is trending, it is a good idea to add a time trend.
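
A minimal sketch of this equivalence on simulated data. The coefficients, seed and the helper name `ols` are illustrative assumptions, not part of the cards.

```python
import numpy as np

def ols(X, y):
    """Return OLS coefficients of y on the columns of X."""
    return np.linalg.lstsq(X, y, rcond=None)[0]

rng = np.random.default_rng(0)  # arbitrary seed for a reproducible toy sample
n = 200
t = np.arange(n, dtype=float)
x = 1.0 + 0.05 * t + rng.normal(size=n)            # trending regressor
y = 2.0 + 0.8 * x + 0.03 * t + rng.normal(size=n)  # trending outcome

const = np.ones(n)
trend = np.column_stack([const, t])

# Route 1: include the time trend directly in the regression.
slope_with_trend = ols(np.column_stack([const, x, t]), y)[1]

# Route 2: detrend y and x (residuals from regressing each on 1 and t),
# then regress detrended y on detrended x.
y_dt = y - trend @ ols(trend, y)
x_dt = x - trend @ ols(trend, x)
slope_detrended = ols(x_dt.reshape(-1, 1), y_dt)[0]

print(slope_with_trend, slope_detrended)
```

The two printed slopes should agree up to floating-point error; only the slope on x is compared here, not the standard errors.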

Answer 4

A variable that appears to increase over time for no reason other than some other factor being the causal reason for this.

Answer 5

It is often the case that growth is more exponential, whereby it is increasing at an increasing rate.

Answer 6

80/20 almost natural law. Roughly states that 80% of the effects come from 20% of the causes.

Answer 7

Occurs when a relationship is found between two or more trending variables (in time) simply because each is growing over time. Adding a time trend eliminates this problem.
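
A small simulated illustration, assuming two series that share nothing except a linear trend (all numbers are made up for the sketch). Without the trend, the slope on x looks large; with the trend added, it falls to roughly zero.

```python
import numpy as np

rng = np.random.default_rng(1)  # arbitrary seed
n = 200
t = np.arange(n, dtype=float)
x = 2.0 + 0.3 * t + rng.normal(size=n)  # trends upward
y = 1.0 + 0.5 * t + rng.normal(size=n)  # trends upward; x has no true effect on y

const = np.ones(n)

# Regression of y on x alone picks up the common trend.
slope_no_trend = np.linalg.lstsq(np.column_stack([const, x]), y, rcond=None)[0][1]

# Adding t as a regressor removes the spurious relationship.
slope_with_trend = np.linalg.lstsq(np.column_stack([const, x, t]), y, rcond=None)[0][1]

print(slope_no_trend, slope_with_trend)
```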

Answer 8

If a trend term is statistically significant and the results change in important ways when a time trend is added to a regression

Answer 9

Dummy variables used to shift the dependent variable at certain times of the year.

Answer 10

Violates 'No perfect collinearity'. Occurs when the base year variable isn't omitted from the equation.

Answer 11

Similar to detrending the data. It makes a new dummy variable to account for seasonal changes in y.

Answer 12

A stationary time series process is one whose probability distributions are stable over time in the following sense: If we take any collection of random variables in the sequence and then shift that sequence ahead h time periods, the joint probability distribution must remain unchanged.

Answer 13

The LLN and CLT.

Answer 14

The joint distributions of a process as it moves through time. The mean, variance and autocorrelation structure do not change over time.

Answer 15

A time series consisting of elements that are generally less correlated the further away they are from each other. Basically nearly independent, so that the correlation between xt and xt+h goes to 0 sufficiently quickly as h approaches infinity.

Answer 16

With respect to weakly dependent time series: as h tends to infinity, the correlation between xt and xt+h goes to zero. Covariance stationary processes with this property are said to be asymptotically uncorrelated.

Answer 17

Under assumptions TS.1-TS.3

Answer 18

Under assumption TS.1-TS.5

Answer 19

Efficient Market Hypothesis - States that the most recent information contains all that is required to make a prediction.

Answer 20

Otherwise all the data would give us no information about future data. This dependence will fall as the distance between observations grows, though.

Answer 21

Just a time series of i.i.d variables. This enables it to be modelled in processes.

Answer 22

Adjacent observations in the series are correlated, but variables two or more periods apart are independent. Because e is independent across t, xt and xt+h are independent whenever h is greater than 1. Therefore MA(1) is weakly dependent and stationary, and the law of large numbers and the central limit theorem can be applied to x.
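
A simulated check of this pattern, assuming the usual MA(1) form x_t = e_t + alpha * e_{t-1} with i.i.d. normal e. The value of `alpha`, the seed and the helper `sample_autocorr` are illustrative choices.

```python
import numpy as np

def sample_autocorr(x, h):
    """Sample autocorrelation of x at lag h (h >= 1)."""
    x = x - x.mean()
    return float(np.dot(x[:-h], x[h:]) / np.dot(x, x))

rng = np.random.default_rng(2)  # arbitrary seed
alpha = 0.5
e = rng.normal(size=100_001)
x = e[1:] + alpha * e[:-1]      # x_t = e_t + alpha * e_{t-1}

r1 = sample_autocorr(x, 1)
r2 = sample_autocorr(x, 2)
theory_r1 = alpha / (1 + alpha**2)  # lag-1 autocorrelation implied by the MA(1) form

print(r1, theory_r1, r2)
```

The lag-1 value should sit near alpha / (1 + alpha^2), while the lag-2 value should be close to zero.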

Answer 23

The crucial assumption for the weak dependence of AR(1). The stability condition is |ρ| < 1, which intuitively means that -----------

Answer 24

If it violates one of the stationarity conditions.

Answer 25

1. E(yt) = μ 2. Var(yt) = σ^2 3. Cov(yt, ys) = Cov(yt+h, ys+h). Used when there is a finite second moment.

Answer 26

xt has the same distribution for all t. It is identically distributed. The stationarity definition means that the covariance between xt and xt+h depends only on h.

Answer 27

Involves the joint distributions of a process as it moves through time.

Answer 28

Assumes that xt and xt+h are 'almost independent' as h increases without bound. We assume a series is weakly dependent if the covariance between xt and xt+h goes to zero as h grows.

Answer 29

It replaces the assumption of random sampling in implying that the LLN and CLT hold.

Answer 30

d) Time series tend to be serially-correlated

Answer 31

β₀ + β₁ + β₂

Answer 32

Strict exogeneity

Answer 33

No, only contemporaneous exogeneity (along with TS1' and TS2')

Answer 34

a) E(Xt)=E(Xt+h) for all t,h b) Var(Xt)=Var(Xt+h) for all t,h c) Cov(Xt,Xt+h)=Cov(Xs,Xs+h) for all t,s,h d) All of the above. d

Answer 35

Cannot say with the given information

Answer 36

Moving average

Answer 37

Variables that are far apart in time have either no or weak correlation

Answer 38

They have constant variance

Answer 39

Cannot say with the given information.

Answer 40

E(yt|y₁,…,yt-1) = E(yt) Not: E(yt|y₁,…,ys) = E(ys) for 1

Answer 41

yt = α + βyt-1 + ut

Answer 42

The slope parameter is significant and R-squared may be very high

Answer 43

First differences may be weakly dependent

Answer 44

Test if the error terms are correlated to one another

Answer 45

Perform OLS on all the regressors to obtain the residuals

Answer 46

The OLS estimator is both unbiased and consistent

Answer 47

The Cochrane-Orcutt procedure may yield a more efficient estimator than OLS

Answer 48

Yes, serial correlation-robust standard errors can be used

Answer 49

a) E(Xt) = E(Xt+h) for all t, h b) Var(Xt) = Var(Xt+h) for all t, h c) Cov(Xt, Xt+h) = Cov(Xs, Xs+h) for all t, s, h d) All of the above. All of the above

Answer 50

Cannot say with the given information

Answer 51

Cannot say with the given information

Answer 52

It is nonstationary

Answer 53

It is nonstationary

Answer 54

To improve on the efficiency of the OLS estimator

Answer 55

CO omits the first observation while PW uses all observations
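
A sketch of the data transformation behind this difference, assuming the AR(1) parameter `rho` is already known (in practice both procedures estimate it, usually iteratively). The function name `quasi_difference` and the toy numbers are hypothetical.

```python
import numpy as np

def quasi_difference(v, rho, keep_first):
    """Quasi-difference a series: v_t - rho * v_{t-1} for t >= 2.

    keep_first=False drops the first observation (Cochrane-Orcutt style).
    keep_first=True keeps it, rescaled by sqrt(1 - rho^2) (Prais-Winsten style).
    """
    v = np.asarray(v, dtype=float)
    body = v[1:] - rho * v[:-1]
    if keep_first:
        first = np.sqrt(1.0 - rho**2) * v[0]
        return np.concatenate([[first], body])
    return body

y = np.array([1.0, 2.0, 4.0, 7.0])
rho = 0.5
co = quasi_difference(y, rho, keep_first=False)  # 3 observations
pw = quasi_difference(y, rho, keep_first=True)   # 4 observations

print(co, pw)
```

The same transformation would be applied to every regressor (including the constant) before running OLS on the transformed data.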

Answer 56

Cannot say. Depends on sample size

Answer 57

Perform OLS on all the regressors to obtain the residuals

Answer 58

Regress the residual on its lagged values, as well as the other regressors in the original model
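
The two steps (OLS residuals, then a regression of the residual on its lag and the original regressors) can be sketched as follows. This assumes a single lag and simulated AR(1) errors; `ols_with_se`, the seed and all parameter values are illustrative.

```python
import numpy as np

def ols_with_se(X, y):
    """OLS coefficients, conventional standard errors, and residuals."""
    beta = np.linalg.lstsq(X, y, rcond=None)[0]
    resid = y - X @ beta
    sigma2 = resid @ resid / (len(y) - X.shape[1])
    se = np.sqrt(np.diag(sigma2 * np.linalg.inv(X.T @ X)))
    return beta, se, resid

rng = np.random.default_rng(3)  # arbitrary seed
n, rho = 500, 0.6
x = rng.normal(size=n)
eps = rng.normal(size=n)
u = np.zeros(n)
for i in range(1, n):
    u[i] = rho * u[i - 1] + eps[i]   # AR(1) errors
y = 1.0 + 2.0 * x + u

# Step 1: OLS of y on all regressors, keep the residuals.
X = np.column_stack([np.ones(n), x])
_, _, uhat = ols_with_se(X, y)

# Step 2: regress the residual on the original regressors and its own lag.
Z = np.column_stack([X[1:], uhat[:-1]])
gamma, se, _ = ols_with_se(Z, uhat[1:])
rho_hat = gamma[-1]
t_stat = rho_hat / se[-1]   # a large |t| points to serial correlation

print(rho_hat, t_stat)
```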

Answer 59

Regression involving highly persistent dependent and independent variables may be spurious

Answer 60

```
AR can be used to model time series that exhibit persistence that may (or may not!) disperse over time
MA only allows correlations over a finite time lag
```

Answer 61

Persistence intuitively is the reverse of weak dependence

Answer 62

Will be a nonstationary process since its variance changes over time. Not weakly dependent.

Answer 63

a persistent time series.

Answer 64

when two variables are correlated through a third variable. E.g., if you regress y on x, you may find a highly significant relationship but this disappears as soon as you add another variable, say w.

Answer 65

To test for a unit root. Construct the t-ratio and compare it with the critical values.
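
A rough sketch of such a t-ratio, assuming the Dickey-Fuller form: regress the first difference on a constant and the lagged level, then take the t-ratio on the lagged level. The helper `df_t_ratio` and the simulated series are illustrative. The value -2.86 is the commonly tabulated asymptotic 5% critical value for the constant-only case and should be checked against a proper table before use.

```python
import numpy as np

def df_t_ratio(y):
    """t-ratio on y_{t-1} in the regression of (y_t - y_{t-1}) on 1 and y_{t-1}."""
    dy = np.diff(y)
    X = np.column_stack([np.ones(len(dy)), y[:-1]])
    beta = np.linalg.lstsq(X, dy, rcond=None)[0]
    resid = dy - X @ beta
    sigma2 = resid @ resid / (len(dy) - X.shape[1])
    se = np.sqrt(sigma2 * np.linalg.inv(X.T @ X)[1, 1])
    return float(beta[1] / se)

rng = np.random.default_rng(8)  # arbitrary seed
e = rng.normal(size=500)
random_walk = np.cumsum(e)      # has a unit root
stable = np.zeros(500)
for t in range(1, 500):
    stable[t] = 0.5 * stable[t - 1] + e[t]   # stable AR(1), no unit root

DF_CRIT_5PCT = -2.86            # assumed tabulated value, constant-only case
t_rw = df_t_ratio(random_walk)
t_stable = df_t_ratio(stable)

# Reject the unit-root null when the t-ratio falls below the critical value.
print(t_rw, t_stable, t_stable < DF_CRIT_5PCT)
```

The t-ratio is compared with Dickey-Fuller critical values rather than the usual t table, because its distribution under the null is nonstandard.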

Answer 66

Generalized least squares. More efficient estimator than OLS if ut is autocorrelated (TS5 violated).

Answer 67

The Lagrange Multiplier (LM) test.

Answer 68

- OLS is unbiased and inefficient.
- FGLS is biased but more efficient with a large sample.
- Both estimators are consistent with a large sample.
- Preference of one over another will depend on the sample size.

Answer 69

When errors are autocorrelated and inference needs adjusting.

Answer 70

The Lagrange multiplier (LM) test.

Answer 71

When the sample size is large enough. With a large sample size FGLS may be better than the OLS estimator.

Answer 72

```
MLR1: Linear in Parameters
MLR2: Random Sampling
MLR3: No Perfect Multicollinearity
MLR4: Zero Conditional Mean on Unobserved Term
MLR5: Homoskedasticity
```

Answer 73

Under GM 1-5.

Answer 74

- Book and Slides

Answer 75

An estimator is unbiased if, when we take a random draw of Beta1(HAT) many times, the average is Beta1.

Answer 76

For a single draw of B1,n(HAT) as n tends to infinity, we expect B1,n(HAT) ->Beta1

Answer 77

Try to find and use a suitable proxy variable for the unobserved variable. If not, we can leave the unobserved variable in the error term and, rather than using OLS, use an estimation method that recognizes the presence of the omitted variable (the method of instrumental variables).

Answer 78

If the remaining variables in the model being estimated are correlated with the extra variable put into the error term.

Answer 79

Instrumental variables.

Answer 80

z will be an instrumental variable for x, or simply an instrument for x, if: 1. Cov(z,u) = 0 (uncorrelated with the error; we also call this instrument exogeneity) 2. Cov(z,x) ≠ 0 (correlated with x). These allow us to identify the parameter β1.
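
A simulated sketch of why the two conditions identify β1. Under them, β1 = Cov(z,y) / Cov(z,x), which is the standard simple IV formula. The data-generating numbers, the unobserved factor `a` and the helper `cov` are assumptions made for illustration.

```python
import numpy as np

def cov(p, q):
    """Sample covariance (population-style divisor, fine for large n)."""
    return float(np.mean((p - p.mean()) * (q - q.mean())))

rng = np.random.default_rng(4)  # arbitrary seed
n = 50_000
z = rng.normal(size=n)                  # instrument: moves x, unrelated to u
a = rng.normal(size=n)                  # unobserved factor, ends up in the error
x = 0.8 * z + a + rng.normal(size=n)    # Cov(z, x) != 0
u = a + rng.normal(size=n)              # Cov(z, u) = 0 but Cov(x, u) != 0
y = 1.0 + 2.0 * x + u                   # true beta1 = 2

beta_ols = cov(x, y) / cov(x, x)        # inconsistent here because x is endogenous
beta_iv = cov(z, y) / cov(z, x)         # simple IV estimator

print(beta_ols, beta_iv)
```

In this setup OLS should drift above 2 while the IV estimate stays close to it.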

Answer 81

That z should have no partial effect on y (after x and omitted variables have been controlled for), and z should be uncorrelated with the omitted variables. So z should only be correlated with the explanatory variable it is an instrument for

Answer 82

A variable generated by a statistical model that is explained by the relationships between functions within the model.

Answer 83

An exogenous change is one that comes from outside the model and is unexplained by the model.

Answer 84

β1 = Cov(x,y) / Var(x)

Answer 85

We can estimate a simple regression between x and z: x = λ0 + λ1z + v. As λ1 = Cov(z,x)/Var(z), we know that our instrumental variable is good only if Cov(z,x) ≠ 0. Thus we want to reject the null of λ1 = 0 against the two-sided alternative at a sufficiently small significance level.

Answer 86

Instrumental variables are correlated with an explanatory variable and not with any unobserved terms. A proxy variable must be highly correlated with an unattainable or unobserved term, so as to best approximate it.

Answer 87

IQ, family background, number of siblings

Answer 88

They can be used to solve the errors in variables problem and the problem of endogeneity of one or more explanatory variables.

Answer 89

When MLR4 is violated, and so Cov(xi, ui) ≠ 0

Answer 90

Corr ≠ Corr(hat), so that an IV estimator can be worse than OLS, especially if Corr(hat)(zi,xi) is low, meaning it is a weak instrument. Also, an IV estimator will always be biased in finite samples.

Answer 91

When a regressor is endogenous.

Answer 92

The Zero conditional mean rule.

Answer 93

y used to signify that we think it is correlated with u. | z used to signify that we think the explanatory variable is exogenous of u.

Answer 94

No, as it already appears as an explanatory variable; thus we need another exogenous variable.

Answer 95

We have written an endogenous variable in terms of exogenous variables.

Answer 96

No perfect linear relationships among the exogenous variables; which is analogous to the assumption of no perfect collinearity in the context of OLS.

Answer 97

Any combination of instruments is in fact also an instrument, e.g. a1z1 + a2z2. 2SLS is then used to figure out the best combination.

Answer 98

Picks the best linear combination of instruments. Using the instruments one at a time would give multiple IV estimators, none of which would be efficient; since each instrument is uncorrelated with u, any linear combination is also uncorrelated with u.

Answer 99

1. Use OLS to regress y2 on 1, z1, z2 to obtain the fitted value y(HAT)2 2. Use OLS to regress y1 on 1, y(HAT)2 to obtain the 2SLS estimators for β0 and β2.

Answer 100

With one instrument, it is the same, with more it is more efficient.

Answer 101

Whether the extra IVs are partially correlated with y2 once the exogenous variable also used to represent y2 is controlled for. If both coefficients = 0, we will suffer from perfect collinearity. Thus we use an F-test with the null that each one = 0.

Answer 102

1. Use OLS to regress each endogenous variable in a reduced form equation, and collect the fitted values. 2. Use OLS to regress y on the structural equation with all endogenous variables replaced by their fitted values from stage (1)
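
The two stages can be sketched on simulated data as follows. This assumes one endogenous regressor y2, one included exogenous variable z1 and two excluded instruments z2 and z3; the helper `ols` and all coefficients are illustrative.

```python
import numpy as np

def ols(X, y):
    """Return OLS coefficients of y on the columns of X."""
    return np.linalg.lstsq(X, y, rcond=None)[0]

rng = np.random.default_rng(5)  # arbitrary seed
n = 50_000
z1, z2, z3 = rng.normal(size=(3, n))
a = rng.normal(size=n)                          # unobserved confounder
y2 = 0.5 * z1 + 0.7 * z2 + 0.4 * z3 + a + rng.normal(size=n)
u1 = a + rng.normal(size=n)
y1 = 1.0 + 2.0 * y2 - 1.0 * z1 + u1             # structural equation

const = np.ones(n)

# Stage 1: reduced form, regress y2 on ALL exogenous variables, keep fitted values.
Z = np.column_stack([const, z1, z2, z3])
y2_hat = Z @ ols(Z, y2)

# Stage 2: structural equation with y2 replaced by its fitted values.
b_2sls = ols(np.column_stack([const, y2_hat, z1]), y1)

# Plain OLS for comparison (inconsistent here, since y2 is correlated with u1).
b_ols = ols(np.column_stack([const, y2, z1]), y1)

print(b_2sls, b_ols)
```

Running the second stage by hand gives the right coefficients, but the standard errors printed by a naive second-stage OLS would not be valid; dedicated 2SLS routines correct for this.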

Answer 103

If the two instruments are not jointly significant.

Answer 104

We need at least as many excluded exogenous variables as there are included endogenous explanatory variables in the structural equation.

Answer 105

Number of endogenous variables: k | Number of instruments: q.

Answer 106

If k = q, we can just identify the parameters of interest (just identified). If k > q, we cannot identify them (underidentified). If k < q, we can use different instruments (overidentified).
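
The counting rule can be written as a tiny helper, with k the number of endogenous variables and q the number of instruments. The function name is hypothetical, and this checks only the order condition.

```python
def identification_status(k: int, q: int) -> str:
    """Classify an equation by comparing endogenous variables (k) with instruments (q)."""
    if q < k:
        return "underidentified"   # too few instruments
    if q == k:
        return "just identified"   # exactly enough instruments
    return "overidentified"        # more instruments than needed

print(identification_status(k=1, q=2))
```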

Answer 107

1. Estimate the structural equation using 2SLS and obtain the residuals. 2. Regress the residuals on all exogenous variables to obtain R(SQUARED). 3. Construct a test statistic of the form nR(SQUARED) that has Chi Squared with q - k df distribution, with df equal to the degree of over identification.

Answer 108

1. Estimate the structural equation using 2SLS and obtain the residuals. 2. Regress the residuals on all exogenous variables to obtain R(SQUARED). 3. Construct a test statistic of the form nR(SQUARED) that has Chi Squared with q - k df distribution, with df equal to the degree of over identification. Rejecting the null means at least one of the two instruments is invalid, but not rejecting the null doesn't necessarily mean both instruments are valid.
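
A sketch of these three steps with one endogenous regressor and two instruments, so q - k = 1. The simulated data, the helper names and the hard-coded 5% chi-squared critical value for 1 df (3.841) are illustrative assumptions.

```python
import numpy as np

def ols(X, y):
    return np.linalg.lstsq(X, y, rcond=None)[0]

def overid_stat(y1, y2, instruments):
    """n * R^2 from regressing 2SLS residuals on all exogenous variables."""
    n = len(y1)
    const = np.ones(n)
    Z = np.column_stack([const, instruments])
    # Step 1: 2SLS, then residuals computed with the ACTUAL y2, not the fitted values.
    y2_hat = Z @ ols(Z, y2)
    b = ols(np.column_stack([const, y2_hat]), y1)
    resid = y1 - np.column_stack([const, y2]) @ b
    # Step 2: regress the residuals on all exogenous variables and get R^2.
    fitted = Z @ ols(Z, resid)
    r2 = 1.0 - np.sum((resid - fitted) ** 2) / np.sum((resid - resid.mean()) ** 2)
    # Step 3: the statistic n * R^2.
    return n * r2

rng = np.random.default_rng(6)  # arbitrary seed
n = 5_000
z2, z3 = rng.normal(size=(2, n))
a = rng.normal(size=n)
y2 = 0.7 * z2 + 0.4 * z3 + a + rng.normal(size=n)
u_valid = a + rng.normal(size=n)      # both instruments exogenous
u_invalid = u_valid + 0.5 * z3        # z3 now enters the error: invalid instrument
inst = np.column_stack([z2, z3])

CHI2_1_CRIT_5PCT = 3.841              # 5% critical value, chi-squared with 1 df
stat_valid = overid_stat(1.0 + 2.0 * y2 + u_valid, y2, inst)
stat_invalid = overid_stat(1.0 + 2.0 * y2 + u_invalid, y2, inst)

print(stat_valid, stat_invalid, stat_invalid > CHI2_1_CRIT_5PCT)
```

With the invalid instrument the statistic should be far above the critical value; with valid instruments it behaves like a chi-squared draw and will still reject about 5% of the time.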

Answer 109

Comparing different IV estimates of the same parameter. The idea is that we have more instruments to estimate the parameter than we need.

Answer 110

Relevance means that the instrument is partially correlated with the explanatory variable. Exogeneity means that the instrument is uncorrelated with the error term.

Answer 111

If we have two instruments for y2, we will be able to estimate the equation twice. If the instruments are both exogenous and relevant, the only difference between the two results will be sampling error. If the estimates are statistically different from each other then we must conclude that one or both of the instruments fail the exogeneity requirement.

Answer 112

Even if both instruments fail exogeneity, they may be similar simply by the nature of how they were picked and so have a similar outcome; this may lead to similar estimates even though both estimators are inconsistent.

Answer 113

Even if both instruments fail exogeneity, they may be similar simply by the nature of how they were picked and so have a similar outcome; this may lead to similar estimates even though both estimators are inconsistent.

Answer 114

Measurement error and omitted variables. Instruments can be used to solve these.

Answer 115

Arises when one or more of the explanatory variables is jointly determined with the dependent variable. This basically means that x causes y but y also causes x.

Answer 116

Simultaneous equation model. Used when we have two structural equations but

Answer 117

Neither equation can stand on its own as the two endogenous variables are chosen by the same economic agent. The agent will choose the amounts of time spent working and studying simultaneously. Therefore, it makes no sense to specify two equations where each is a function of the other.

Answer 118

Each equation must have a clear ceteris paribus interpretation. We use SEMs to postulate another relationship, namely one that essentially removes the endogenous variables from the two structural equations, making a third equation through substitution that contains only exogenous variables: the reduced form equation for y2.

Answer 119

Simultaneity bias arises because the second equation is a function of the first equation, so the error in the second equation will be contained in the first equation when substituted in. There will be bias if y2 is correlated with u1. Therefore using OLS in a simultaneous system leads to biased and inconsistent results.

Answer 120

π: Reduced form parameters | V2: Reduced form error.

Answer 121

An IV estimator is consistent and biased.

Answer 122

We have two structural equations already so can immediately see if we have enough IVs.

Answer 123

Instrumental Variable.

Answer 124

The equation that we will be able to use IVs for to get rid of endogenous variables. In the case of simultaneous equations it is generally the equation that has fewer z variables.

Answer 125

It causes us to estimate the demand equation with error, but the estimators will be consistent provided z1 is uncorrelated with u2.

Answer 126

If we do not have any exogenous observed factors shifting the other equation, which would allow us to trace out the former equation. E.g. if we have an observed exogenous variable in demand, we can trace out the supply curve and vice versa.

Answer 127

Error terms in a structural equation.

Answer 128

Over: several options for IVs, and 2SLS is to be used. Just: IV can be used. Under: the equation generally cannot be consistently estimated.

Answer 129

```
1. Strict exogeneity: E(ut | X) = 0
2. Sequential exogeneity: E(ut | x1, ..., xt) = 0
3. Contemporaneous exogeneity: E(ut | xt) = 0
```

Answer 130

A time series process in which the correlation between random variables at two points in time tends to zero as the time interval between them increases.

Answer 131

A time series model whose current value depends linearly on its most recent value plus an unpredictable disturbance.

Answer 132

In time series or panel data applications, a regressor is contemporaneously exogenous if it is uncorrelated with the error term in the same time period, although it may be correlated with the errors in other time periods.

Answer 133

In time series or panel data applications, the variance of the error term, conditional on the regressors in the same time period, is constant.

Answer 134

A time series process with constant mean and variance where the covariance between any two random variables in the sequence depends only on the distance between them.

Answer 135

For a time series process ordered chronologically, the correlation coefficient between pairs of adjacent observations.

Answer 136

A time series process where outcomes in the distant future are highly correlated with current outcomes.

Answer 137

A time series process generated as a linear function of the current value and one lagged value of a zero-mean, constant variance, uncorrelated stochastic process.

Answer 138

A time series process whose joint distributions are not constant across different epochs.

Answer 139

A time series process where next period's value is obtained as this period's value, plus an independent (or at least an uncorrelated) error term.
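
A quick simulation of why such a process is not stationary, assuming i.i.d. standard normal errors and a starting value of zero: the variance across many simulated paths grows roughly in proportion to t. The path counts and the seed are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(9)  # arbitrary seed
n_paths, n_steps = 5_000, 100
shocks = rng.normal(size=(n_paths, n_steps))

# y_t = y_{t-1} + e_t with y_0 = 0, so y_t is the running sum of the shocks.
paths = np.cumsum(shocks, axis=1)

var_t25 = paths[:, 24].var()    # variance across paths at t = 25
var_t100 = paths[:, 99].var()   # variance across paths at t = 100

print(var_t25, var_t100)
```

With unit-variance errors the two values should be close to 25 and 100, so the variance depends on t.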

Answer 140

A random walk that has a constant (or drift) added in each period.

Answer 141

A feature of an explanatory variable in time series (or panel data) models where the error term in the current time period has a zero mean conditional on all current and past explanatory variables; a weaker version is stated in terms of zero correlations.

Answer 142

The errors in a time series or panel data model are pairwise uncorrelated across time.

Answer 143

An AR(1) process where the parameter on the lag is less than one in absolute value. The correlation between two random variables in the sequence declines to zero at a geometric rate as the distance between the random variables increases, and so a stable AR(1) process is weakly dependent.
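
The geometric decline can be checked on a simulated series. This is a sketch assuming y_t = rho * y_{t-1} + e_t with i.i.d. normal e; `rho = 0.7`, the seed and the helper names are illustrative.

```python
import numpy as np

def simulate_ar1(rho, n, rng):
    """Simulate y_t = rho * y_{t-1} + e_t starting from y_0 = 0."""
    e = rng.normal(size=n)
    y = np.zeros(n)
    for t in range(1, n):
        y[t] = rho * y[t - 1] + e[t]
    return y

def sample_autocorr(x, h):
    """Sample autocorrelation of x at lag h (h >= 1)."""
    x = x - x.mean()
    return float(np.dot(x[:-h], x[h:]) / np.dot(x, x))

rng = np.random.default_rng(7)  # arbitrary seed
rho = 0.7
y = simulate_ar1(rho, 100_000, rng)

for h in (1, 2, 5, 10):
    # Sample autocorrelation next to the geometric benchmark rho**h.
    print(h, sample_autocorr(y, h), rho**h)
```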

Answer 144

A time series process where the marginal and all joint distributions are invariant across time.

Answer 145

A process that is stationary once a time trend has been removed; it is usually implicit that the detrended series is weakly dependent.

Answer 146

A highly persistent time series process where the current value equals last period's value, plus a weakly dependent disturbance.

Answer 147

A term that describes a time series process where some measure of dependence between random variables at two points in time, such as correlation, diminishes as the interval between the two points in time increases.

Answer 148

A unit root (also called a unit root process or a difference stationary process) is a stochastic trend in a time series, sometimes called a “random walk with drift”. If a time series has a unit root, it shows a systematic pattern that is unpredictable.

Answer 149

The Dickey-Fuller Test.

(196 cards)