Questions from Course Manual Flashcards
How can we use the standard error to infer statistical significance of a coefficient?
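The standard error measures how much sampling variability “surrounds” a coefficient estimate. Dividing the estimate by its standard error gives the t-statistic. A coefficient is statistically significant at the 5% level if the absolute t-statistic exceeds the critical value (about 1.96 in large samples), or equivalently if the 95% confidence interval (estimate ± 1.96 × standard error) excludes zero. A non-zero estimate alone is not enough.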
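A minimal sketch of this check for a regression with one regressor, assuming homoskedastic errors. The data and the function name simple_ols are made up for illustration, and 1.96 is the large-sample critical value (with so few observations a t-distribution critical value would be more appropriate).

```python
import math

def simple_ols(x, y):
    """Return (slope, standard error of slope, t-statistic) for y = a + b*x + u."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    residuals = [yi - intercept - slope * xi for xi, yi in zip(x, y)]
    # Homoskedasticity-only error variance with n - 2 degrees of freedom.
    sigma2 = sum(e ** 2 for e in residuals) / (n - 2)
    se = math.sqrt(sigma2 / sxx)
    return slope, se, slope / se

# Hypothetical data, roughly y = 2x plus small noise.
x = [1, 2, 3, 4, 5, 6, 7, 8]
y = [2.1, 3.9, 6.2, 7.8, 10.1, 12.2, 13.8, 16.1]

slope, se, t_stat = simple_ols(x, y)
significant = abs(t_stat) > 1.96  # large-sample 5% critical value (assumption)
print(slope, se, t_stat, significant)
```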
How do we need to interpret coefficients when independent variables are dummies?
The coefficient on a dummy variable is always interpreted relative to the reference group (the group for which the dummy equals 0). For example, in a regression of income on a political-affiliation dummy, a positive coefficient means that average income is higher for the group coded 1 than for the reference group, holding any other regressors constant.
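As an illustration, with a single 0/1 dummy as the only regressor, the OLS intercept equals the mean of the reference group and the slope equals the difference in group means. The income figures and the name party_a below are hypothetical.

```python
def ols_intercept_slope(x, y):
    """OLS intercept and slope for y = a + b*x + u."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope

# Hypothetical data: income (in thousands) and a 0/1 affiliation dummy.
income = [30, 34, 32, 40, 44, 42]
party_a = [0, 0, 0, 1, 1, 1]  # 0 = reference group

intercept, slope = ols_intercept_slope(party_a, income)

# Group means computed directly, for comparison with the regression output.
mean_reference = sum(i for i, d in zip(income, party_a) if d == 0) / party_a.count(0)
mean_party_a = sum(i for i, d in zip(income, party_a) if d == 1) / party_a.count(1)

print(intercept, mean_reference)             # intercept vs. reference-group mean
print(slope, mean_party_a - mean_reference)  # slope vs. difference in means
```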
What does controlling for other variables mean? What is the difference with interaction variables?
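Controlling for a variable means including it as a regressor, so that the coefficient of interest is estimated while holding that variable constant. It is an attempt to remove the effect of confounding variables.
An interaction variable is the product of two (or more) of the original variables. Including it allows the effect of one variable on the outcome to depend on the level of the other. A control variable, by contrast, only shifts the outcome and leaves the effect of the variable of interest the same for all observations.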
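The difference can be written out in a small model. The symbols are assumptions for illustration: X is the regressor of interest, D is a second variable (for example a dummy), and the product X × D is the interaction variable. Without the last regressor, D is only a control and the effect of X is the same for everyone; with it, the effect of X depends on D.

```latex
Y_i = \beta_0 + \beta_1 X_i + \beta_2 D_i + \beta_3 (X_i \times D_i) + u_i
\quad\Longrightarrow\quad
\frac{\Delta Y_i}{\Delta X_i} = \beta_1 + \beta_3 D_i
```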
What does the R2 measure and mean?
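R-squared measures the fraction of the variance of the dependent variable that is explained by the regressors. In a regression with an intercept it lies between 0 and 1. A high R-squared does not by itself mean that the coefficients are unbiased or causal.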
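A short sketch of the calculation, using the definition R-squared = 1 - SSR/TSS (sum of squared residuals over total sum of squares). The five data points are made up for illustration.

```python
def r_squared(y, fitted):
    """Share of the variance of y explained by the fitted values."""
    mean_y = sum(y) / len(y)
    ssr = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))  # unexplained part
    tss = sum((yi - mean_y) ** 2 for yi in y)               # total variation
    return 1 - ssr / tss

# Hypothetical data.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

# OLS fit of y on x with an intercept.
n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n
slope = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)) / sum(
    (xi - mean_x) ** 2 for xi in x
)
intercept = mean_y - slope * mean_x
fitted = [intercept + slope * xi for xi in x]

r2 = r_squared(y, fitted)
print(r2)
```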
How can misspecification tests help to assess the validity of your OLS estimates?
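A regression suffers from misspecification of the functional form when the functional form of the estimated regression model differs from the functional form of the population regression function. Functional form misspecification leads to biased and inconsistent coefficient estimators. Misspecification tests, such as the Ramsey RESET test, look for evidence of this problem, for example by testing whether powers of the fitted values add explanatory power. A rejection suggests that the OLS estimates should not be trusted as they stand.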
Under which conditions do omitted variables or reverse causality bias coefficients?
For omitted variable bias to occur, two conditions must be fulfilled:
- X is correlated with the omitted variable
- The omitted variable is a determinant of the dependent variable Y
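The two conditions can be read off the standard omitted variable bias formula. The notation is an assumption for illustration: W is the omitted variable, and the "short" regression is the one of Y on X alone. The bias term is zero if either the covariance of X and W is zero (first condition fails) or β2 is zero (second condition fails).

```latex
Y_i = \beta_0 + \beta_1 X_i + \beta_2 W_i + u_i, \qquad
\hat{\beta}_1^{\text{short}} \xrightarrow{p} \beta_1 + \beta_2 \, \frac{\operatorname{Cov}(X_i, W_i)}{\operatorname{Var}(X_i)}
```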
Reverse causality leads to correlation between X and the error in the population of interest such that the coefficient on X is estimated with bias.
What happens to the estimates when there is measurement error in the dependent variable?
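If there is measurement error in the dependent variable and the measurement error is random (uncorrelated with the regressors), then there is no bias but only an increase in the variance of the estimates. If the error is not random, that is, if it is correlated with the regressors, then the estimates are biased.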
What happens to the estimates when there is measurement error in the independent variable? In which direction does measurement error bias coefficients in this case?
When independent variables are measured imprecisely, we speak of errors-in-variables bias. This bias does not disappear if the sample size is large. If the measurement error has mean zero and is independent of the affected variable, the OLS estimator of the respective coefficient is biased towards zero.
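A small simulation sketch of this attenuation, under assumed values: true coefficient 1, and measurement error with the same variance as the true regressor, so the slope on the noisy regressor should come out near 0.5 rather than 1. The seed, sample size and variable names are arbitrary choices.

```python
import random

def ols_slope(x, y):
    """OLS slope of y on x (with an intercept)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    return sxy / sxx

random.seed(0)
n = 5000
true_beta = 1.0  # assumed true coefficient

x_true = [random.gauss(0, 1) for _ in range(n)]
y = [true_beta * xi + random.gauss(0, 1) for xi in x_true]
# Classical measurement error: mean zero, independent of the true value.
x_noisy = [xi + random.gauss(0, 1) for xi in x_true]

slope_clean = ols_slope(x_true, y)   # close to the true coefficient
slope_noisy = ols_slope(x_noisy, y)  # pulled towards zero
print(slope_clean, slope_noisy)
```

Increasing n does not move slope_noisy back towards 1, which is the sense in which the bias does not disappear in large samples.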
Under which conditions can panel data be used to solve omitted variable bias?
Regression using panel data may mitigate omitted variable bias when there is no information on variables that correlate with both the regressors of interest and the dependent variable, provided these variables are constant over time (so that entity fixed effects absorb them) or constant across entities (so that time fixed effects absorb them).
What is meant by Pooled OLS? When is Pooled OLS appropriate? Can this be tested?
- POLS refers to the application of OLS to panel data. In POLS the data is treated as if it were cross-sectional, ignoring the time effect.
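- POLS is appropriate when the explanatory variables in each time period are uncorrelated with the composite error, that is, with both the entity-specific unobserved effect and the idiosyncratic error (the time-varying part of the error).
- To test for poolability, use the Breusch-Pagan Lagrange multiplier test, which tests whether the variance of the entity-specific effect is zero. This is not the same as the Breusch-Pagan test for heteroskedasticity.
- Heteroskedastic errors do not by themselves imply a correlation between the idiosyncratic error and the explanatory variable x. They affect the standard errors, not the consistency of the coefficients.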
What is meant by a fixed effects estimator?
In panel data, where longitudinal observations exist for the same subject, fixed effects represent the subject-specific means. In panel data analysis the term fixed effects estimator (also known as the within estimator) refers to an estimator for the coefficients in a regression model that includes those fixed effects (one time-invariant intercept for each subject).
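A minimal sketch of the within estimator with one regressor: subtract each subject's own mean from x and y, then run OLS on the demeaned data. The two-subject data set is hypothetical and constructed so that the true slope is 2 while the subject intercepts differ (10 and -20), which makes pooled OLS misleading.

```python
from collections import defaultdict

def ols_slope(x, y):
    """OLS slope of y on x (with an intercept)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    return sxy / sxx

def within_transform(entity, values):
    """Subtract each entity's own mean from its observations."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for e, v in zip(entity, values):
        totals[e] += v
        counts[e] += 1
    return [v - totals[e] / counts[e] for e, v in zip(entity, values)]

# Hypothetical panel: y = 10 + 2x for subject A, y = -20 + 2x for subject B.
entity = ["A", "A", "A", "B", "B", "B"]
x = [1, 2, 3, 6, 7, 8]
y = [12, 14, 16, -8, -6, -4]

pooled_slope = ols_slope(x, y)  # ignores the subject-specific intercepts
fe_slope = ols_slope(within_transform(entity, x), within_transform(entity, y))
print(pooled_slope, fe_slope)
```

In this constructed example the pooled slope is even negative, while the within estimator recovers the slope of 2.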
What is the difference between FE and RE estimator? How can you choose between the FE or RE estimates?
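- The two estimators differ in what they assume about the individual-specific effect.
- The RE assumption is that the individual-specific effects are uncorrelated with the independent variables.
- The FE assumption allows the individual-specific effects to be correlated with the independent variables.
- The Hausman test checks whether FE and RE generate similar results. If the estimates differ significantly, the RE assumption is rejected and FE is preferred; otherwise RE is preferred because it is more efficient.
- The Sargan-Hansen J test is a test of overidentifying restrictions. Applied to the FE versus RE choice, it tests whether the additional RE restriction (no correlation between the individual-specific effect and the independent variables) holds.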
What is meant by a First Difference estimator?
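The first difference (FD) estimator applies OLS to the changes between consecutive periods instead of to the levels. It is used to address the endogeneity caused by time-invariant unobserved heterogeneity: this heterogeneity would leave the estimator biased and inconsistent, but it drops out when first differences are taken.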
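A minimal sketch of first differencing with one regressor. The data are hypothetical (true slope 2, subject intercepts 10 and -20), and the helper first_differences assumes the rows are sorted by subject and then by time. Because the subject intercepts difference out, the differenced regression is run without an intercept here.

```python
def first_differences(entity, values):
    """Differences between consecutive rows that belong to the same entity.

    Assumes rows are sorted by entity and then by time.
    """
    diffs = []
    for i in range(1, len(values)):
        if entity[i] == entity[i - 1]:
            diffs.append(values[i] - values[i - 1])
    return diffs

# Hypothetical panel: y = 10 + 2x for subject A, y = -20 + 2x for subject B.
entity = ["A", "A", "A", "B", "B", "B"]
x = [1, 2, 4, 6, 7, 9]
y = [12, 14, 18, -8, -6, -2]

dx = first_differences(entity, x)
dy = first_differences(entity, y)

# OLS through the origin on the differenced data.
fd_slope = sum(a * b for a, b in zip(dx, dy)) / sum(a * a for a in dx)
print(dx, dy, fd_slope)
```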
What are the criteria for a valid instrumental variable?
- The instrument z should be correlated with the independent variable x
- The instrument z should not be correlated with the error u
- The instrument z affects the dependent variable y only through x.
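A simulation sketch of why these criteria matter, under assumed values: the true coefficient is 1, an unobserved confounder makes x endogenous, and z is constructed to be relevant (it enters x) and exogenous (it is drawn independently of the confounder and the error). With one instrument, the IV estimate is the ratio Cov(z, y) / Cov(z, x). The seed, sample size and names are arbitrary.

```python
import random

def covariance(a, b):
    """Sample covariance of two equally long lists."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    return sum((ai - mean_a) * (bi - mean_b) for ai, bi in zip(a, b)) / (n - 1)

random.seed(1)
n = 5000
true_beta = 1.0  # assumed true effect of x on y

z = [random.gauss(0, 1) for _ in range(n)]            # instrument
confounder = [random.gauss(0, 1) for _ in range(n)]   # unobserved, part of the error
x = [zi + ci + random.gauss(0, 1) for zi, ci in zip(z, confounder)]
y = [true_beta * xi + 2.0 * ci + random.gauss(0, 1) for xi, ci in zip(x, confounder)]

ols_slope = covariance(x, y) / covariance(x, x)  # biased: x correlates with the error
iv_slope = covariance(z, y) / covariance(z, x)   # uses only variation in x driven by z
print(ols_slope, iv_slope)
```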
What does a first stage mean? How do you determine whether an instrument, or a set of instruments, is strong? What is the danger of weak instruments?
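- The first stage is the regression of the endogenous explanatory variable on the instrument(s) and any exogenous controls. The instrument must be correlated with the endogenous explanatory variable; if this correlation is strong, the instrument is said to have a strong first stage. A common rule of thumb is that the first-stage F-statistic on the instruments should exceed 10. With weak instruments the IV estimator is biased towards OLS, and the usual standard errors can give misleading inferences about the parameter estimates.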
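A sketch of a first-stage strength check with a single instrument, where the F-statistic is the squared t-statistic on the instrument in the regression of x on z. It assumes homoskedastic errors, and the first-stage coefficients (1.0 and 0.02), seed and sample size are made-up values. The threshold of 10 is the usual rule of thumb, not a formal test.

```python
import random

def first_stage_f(z, x):
    """F-statistic for one instrument: squared t-statistic of pi in x = a + pi*z + v."""
    n = len(z)
    mean_z = sum(z) / n
    mean_x = sum(x) / n
    szz = sum((zi - mean_z) ** 2 for zi in z)
    szx = sum((zi - mean_z) * (xi - mean_x) for zi, xi in zip(z, x))
    pi_hat = szx / szz
    intercept = mean_x - pi_hat * mean_z
    ssr = sum((xi - intercept - pi_hat * zi) ** 2 for zi, xi in zip(z, x))
    se = (ssr / (n - 2) / szz) ** 0.5  # homoskedasticity-only standard error
    return (pi_hat / se) ** 2

random.seed(2)
n = 1000
z = [random.gauss(0, 1) for _ in range(n)]
x_strong = [1.0 * zi + random.gauss(0, 1) for zi in z]   # z clearly moves x
x_weak = [0.02 * zi + random.gauss(0, 1) for zi in z]    # z barely moves x

f_strong = first_stage_f(z, x_strong)
f_weak = first_stage_f(z, x_weak)
print(f_strong, f_strong > 10)
print(f_weak, f_weak > 10)
```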