Quantitative Methods Flashcards
Sum of Squared Errors (SSE)
Measures the unexplained variation in the dependent variable. It is the sum of the squared vertical differences between the actual values and the predicted values on the regression line.
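A minimal numeric sketch (the data below are made up, not from the cards):

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])   # independent variable (hypothetical)
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])   # dependent variable (hypothetical)

b1, b0 = np.polyfit(x, y, 1)               # slope and intercept of the fitted line
y_hat = b0 + b1 * x                        # predicted values on the regression line
sse = np.sum((y - y_hat) ** 2)             # sum of squared vertical differences
print(sse)
```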
Breusch-Pagan (BP) Chi-Square Test
= n × R²resid, with k degrees of freedom, where R²resid is the R² from a regression of the squared residuals on the independent variables and k is the number of independent variables.
If the statistic exceeds the one-tailed critical value from the chi-square distribution, reject the null and conclude that conditional heteroskedasticity is present.
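A minimal sketch of the computation, assuming `resid` holds the residuals from the original regression and `X` the independent variables (names are illustrative):

```python
import numpy as np
from scipy import stats

def breusch_pagan(resid, X):
    n = len(resid)
    X1 = np.column_stack([np.ones(n), X])              # add an intercept column
    u2 = resid ** 2
    beta, *_ = np.linalg.lstsq(X1, u2, rcond=None)     # regress squared residuals on X
    fitted = X1 @ beta
    r2 = 1 - np.sum((u2 - fitted) ** 2) / np.sum((u2 - u2.mean()) ** 2)
    k = X1.shape[1] - 1                                 # number of independent variables
    bp = n * r2                                         # chi-square with k d.f.
    p_value = 1 - stats.chi2.cdf(bp, df=k)              # one-tailed test
    return bp, p_value
```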
Effect of Serial Correlation
- Positive serial correlation typically results in coefficient standard errors that are too small. This causes the t-statistics to be too large, so the null hypothesis is rejected too often (Type I errors).
- F-test will be unreliable because MSE will be underestimated
F-Statistic
=MSR/MSE
If the F-statistic exceeds the critical F-value, reject the null hypothesis and conclude that at least one of the slope coefficients is significantly different from zero.
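A minimal sketch, assuming the ANOVA sums of squares are already known (parameter names are illustrative):

```python
from scipy import stats

def f_statistic(rss, sse, n, k):
    """rss = regression (explained) sum of squares, sse = sum of squared errors,
    n = number of observations, k = number of independent variables."""
    msr = rss / k                    # mean square regression
    mse = sse / (n - k - 1)          # mean square error
    f = msr / mse
    critical = stats.f.ppf(0.95, dfn=k, dfd=n - k - 1)   # 5% one-tailed critical value
    return f, critical               # reject H0 (all slopes = 0) if f > critical
```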
Unconditional Heteroskedasticity
Occurs when the heteroskedasticity is not related to the level of the independent variables. It usually causes no major problems for the regression.
Conditional Heteroskedasticity
Exists if the variance of the residual term increases as the value of the independent variable increases. Creates significant problems for regression and statistical inference.
Adjusted R^2
=1 - { [ (n-1) / (n-k-1) ] x ( 1-R^2) }
Will always be equal to or less than R^2
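A minimal sketch of the formula with made-up inputs:

```python
def adjusted_r2(r2, n, k):
    """Adjusted R^2 from R^2, sample size n, and k independent variables."""
    return 1 - ((n - 1) / (n - k - 1)) * (1 - r2)

# Example: R^2 = 0.80, n = 60, k = 5  ->  adjusted R^2 ≈ 0.781 (less than R^2)
print(adjusted_r2(0.80, 60, 5))
```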
Root Mean Squared Error (RMSE)
Used to compare the accuracy of autoregressive models in forecasting out-of-sample values. The model with lower RMSE for the out-of-sample data will have lower forecast error and will be expected to have better predictive power in the future.
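A minimal sketch, assuming `actual` and `forecast` hold the out-of-sample values:

```python
import numpy as np

def rmse(actual, forecast):
    actual, forecast = np.asarray(actual), np.asarray(forecast)
    return np.sqrt(np.mean((actual - forecast) ** 2))

# Compare two candidate autoregressive models on the same out-of-sample data:
# the one with the lower RMSE is expected to have the better predictive power.
```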
Correcting Multicollinearity
Omit one or more of the correlated independent variables
Random Walk with a Drift
The time series is expected to increase or decrease by a constant amount each period (b0).
Xt = b0 + Xt-1 + εt (with b0 ≠ 0 and slope coefficient b1 = 1)
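A minimal simulation sketch (the drift and series length are illustrative choices, not from the cards):

```python
import numpy as np

rng = np.random.default_rng(0)
b0, n_periods = 0.5, 100                 # drift per period and series length (hypothetical)
eps = rng.normal(0, 1, n_periods)        # white-noise error term
x = np.zeros(n_periods)
for t in range(1, n_periods):
    x[t] = b0 + x[t - 1] + eps[t]        # each period drifts by b0 on average (b1 = 1)
```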
Correcting Heteroskedasticity
- Calculate robust (White-corrected) standard errors and use them to recompute the t-statistics with the original regression coefficients.
- Use generalized least squares, which modifies the original equation to eliminate the heteroskedasticity.
(If both heteroskedasticity and serial correlation are present, use the Hansen method; see the sketch below.)
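A minimal sketch using statsmodels (the library choice and the simulated data are assumptions, not from the cards):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 2))                                     # hypothetical regressors
y = 1.0 + X @ np.array([0.5, -0.3]) + rng.normal(size=100) * (1 + np.abs(X[:, 0]))

X1 = sm.add_constant(X)
robust = sm.OLS(y, X1).fit(cov_type="HC1")                        # White-corrected (robust) SEs
hac = sm.OLS(y, X1).fit(cov_type="HAC", cov_kwds={"maxlags": 4})  # for heterosk. + serial corr.
print(robust.bse, hac.bse)            # coefficients are unchanged; only the standard errors adjust
```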
Detecting Multicollinearity
Fail to reject the null in the individual t-tests, but reject the null in the F-test while R^2 is high.
Multicollinearity
The condition in which two or more independent variables are highly correlated with each other. This distorts the standard error of the estimate and the coefficient standard errors, leading to unreliable t-tests. Slope coefficients will be unreliable as well.
Serial Correlation
The situation in which the residual terms are correlated with one another. Positive serial correlation exists when a positive regression error in one period increases the probability of observing a positive regression error in the next period. Negative serial correlation exists when a negative regression error in one period increases the probability of observing a negative regression error in the next period.
Durbin-Watson Statistic
Detects the presence of serial correlation.
DW < 2 – Positive serial correlation
DW > 2 – Negative serial correlation
DW = 2 – No serial correlation
DW < dL – Reject null; positive serial correlation
DW > dU – Fail to reject null; no evidence of SC
dL < DW < dU – Test is inconclusive
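A minimal sketch of the statistic itself, computed from the regression residuals:

```python
import numpy as np

def durbin_watson(resid):
    resid = np.asarray(resid)
    # DW = sum((e_t - e_{t-1})^2) / sum(e_t^2)
    return np.sum(np.diff(resid) ** 2) / np.sum(resid ** 2)

# DW ≈ 2(1 - r): values well below 2 suggest positive serial correlation,
# values well above 2 suggest negative serial correlation.
```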