Quantitative Methods Flashcards
Covariance Stationary
Mean and variance don’t change over time
Determined by:
- Plotting data
- Run an AR model and test correlations
- Perform Dickey Fuller Test
Note: Most economic and financial time-series are not stationary
T-Test Significance and Z-Test
90%, 95%, 99%
90%: 1.645
95%: 1.96
99%: 2.326
When can we reject the null hypothesis
If the t-stat is too big
What is covariance
How 2 variances move together
- Very sensitive when only 2 variables
- Can be negative infinite or positive infinite
Increasing Adjusted R² means _______
The added variables are worth keeping
T-Stat Formula
Coefficient / Standard Error
Model Misspecification
Types:
- Time-series: Serial correlation with a lagged variable, or forecasting the past
- Functional: Omitting a variable or data pooled improperly
Correlation Squared Purpose
Explains the variability
Correlation
cov / (std of X * std of Y)
OR
(X - Xbar)(Y-Ybar) / √(X-Xbar)²(Y-Ybar)²
MSE
SSE / n - k - 1
ANOVA Table
Source DOF SOS Mean SOS
Regression(explained) K RSS MSR = RSS/K
Error (unexplained) n-k-1 SSE MSE = SSE/n-k-1
Total n-1 SST
R² from ANOVA
Name: Coefficient of Determination
Formulas: RSS / SST
SST - SSE / SST
Correlation²
Purpose: This is the % of variability of Y explained by X’s
Analysis: The higher the better fit
Problem: Always increase as variables are added
Random Walk
Unit root: coefficient = 1
This means the null (g = 0), cannot be rejected
Does not have a mean reverting level
Not stationary
Correct by first differencing
MSR
RSS / k
RMSE
Purpose: to compare the accuracy of AR models for our-of-sample
Formula: √Average squared error
Analysis: lower the better
Multiple Regression Analysis Steps
- Is there model mispecification
- Is the t-test significant? If no, use another model
- Is the F-Stat significant? If no, use another model
- Check for Hetero, serial correlation, and multicollinearity
When using Dummy Variables
They are either on of off
Always use n-1 or it will suffer from multcollinearity
Example: If using quarters per year, use 3
Log-Linear Trend Model
Purpose: Used when there is exponential growth or there is serial correlation
Formula: y = e^b0 + b1(t)
Use the time/observations for t
Sample Correlation Coefficient Other Formulas
Covariance / (Std X * Std Y)
OR
√R²
OR
Covariance / √X * √Y