Quantitative Methods Flashcards
MSE Formula
SSE / (n-k-1)
AIC
n x ln(SSE/n) + 2(k+1)
Preferred when the goal is a better forecast
BIC
n x ln(SSE/n) + ln(n) x (k+1)
Preferred when the goal is a better goodness of fit
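Both criteria above can be computed directly from SSE, n, and k. A minimal sketch with hypothetical values (n = 60, k = 3, SSE = 120):

```python
import math

def aic(sse, n, k):
    # AIC = n * ln(SSE/n) + 2*(k+1); lower is better
    return n * math.log(sse / n) + 2 * (k + 1)

def bic(sse, n, k):
    # BIC = n * ln(SSE/n) + ln(n)*(k+1); penalizes extra regressors
    # more heavily than AIC whenever n >= 8 (since ln(n) > 2)
    return n * math.log(sse / n) + math.log(n) * (k + 1)

# Hypothetical values: 60 observations, 3 regressors, SSE = 120
print(aic(120.0, 60, 3))
print(bic(120.0, 60, 3))
```

Because ln(n) > 2 once n reaches 8, BIC applies the stiffer penalty per parameter, which is why it tends to select the more parsimonious model.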
F-statistic
MSR/MSE
Tests the null hypothesis that all slope coefficients are simultaneously equal to 0
Reject the null if F > F(critical)
F-statistic joint hypothesis
((SSE_R - SSE_U)/q) / (SSE_U/(n-k-1))
q is number of excluded variables in restricted model
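The restricted-vs-unrestricted F-statistic above can be sketched directly; all numbers here are hypothetical (n = 40 observations, k = 5 regressors in the unrestricted model, q = 2 excluded variables):

```python
def f_joint(sse_r, sse_u, n, k, q):
    # F = ((SSE_R - SSE_U) / q) / (SSE_U / (n - k - 1))
    # sse_r: SSE of the restricted model, sse_u: SSE of the unrestricted model
    # k: regressors in the unrestricted model, q: number of restrictions
    return ((sse_r - sse_u) / q) / (sse_u / (n - k - 1))

# Hypothetical values: dropping the 2 variables raises SSE from 180 to 220
f = f_joint(sse_r=220.0, sse_u=180.0, n=40, k=5, q=2)
```

Compare `f` against the F critical value with q and n-k-1 degrees of freedom; a large F means the excluded variables jointly add explanatory power.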
Dickey-Fuller test
Tests for a unit root to determine whether the data are covariance non-stationary
Durbin-Watson
Test for serial correlation in residuals of trend models but cannot be used in AR models
T-test for AR models
Used to test residuals in AR models
Breusch-Pagan
Used to test for conditional heteroskedasticity
Breusch-Godfrey
Used to test for serial correlation in the residuals (not limited to first order)
White-corrected standard errors
Used to correct for conditional heteroskedasticity
Impact of conditional heteroskedasticity (overestimate)
No effect on coefficient estimate
Std Err of coefficient overestimated
More Type II errors
Impact of conditional heteroskedasticity (underestimate)
No effect on coefficient estimate
Std Err of coefficient underestimated
More Type I errors
Newey-West Standard Errors
Used to correct standard errors for positive serial correlation
Impact of serial correlation
No impact on coefficient estimate
Std Err underestimated
More Type I errors
Dummy variable misspecification
If we use too many dummy variables (e.g., n dummies for n categories instead of n-1), there will be perfect multicollinearity (the dummy variable trap)
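The dummy variable trap above can be demonstrated numerically. A minimal sketch with hypothetical data (6 observations in 3 categories): with an intercept plus one dummy per category, the dummy columns sum to the intercept column, so the design matrix loses a rank; dropping one dummy restores full column rank.

```python
import numpy as np

# Hypothetical data: 6 observations, category labels 0, 1, 2
categories = [0, 0, 1, 1, 2, 2]
n_obs = len(categories)

# Design matrix: intercept column plus one dummy per category (3 dummies)
X_trap = np.ones((n_obs, 4))
for i, c in enumerate(categories):
    for j in range(3):
        X_trap[i, 1 + j] = 1.0 if c == j else 0.0

# Keep intercept and only n-1 = 2 dummies
X_ok = X_trap[:, :3]

# X_trap has 4 columns but rank 3 (dummies sum to the intercept):
# perfect multicollinearity. X_ok has 3 columns and rank 3.
print(np.linalg.matrix_rank(X_trap))
print(np.linalg.matrix_rank(X_ok))
```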
What happens to the coefficients of correlated independent variables when a new correlated variable is added to the model?
Adding the new variable will change the coefficient for the other correlated variables.
What does the intercept term (b0) represent in a multiple linear regression model?
It shows the value of the dependent variable when all independent variables are 0.
What do the slope coefficients (bi) represent in a multiple linear regression model?
They are the estimated changes in the dependent variable for a one-unit change in the corresponding independent variable, holding all other independent variables constant. Also called partial slope coefficients.
What are the assumptions underlying a multiple linear regression model?
1. Linearity between dependent and independent variables. 2. No significant multicollinearity. 3. Expected value of the error is 0. 4. Homoskedasticity (constant error variance). 5. No serial correlation (errors are independent). 6. Errors are normally distributed.
How is the Total Sum of Squares (SST) calculated?
1. Subtract the mean from each individual observation. 2. Square each result. 3. Sum the squared results. Degrees of freedom = n-1.
How is the Regression Sum of Squares (RSS) calculated?
1. Subtract the mean from each predicted value. 2. Square each result. 3. Sum the squared results. Degrees of freedom = k.
How is the Sum of Squared Errors (SSE) calculated?
1. Subtract the predicted value from each observed value. 2. Square each result. 3. Sum the squared results. Degrees of freedom = n-k-1.
What is the formula for R2 (Coefficient of Determination)?
R2 = RSS/SST (Explained Variation / Total Variation).
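The SST/RSS/SSE steps and R2 = RSS/SST can be verified on a small hypothetical dataset. Because the fitted values come from OLS with an intercept, the identity SST = RSS + SSE holds and R2 = RSS/SST = 1 - SSE/SST:

```python
# Hypothetical data: fit simple OLS (y = b0 + b1*x), then decompose variation
x = [1.0, 2.0, 3.0, 4.0]
y = [2.0, 5.0, 6.0, 9.0]
n = len(y)
x_bar = sum(x) / n
y_bar = sum(y) / n

# OLS slope and intercept
b1 = (sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
      / sum((xi - x_bar) ** 2 for xi in x))
b0 = y_bar - b1 * x_bar
y_hat = [b0 + b1 * xi for xi in x]

sst = sum((yi - y_bar) ** 2 for yi in y)               # total variation, df = n-1
rss = sum((fi - y_bar) ** 2 for fi in y_hat)           # explained variation, df = k
sse = sum((yi - fi) ** 2 for yi, fi in zip(y, y_hat))  # unexplained variation, df = n-k-1

r_squared = rss / sst  # explained / total
```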