Machine Learning Flashcards

1
Q

What is Artificial Intelligence (AI)?

A

AI involves machines perceiving their environment and taking actions to achieve human-set goals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Give examples of AI applications mentioned in the lecture.

A

Facebook’s Deepface, Uber’s self-driving cars, Apple Siri, and Google Translate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What distinguishes machine learning from traditional econometrics?

A

ML focuses on outcome prediction, while econometrics emphasizes parameter estimation and hypothesis testing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Name three Machine Learning methods mentioned.

A

Ridge regression, LASSO, and decision trees.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the bias in Machine Learning models?

A

The average error between the true function and the model’s prediction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does the variance measure in Machine Learning models?

A

The variability of the model’s predictions across different datasets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How does model complexity affect bias and variance?

A

High complexity reduces bias but increases variance, leading to overfitting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is overfitting?

A

When a model fits the training data well but performs poorly on new, unseen data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is R-Squared, and how is it calculated?

A

R-Squared measures the goodness-of-fit and is calculated as 1 - RSS/TSS.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why is adjusted R-Squared preferred over R-Squared?

A

Adjusted R-Squared accounts for the number of predictors, penalizing overly complex models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are AIC and BIC?

A

Model selection criteria that penalize model complexity. BIC imposes a stronger penalty.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What do machine learning models penalize besides complexity?

A

Large coefficient estimates to reduce out-of-sample prediction variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does OLS minimize?

A

The sum of squared residuals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How does ridge regression modify OLS?

A

It adds a penalty proportional to the sum of squared coefficients.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How does the LASSO estimator differ from ridge regression?

A

LASSO uses the absolute values of coefficients as a penalty, which can shrink some coefficients to zero.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the penalty parameter Lambda in machine learning?

A

A hyperparameter that controls the tradeoff between model fit and complexity.

17
Q

Name one method to choose Lambda.

A

Cross-validation.

18
Q

What is cross-validation?

A

A method to assess model performance by dividing the dataset into training and validation subsets.

19
Q

What are the steps in K-fold cross-validation?

A

Split the dataset into K subsets, train on K - 1 folds, test on the hold-out fold, and repeat.

20
Q

Why is cross-validation used?

A

To prevent overfitting and select optimal model parameters.

21
Q

What is a decision tree in machine learning?

A

A non-linear model that splits data into subsets based on predictor values.

22
Q

How does a decision tree make predictions?

A

By averaging outcomes within each terminal node.

23
Q

What is recursive partitioning?

A

Splitting the data repeatedly to form a tree structure.

24
Q

How is the depth of a decision tree selected?

A

Using a hold-out sample or cross validation.

25
Q

Name two financial applications of machine learning.

A

Predicting borrower default probability and selecting assets for investment.

26
Q

What is an example dataset for machine learning in housing markets?

A

The American Housing Survey with 150 variables on house and location characteristics.

27
Q

How do machine learning models compare to OLS in out-of-sample performance?

A

Machine learning models often outperform OLS due to better handling of complex relationships.

28
Q

What is the focus of OLS?

A

Estimation of parameters with desirable statistical properties.

29
Q

What is the focus of machine learning?

A

Accurate prediction of outcomes, often at the expense of interpretability.

30
Q

Can machine learning be used for causal inference?

A

No, machine learning is designed for prediction, not establishing causation.

31
Q

How does LASSO handle irrelevant predictors?

A

By shrinking their coefficients to zero, effectively excluding them from the model.

32
Q

What is the main difference between ridge regression and LASSO?

A

Ridge regression includes all predictors but shrinks coefficients, while LASSO can exclude predictors.

33
Q

Why is machine learning increasingly important in finance?

A

For applications like trading algorithms, asset management, and credit risk assessment.

34
Q

What advantage does machine learning provide in predictive modeling?

A

It can handle high-dimensional data and complex relationships better than traditional methods.

35
Q

Why do we still need traditional econometric methods?

A

To estimate causal relationships and understand underlying mechanisms.