7.2 - Supervised Learning Algorithms Flashcards by Sam Patterson

______ _______ models reduce the problem of ______ by imposing a penalty based on the number of features (i.e., _________ variables) used by the model.

Penalized regression;
overfitting;
independent;

How well did you know this?

Not at all

Perfectly

In a penalized regression model, the penalty value _________ with the number of independent variables (i.e., ______) used.

increases;

features;

How well did you know this?

Not at all

Perfectly

Imposing a penalty on the number of features makes a model more _________.

parsimonious;

How well did you know this?

Not at all

Perfectly

Penalized regression models seek to minimize the sum __ _____ _____ (___), as well as the magnitude of the _____ _____.

of squared errors (SSE);

penalty value;

How well did you know this?

Not at all

Perfectly

LASSO stands for what?

Least absolute shrinkage and selection operator.

How well did you know this?

Not at all

Perfectly

In addition to minimizing SSE, LASSO minimizes what?

The sum of the absolute values of the slope coefficients.

How well did you know this?

Not at all

Perfectly

LASSO automatically eliminates what?

The least predictive features of a regression model.

How well did you know this?

Not at all

Perfectly

In a LASSO model, the penalty term, referred to by the Greek letter ______, is the __________ that determines the balance between _________ the model and keeping it _________.

lambda;
hyperparameter;
overfitting;
parsimonious;

How well did you know this?

Not at all

Perfectly

In a LASSO model, you want both the _____ and _______ (the value of the penalty) to be low.

SSE;

lambda;

How well did you know this?

Not at all

Perfectly

A method related to LASSO that is used to reduce statistical variability in a high dimension data estimation problem is referred to as __________. This method forces the ______ ________ of nonperforming features toward _______.

regularization;
beta coefficients;
zero;

How well did you know this?

Not at all

Perfectly

What does SVM stand for?

support vector machine

How well did you know this?

Not at all

Perfectly

SVM is a linear classification algorithm that does what?

SVM separates the data into one of two possible classifiers (e.g., sell vs. buy).

How well did you know this?

Not at all

Perfectly

What does KNN stand for?

K-nearest neighbor.

How well did you know this?

Not at all

Perfectly

KNN is more commonly used in _________ (but sometimes in ________).

classification;

regression;

How well did you know this?

Not at all

Perfectly

KNN is used to classify an observation based on _________ to the observations in the __________ ________.

nearness;

training sample;

How well did you know this?

Not at all

Perfectly

In KNN, the researcher specifies the value of __, the __________, triggering the algorithm to look for the __ _________ in the sample that are closest to the new observation that is being _____________.

Study These Flashcards

k;
hyperparameter;
k observations;
classified;

As an example of KNN, if k=5, the algorithm will look for what?

Study These Flashcards

The 5 nearest neighbors i.e., the 5 most similar observations).

What are some investment applications of KNN?

Study These Flashcards

Predicting bankruptcy; assigning a bond to a ratings class; predicting stock prices; creating customized indices;

What does CART stand for?

Study These Flashcards

Classification and regression trees.

Classification trees (aka “____”) are appropriate when the ______ _____ is __________, and are typically used when the target is ________ (e.g., an IPO will be successful vs. not successful).

Study These Flashcards

CART;
target variable;
categorical;
binary;

Classification trees contain two types of _____: (1) __________ ____ and (2) ________ _____. They repeatedly divide the data until a _______ _____ is reached becomes the end of each branch.

Study These Flashcards

nodes;
decision nodes;
terminal nodes;
terminal;

What are some investment applications of CART?

Study These Flashcards

detecting fraudulent financial statements (e.g., these are likely manipulators vs. these are likely not manipulators); selecting stocks and bonds;

________ _______ is the technique of combining predictions from multiple models rather than from a single model.

Study These Flashcards

Ensemble learning;

The purpose of using _______ ________ models is that an individual model will have a certain error rate and will make _____ _________. But by ________ predictions from many models, you can reduce the _______.

Study These Flashcards

ensemble learning;
noisy predictions;
averaging;
noise;

In "________", the ________ training set is used to generate "__" training data sets or "____" of data. Each new ___ is generated by ______ ________ with ___________. _________ helps to improve the stability of predictions and protects against overfitting the model.

``` bagging; original; bags; random sampling; replacement; Bagging: ```

_______ _______ is a variant of classification trees whereby a large number of classification trees are trained using data __________ from the same data set.

Random forest; | bagged;

7.2 - Supervised Learning Algorithms Flashcards

(26 cards)