ML-06 - Model and feature selection Flashcards

1
Q

ML-06 - Model and feature selection

How do you use validation data to check if your problem is due to high bias vs. high variance?

A

(See image)

Compare the training error with the validation (cross-validation) error: if both are high and close together, the problem is high bias; if the training error is low but the validation error is much higher, the problem is high variance.
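A minimal scikit-learn sketch of this check (not part of the original card; the dataset and model are placeholder choices):

```python
# Sketch: compare training error with validation error to tell high bias from high variance.
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

# Placeholder data and model; any estimator with fit/predict works the same way.
X, y = make_regression(n_samples=500, n_features=20, noise=10.0, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.3, random_state=0)

model = LinearRegression().fit(X_train, y_train)
train_err = mean_squared_error(y_train, model.predict(X_train))
val_err = mean_squared_error(y_val, model.predict(X_val))

# High bias:     both errors are high (and close together)           -> underfitting
# High variance: training error low, validation error much higher    -> overfitting
print(f"train MSE: {train_err:.1f}, validation MSE: {val_err:.1f}")
```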

2
Q

ML-06 - Model and feature selection

Which area is high bias and which is high variance?

A

(See image)

The area where both training and validation error are high is the high-bias (underfitting) area; the area where training error is low but validation error is high is the high-variance (overfitting) area.

3
Q

ML-06 - Model and feature selection

If you have high bias, is your model underfit/overfit?

A

Underfit

4
Q

ML-06 - Model and feature selection

If you have high variance, is your model underfit/overfit?

A

Overfit

5
Q

ML-06 - Model and feature selection

If your model is underfit, do you have high bias or variance?

A

High bias

6
Q

ML-06 - Model and feature selection

If your model is overfit, do you have high bias or variance?

A

High variance

7
Q

ML-06 - Model and feature selection

What is the first tool to try for overfitting problems?

A

Regularization

8
Q

ML-06 - Model and feature selection

What does regularization prevent?

A

Overfitting.

9
Q

ML-06 - Model and feature selection

Describe bias and variance as a function of the regularization lambda parameter.

A

Large lambda penalizes the model heavily, giving high bias (underfitting); small lambda barely regularizes, giving high variance (overfitting). The best lambda lies in between, where the validation error is lowest.
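A hedged sketch of how this could be measured, assuming scikit-learn's Ridge (where lambda is called alpha); the dataset is a placeholder:

```python
# Sketch: sweep the regularization strength and watch bias/variance trade off.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge
from sklearn.model_selection import validation_curve

X, y = make_regression(n_samples=200, n_features=30, noise=15.0, random_state=0)
alphas = np.logspace(-3, 4, 8)  # lambda is called "alpha" in scikit-learn

train_scores, val_scores = validation_curve(
    Ridge(), X, y, param_name="alpha", param_range=alphas,
    scoring="neg_mean_squared_error", cv=5,
)

for a, tr, va in zip(alphas, -train_scores.mean(axis=1), -val_scores.mean(axis=1)):
    # small alpha: low train error, higher validation error (high variance)
    # large alpha: both errors high (high bias)
    print(f"alpha={a:10.3f}  train MSE={tr:10.1f}  val MSE={va:10.1f}")
```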
10
Q

ML-06 - Model and feature selection

Describe how the error vs. training set size looks for a situation with a good bias/variance trade-off.

A

(See image)

Training error rises and validation error falls as the training set grows; the two curves converge towards each other at a low error with only a small gap.

11
Q

ML-06 - Model and feature selection

Describe how the error vs. training set size looks for a situation with high bias.

A

(See image)

Both curves flatten out quickly at a high error and end up close together; adding more training data does not help much.

12
Q

ML-06 - Model and feature selection

Describe how the error vs. training set size looks for a situation with high variance.

A

(See image)

There is a large gap between a low training error and a high validation error; the gap narrows only slowly as the training set grows, so more data is likely to help.
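A sketch covering all three learning-curve cases above, assuming scikit-learn's learning_curve; the dataset and estimator are placeholder choices:

```python
# Sketch: performance vs. training-set size via scikit-learn's learning_curve.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import learning_curve

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)

sizes, train_scores, val_scores = learning_curve(
    LogisticRegression(max_iter=1000), X, y,
    train_sizes=np.linspace(0.1, 1.0, 5), cv=5, scoring="accuracy",
)

for n, tr, va in zip(sizes, train_scores.mean(axis=1), val_scores.mean(axis=1)):
    # good trade-off: curves converge with a small gap at low error
    # high bias:      curves converge quickly at a high error (more data won't help)
    # high variance:  persistent large gap between train and validation performance
    print(f"n={int(n):4d}  train acc={tr:.3f}  val acc={va:.3f}")
```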

13
Q

ML-06 - Model and feature selection

What should you try if you have high variance? (3)

A
  • Get more data
  • Smaller sets of features (or smaller NN)
  • Try increasing regularization lambda
14
Q

ML-06 - Model and feature selection

What should you try if you have high bias? (3)

A
  • Get more features
  • Feature engineering, add polynomial features
  • Try decreasing regularization lambda
15
Q

ML-06 - Model and feature selection

What are the 3 steps of the ML design guideline?

A

1) Start with a small model (baseline) that’s quick to implement.
2) Decide whether more data or more features will help (guided by learning curves).
3) Perform error analysis: manually examine samples where the model made errors.

16
Q

ML-06 - Model and feature selection

In the ML design guideline, how do you perform error analysis? (3)

A
  • Look at the samples your model predicted wrongly
  • Look for systematic trends in the types of errors made
  • Hypothesize what cues (features) could have helped
17
Q

ML-06 - Model and feature selection

What is feature selection in ML?

A

Selecting the features that are actually needed and discarding those that are not, e.g. because they are redundant or not correlated with the labels (such as an ID column).

18
Q

ML-06 - Model and feature selection

What is the goal of feature selection?

A

Find an optimal set of features that results in a “best model” for a problem.

19
Q

ML-06 - Model and feature selection

For feature selection, what are the 4 classes of feature selection mentioned?

A
  • Filter methods
  • Wrapper methods
  • Embedded methods
  • Dimensionality reduction methods
20
Q

ML-06 - Model and feature selection

Describe the gist of feature selection filter methods.

A

From the set of all features, select a subset based on some selection criterion, e.g. the correlation coefficient.

21
Q

ML-06 - Model and feature selection

If you use a selection criterion to reduce a set of features to a subset, what type of feature selection method is that?

A

Filter methods

22
Q

ML-06 - Model and feature selection

What does the correlation coefficient measure?

A

The strength of the linear relationship between two variables, e.g. between two features, or between a feature and the output label.

23
Q

ML-06 - Model and feature selection

How would you use the correlation coefficient to filter out unnecessary features?

A

If two features are strongly correlated with each other, one of them is redundant, so you don’t need both; features with near-zero correlation to the label can also be dropped.
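A small illustrative sketch with pandas; the column names and values are made up for illustration:

```python
# Sketch: use pairwise correlations to spot redundant and irrelevant features.
import pandas as pd

df = pd.DataFrame({
    "size_m2":    [30, 45, 60, 80, 100, 120],
    "size_ft2":   [323, 484, 646, 861, 1076, 1292],  # redundant: ~perfectly correlated with size_m2
    "rooms":      [1, 2, 2, 3, 4, 5],
    "listing_id": [104, 101, 106, 102, 105, 103],    # ID-like column: no real relation to the label
    "price":      [100, 150, 210, 280, 330, 400],    # the label
})

corr = df.corr()
print(corr["price"].sort_values(ascending=False))  # features with ~0 correlation to the label are drop candidates
print(corr.loc["size_m2", "size_ft2"])             # ~1.0 -> keep only one of the two
```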

24
Q

ML-06 - Model and feature selection

Describe how wrapper methods work.

A

Search over feature subsets, looping until a stopping criterion is met:
  • Generate a candidate feature subset
  • Test the model’s performance with that subset
  • Select the best performer

25
Q

ML-06 - Model and feature selection

What are the most commonly used wrapper methods? (4)

A
  • Exhaustive feature selection
  • Forward/sequential feature selection
  • Backward feature elimination
  • Recursive feature elimination (RFE)
26
Q

ML-06 - Model and feature selection

Describe exhaustive feature selection.

A

Try every possible subset of features and use the one that performs best.
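A rough sketch of exhaustive selection with itertools and cross-validation (only feasible for small feature counts, since the number of subsets grows exponentially); the dataset and estimator are placeholder choices:

```python
# Sketch: exhaustive feature selection = score every non-empty feature subset, keep the best.
from itertools import combinations

from sklearn.datasets import load_diabetes
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

X, y = load_diabetes(return_X_y=True)  # 10 features -> 1023 subsets, still tractable
n_features = X.shape[1]

best_score, best_subset = -float("inf"), None
for k in range(1, n_features + 1):
    for subset in combinations(range(n_features), k):
        score = cross_val_score(LinearRegression(), X[:, list(subset)], y, cv=5).mean()
        if score > best_score:
            best_score, best_subset = score, subset

print(f"best subset: {best_subset}, cross-validated R^2: {best_score:.3f}")
```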

27
Q

ML-06 - Model and feature selection

Describe forward/sequential feature selection.

A

An iterative method that starts with a single feature and then adds one feature at a time, keeping the addition that improves performance the most.
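A sketch assuming scikit-learn's SequentialFeatureSelector; the estimator and n_features_to_select are arbitrary choices here, and direction="backward" would give backward feature elimination instead:

```python
# Sketch: forward (sequential) feature selection with scikit-learn.
from sklearn.datasets import load_diabetes
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.linear_model import LinearRegression

X, y = load_diabetes(return_X_y=True)

sfs = SequentialFeatureSelector(
    LinearRegression(),
    n_features_to_select=4,   # arbitrary target size for the illustration
    direction="forward",      # "backward" starts from all features and removes instead
    cv=5,
)
sfs.fit(X, y)
print("selected feature indices:", sfs.get_support(indices=True))
```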

28
Q

ML-06 - Model and feature selection

Describe backward feature elimination

A

Start with all features and iteratively remove the worst-performing ones.

29
Q

ML-06 - Model and feature selection

Describe recursive feature elimination (RFE)

A

A type of backward feature elimination that removes the weakest feature (or features) recursively, one at a time, until the specified number of features is reached.
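A sketch assuming scikit-learn's RFE and RFECV implementations; the estimator and feature counts are placeholder choices:

```python
# Sketch: RFE stops at a fixed number of features; RFECV lets cross-validation choose it.
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import RFE, RFECV
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)
X = StandardScaler().fit_transform(X)          # scale so the logistic regression converges

est = LogisticRegression(max_iter=1000)        # must expose coef_ (or feature_importances_)
rfe = RFE(est, n_features_to_select=10, step=1).fit(X, y)
rfecv = RFECV(est, step=1, cv=5).fit(X, y)

print("RFE keeps:  ", rfe.get_support(indices=True))
print("RFECV keeps:", rfecv.get_support(indices=True))
```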

30
Q

ML-06 - Model and feature selection

What is RFE short for?

A

Recursive Feature Elimination

31
Q

ML-06 - Model and feature selection

What is RFECV short for?

A

Recursive Feature Elimination with Cross-Validation

32
Q

ML-06 - Model and feature selection

What are embedded methods?

A

Methods where feature selection happens during model training itself: in each iteration, the method extracts (keeps) the features that contribute the most to the model.

33
Q

ML-06 - Model and feature selection

Describe how regularization is used for feature selection.

A

L1 (lasso) - Penalizes feature weights and promotes sparsity: some weights are driven to exactly zero, so those features are not used at all.

L2 (ridge) - Keeps all features, but shrinks their weights, assigning each a relative importance to improve generalization.
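A sketch of the L1-vs-L2 behaviour using scikit-learn's Lasso, Ridge and SelectFromModel; the alpha values are arbitrary:

```python
# Sketch: L1 zeroes out some coefficients (sparsity); L2 shrinks but keeps all of them.
import numpy as np
from sklearn.datasets import load_diabetes
from sklearn.feature_selection import SelectFromModel
from sklearn.linear_model import Lasso, Ridge

X, y = load_diabetes(return_X_y=True)

lasso = Lasso(alpha=1.0).fit(X, y)
ridge = Ridge(alpha=1.0).fit(X, y)
print("Lasso coefficients at exactly zero:", int(np.sum(lasso.coef_ == 0)))  # several
print("Ridge coefficients at exactly zero:", int(np.sum(ridge.coef_ == 0)))  # typically none

# Using the L1 model's sparsity pattern as a feature selector:
selector = SelectFromModel(Lasso(alpha=1.0)).fit(X, y)
print("features kept by L1 selection:", selector.get_support(indices=True))
```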

34
Q

ML-06 - Model and feature selection

Describe how dimensionality reduction methods are used for feature selection.

A

They analyse the input features and project them into a lower-dimensional space, i.e. they transform the inputs into a new, smaller set of derived features rather than selecting a subset of the original ones.
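A sketch of one common dimensionality reduction method, PCA (named in the next card), using scikit-learn; the dataset and number of components are placeholder choices:

```python
# Sketch: PCA projects 4 original features onto 2 new derived features.
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

X, _ = load_iris(return_X_y=True)
X_scaled = StandardScaler().fit_transform(X)   # PCA is sensitive to feature scale

pca = PCA(n_components=2)
X_reduced = pca.fit_transform(X_scaled)

print("original shape:", X.shape)                                   # (150, 4)
print("reduced shape: ", X_reduced.shape)                           # (150, 2)
print("explained variance ratio:", pca.explained_variance_ratio_)   # variance captured per component
```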

35
Q

ML-06 - Model and feature selection

What are 3 common dimensionality reduction methods?

A
  • PCA (Principal Component Analysis)
  • SVD (Singular Value Decomposition)
  • LDA (Linear Discriminant Analysis)
36
Q

ML-06 - Model and feature selection

What is PCA short for?

A

Principal Component Analysis

37
Q

What is SVD short for?

A

Singular Value Decomposition

38
Q

What is LDA short for?

A

Linear Discriminant Analysis