ML - Bias vs. Variance Flashcards

1
Q

Explain each component of this equation:

Y = f(X) + e

A

We know that we want to find a function f(X) to predict Y:

Y = f(X) + e

Where e is the prediction error term and it’s normally distributed with a mean of 0.

2
Q

What is the expected squared error of the following equation?

Y = f(X) + e

A

The expected squared error at a point x is the average squared difference between the true value and our prediction:

Err(x) = E[ (Y - f̂(x))² ]

where f̂(x) is our estimate of the target function f(x).
3
Q

Decompose the expected squared error, Err(x) = E[ (Y - f̂(x))² ], into its 3 main error components:

A

Prediction error can be broken down into 3 parts (see the decomposition below):

  1. Bias Error
  2. Variance Error
  3. Irreducible Error = noise in the data; may be caused by unknown variables that influence the mapping of input variables to output variables
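A sketch of the decomposition in the same notation as the earlier cards, where f̂(x) is the model's estimate of f(x) (fit on a random training set) and σ²_e is the variance of the error term e:

  Err(x) = E[ (Y - f̂(x))² ]
         = ( E[f̂(x)] - f(x) )²  +  E[ ( f̂(x) - E[f̂(x)] )² ]  +  σ²_e
         =        Bias²         +         Variance             +  Irreducible Error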
4
Q

Draw the picture with 4 targets and label each one with High / Low Bias and High / Low Variance…which target is considered underfitting / overfitting?

A

The four targets (bullseye = the true value, dots = a model's predictions over different training sets):

  1. Low bias + low variance: shots tightly clustered on the bullseye (the ideal)
  2. Low bias + high variance: shots centered on the bullseye but widely scattered (overfitting)
  3. High bias + low variance: shots tightly clustered but off-center (underfitting)
  4. High bias + high variance: shots both scattered and off-center (worst case)
5
Q

Draw 3 scatterplot diagrams.

Which one is overfitting / underfitting / just right?

Which one is high variance / high bias / low variance+bias?

A

  1. Underfitting: a straight line drawn through clearly curved data = high bias
  2. Just right: a smooth curve that follows the overall trend = low bias + low variance
  3. Overfitting: a wiggly curve that chases nearly every point = high variance
6
Q

Plot Error vs Model Complexity.

Draw 3 curves, one each representing Total Error / Variance / Bias².

A

As model complexity increases, Bias² falls while Variance rises. Total Error = Bias² + Variance (+ Irreducible Error) is U-shaped, with its minimum at the optimal model complexity: the sweet spot between underfitting (left of the minimum) and overfitting (right of the minimum).
7
Q

Bias adds / subtracts terms from the model to make the target function easier to learn.

A

Bias refers to the simplifying assumptions (subtracting terms) made by the model to make the target function easier to learn.

8
Q

Bias makes models fast / slow.

A

Bias makes models fast (or simpler).

9
Q

Bias makes models more simple / complex.

A

Bias makes models more simple.

10
Q

Bias leads to overfitting / underfitting of the training data.

A

Bias leads to underfitting of the training data.

11
Q

Bias leads to low / high error on the training + test data.

A

Bias leads to high error on the training + test data.

12
Q

Bias can occur because of high / low # of parameters.

A

Bias can occur because of low # of parameters.

13
Q

Bias can occur because of high / low amount of training data.

A

Bias can occur because of low amount of training data.

14
Q

Bias can occur because of fitting a ______ function to _____ data.

A

Bias can occur because of fitting a linear function to non-linear data.

15
Q

More assumptions made about the target function lead to high / low bias.

A

More assumptions made about the target function lead to high bias.

16
Q

Fewer assumptions made about the target function lead to high / low bias.

A

Fewer assumptions made about the target function lead to low bias.

17
Q

Models with low bias are:

A

Models with low bias are:

  1. Decision Trees
  2. k-Nearest Neighbors
  3. SVMs
18
Q

Models with high bias are:

A

Models with high bias are:

  1. Linear / Logistic Regression
19
Q

What is Variance?

A

Variance is the amount that the estimate of the target function will change if different training data are used. Ideally, we don’t want the target function to change too much from one training set to the next.
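A minimal Python sketch of this definition, assuming scikit-learn and NumPy are available (the noisy sine-curve data and decision-tree model are illustrative choices, not from the deck): refit the same model on several resampled training sets and see how much its prediction at one fixed point moves around.

  import numpy as np
  from sklearn.tree import DecisionTreeRegressor

  rng = np.random.default_rng(0)

  # Synthetic data: a noisy sine curve (illustrative only).
  X = np.linspace(0, 6, 200).reshape(-1, 1)
  y = np.sin(X).ravel() + rng.normal(scale=0.3, size=200)

  x_query = np.array([[3.0]])  # fixed point where we inspect the prediction
  preds = []

  # Refit the same flexible (high-variance) model on 100 bootstrap resamples.
  for _ in range(100):
      idx = rng.integers(0, len(X), size=len(X))
      model = DecisionTreeRegressor().fit(X[idx], y[idx])
      preds.append(model.predict(x_query)[0])

  # The spread of these predictions across training sets is the variance in question.
  print("std of prediction across training sets:", np.std(preds))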

20
Q

Models with high / low variance are strongly influenced by the specifics of the training data.

A

Models with high variance are strongly influenced by the specifics of the training data.

21
Q

Another word for high variance is _______ to the data.

A

Another word for high variance is sensitivity to the data.

22
Q

High variance models are flexible / rigid.

A

High variance models are flexible.

(many weights / parameters that get tuned when learning)

23
Q

High variance models are ________ to the training data.

A

High variance models are sensitive to the training data.

24
Q

Models with high variance underfit / overfit to the training data.

A

Models with high variance overfit to the training data.

25
Q

High variance models do / do not generalize well to unseen data.

A

High variance models do not generalize well to unseen data.

26
Q

High variance models do well / do not do well on:

  1. training data
  2. test data
A

High variance models:

  1. Do well on training data
  2. Do not do well on test data
27
Q

Models where changes to the training data cause big changes in the estimate of the target function are high / low variance models.

A

Models where changes to the training data cause big changes in the estimate of the target function are high variance models.

28
Q

Models where changes to the training data cause small changes in the estimate of the target function are high / low variance models.

A

Models where changes to the training data cause small changes in the estimate of the target function are low variance models.

29
Q

Examples of low variance models are:

A

Examples of low variance models are:

  1. Linear / Logistic Regression
  2. Naive Bayes
30
Q

Examples of high variance models are:

A

Examples of high variance models are:

  1. Decision Trees
  2. k-Nearest Neighbors
  3. SVMs
  4. NNs
31
Q

Does not generalize + model overfits to training data is…

high / low bias + high / low variance

A

Does not generalize + model overfits to training data is…

low bias + high variance

32
Q

Does not capture the true relationship between predictors and target variable + model is underfitting training data is…

high / low bias + high / low variance

A

Does not capture the true relationship between predictors and target variable + model is underfitting training data is…

high bias + low variance

33
Q

This picture represents…

high / low bias + high / low variance

A

low bias + high variance

34
Q

This picture represents…

high / low bias + high / low variance

A

This picture represents…

high bias + low variance

35
Q

Name 6 possible solutions to fix a low bias + high variance model:

A

Name 6 possible solutions to fix a low bias + high variance model:

  1. Choose Simpler Model
  2. Feature Selection (reduce # of features)
  3. Dimensionality Reduction
  4. Regularize (penalize model complexity)
  5. Bagging + Resampling techniques
  6. Training on larger dataset
36
Q

Name 2 possible solutions to fix a high bias + low variance model:

A

Name 2 possible solutions to fix a high bias + low variance model:

  1. Add more features
  2. Make model more flexible / sensitive
37
Q

What do we do near the end of our modeling step to check whether or not our model has high bias / variance?

A

We do k-fold Cross-Validation.

We do this because, to evaluate the performance of our model, we need to test it on unseen data. This tells us how well our model generalizes and whether we over- or under-fit.

We do this by repeatedly splitting our data into training and validation folds.

38
Q

Explain how picking the size of k in k-fold cross validation affects bias / variance.

A

In k-fold cross-validation, k is the number of splits of the training data. A higher value of k leads to less bias but higher variance in the error estimate; a lower value of k leads to more bias but lower variance.

In effect, making k small (the minimum is k = 2) means each fold's model is trained on only a fraction of the data, so the performance estimate is overly pessimistic (high bias, low variance).

Bumping up k toward n (leave-one-out) trains each fold's model on nearly the full training set, which makes the estimate more sensitive to that particular data (less bias, higher variance).
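A minimal scikit-learn sketch of this trade-off (the diabetes dataset, linear model, and k values are illustrative assumptions, not part of the deck): as k grows, each training fold gets closer to the full training set, but the per-fold scores scatter more.

  from sklearn.datasets import load_diabetes
  from sklearn.linear_model import LinearRegression
  from sklearn.model_selection import cross_val_score

  X, y = load_diabetes(return_X_y=True)
  model = LinearRegression()

  # Small k: each fold's model sees only a fraction of the data (more biased estimate).
  # Large k: each fold's model sees almost all of the data (less bias, noisier scores).
  for k in (2, 5, 10):
      scores = cross_val_score(model, X, y, cv=k, scoring="r2")
      print(f"k={k:2d}  mean R^2 = {scores.mean():.3f}  std = {scores.std():.3f}")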

39
Q

Regression leads to a high / low variance model.

What can we do to this to fix it?

A

Regression leads to a high variance model.

Regression can be regularized to reduce model complexity. We do this by adding a penalty term for model complexity.

This reduces variance.
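A minimal sketch of this with scikit-learn, using ridge regression as one common regularizer (the dataset and the penalty strength alpha are illustrative assumptions):

  from sklearn.datasets import load_diabetes
  from sklearn.linear_model import LinearRegression, Ridge
  from sklearn.model_selection import cross_val_score

  X, y = load_diabetes(return_X_y=True)

  # Plain least squares vs. ridge, which adds an L2 penalty (alpha * ||w||²)
  # on the coefficients to shrink model complexity and therefore variance.
  for name, model in [("OLS", LinearRegression()), ("Ridge", Ridge(alpha=1.0))]:
      scores = cross_val_score(model, X, y, cv=5, scoring="r2")
      print(f"{name:5s}  mean R^2 = {scores.mean():.3f}  std = {scores.std():.3f}")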

40
Q

Decision trees are usually a high / low variance model.

We can fix this by doing what?

A

Decision trees are usually a high variance model.

Decision Trees can be pruned to reduce model complexity. This reduces variance.
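A minimal sketch of pruning with scikit-learn, here by capping tree depth and by cost-complexity pruning via ccp_alpha (the dataset and both values are illustrative assumptions):

  from sklearn.datasets import load_breast_cancer
  from sklearn.model_selection import cross_val_score
  from sklearn.tree import DecisionTreeClassifier

  X, y = load_breast_cancer(return_X_y=True)

  trees = {
      "unpruned":       DecisionTreeClassifier(random_state=0),
      "max_depth=3":    DecisionTreeClassifier(max_depth=3, random_state=0),
      "ccp_alpha=0.01": DecisionTreeClassifier(ccp_alpha=0.01, random_state=0),
  }

  # Pruned trees are simpler, so their scores vary less across folds (lower variance).
  for name, tree in trees.items():
      scores = cross_val_score(tree, X, y, cv=5)
      print(f"{name:15s}  mean acc = {scores.mean():.3f}  std = {scores.std():.3f}")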

41
Q

k-NN models are usually high / low bias + high / low variance models.

How can we fix this?

A

k-NN has low bias + high variance, but we can change this by increasing the value of k, which increases the number of neighbors that contribute to the prediction (and therefore increases the bias); see the sketch below.

  • large k = simpler model = underfit = low variance & high bias
  • small k = more complex model = overfit = high variance & low bias
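A minimal sketch of this effect with scikit-learn (the dataset and the k values are illustrative assumptions): compare training-fold and validation-fold accuracy as k grows.

  from sklearn.datasets import load_breast_cancer
  from sklearn.model_selection import cross_validate
  from sklearn.neighbors import KNeighborsClassifier

  X, y = load_breast_cancer(return_X_y=True)

  # Small k: nearly memorizes the training folds (low bias, high variance).
  # Large k: smoother decision boundary, training accuracy drops (higher bias, lower variance).
  for k in (1, 5, 25, 101):
      cv = cross_validate(KNeighborsClassifier(n_neighbors=k), X, y,
                          cv=5, return_train_score=True)
      print(f"k={k:3d}  train = {cv['train_score'].mean():.3f}"
            f"  validation = {cv['test_score'].mean():.3f}")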
42
Q

For k-NNs, when k increases to infinity, our model becomes more / less complex.

This leads to over / under -fitting.

This leads to high / low bias.

This leads to high / low variance.

A

For k-NNs, when k increases to infinity, our model becomes less complex.

This leads to under-fitting.

This leads to high bias.

This leads to low variance.

In the limit, every test data point is assigned to the same class: the majority class of the training data. Conversely, if the "granularity" is too fine (k is very small), outliers and noise in the training data dominate the decision process.

43
Q

Bagging / Boosting is an ensemble method that aggregates models in parallel.

A

Bagging is an ensemble method that aggregates models in parallel.

44
Q

Bagging / Boosting is an ensemble method that aggregates models in sequential order.

A

Boosting is an ensemble method that aggregates models in sequential order.
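A minimal scikit-learn sketch of the two ideas side by side (the dataset and estimator counts are illustrative assumptions; both classes default to decision-tree base learners):

  from sklearn.datasets import load_breast_cancer
  from sklearn.ensemble import AdaBoostClassifier, BaggingClassifier
  from sklearn.model_selection import cross_val_score

  X, y = load_breast_cancer(return_X_y=True)

  ensembles = {
      # Parallel: each tree is trained independently on a bootstrap resample,
      # and the ensemble averages them (an equally weighted vote).
      "bagging":  BaggingClassifier(n_estimators=50, random_state=0),
      # Sequential: each new learner focuses on the examples the previous ones
      # got wrong, and better learners get more weight in the final vote.
      "boosting": AdaBoostClassifier(n_estimators=50, random_state=0),
  }

  for name, model in ensembles.items():
      scores = cross_val_score(model, X, y, cv=5)
      print(f"{name:8s}  mean acc = {scores.mean():.3f}")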

45
Q

Bagging and Boosting increase / decrease the variance of your single estimate, as they combine several estimates from different models.

So the result may be a model with higher / lower stability.

A

Bagging and Boosting decrease the variance of your single estimate, as they combine several estimates from different models.

So the result may be a model with higher stability.

46
Q

Both bagging + boosting generate several training data sets by random sampling, but only bagging / boosting determines the weights for the data to tip the scales in favor of the most difficult cases.

A

Both bagging + boosting generate several training data sets by random sampling, but only boosting determines the weights for the data to tip the scales in favor of the most difficult cases.

47
Q

Both bagging + boosting make the final decision by averaging the N learners (or taking the majority of them), but it is an equally weighted average for bagging / boosting, and a weighted average for bagging / boosting (ie - more weight is given to those models with better performance on training data)

A

Both bagging + boosting make the final decision by averaging the N learners (or taking the majority of them), but it is an equally weighted average for bagging, and a weighted average for boosting (ie - more weight is given to those models with better performance on training data)

48
Q

Both bagging + boosting are good at reducing variance and providing higher stability, but only bagging / boosting tries to reduce bias.

A

Both bagging + boosting are good at reducing variance and providing higher stability, but only boosting tries to reduce bias.

49
Q

On the other hand, bagging / boosting may solve the over-fitting problem, while bagging / boosting can increase it.

A

On the other hand, bagging may solve the over-fitting problem, while boosting can increase it.