8.2 Predictive Analytics: Model Evaluation & Bias-Variance Trade-Off Flashcards

1
Q: What is the primary goal of supervised learning methods?
A: To minimize error by making predictions as accurate as possible.

2
Q: How do classification methods evaluate model accuracy?
A: By calculating the percentage of correct predictions.

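A minimal sketch of that accuracy calculation, using made-up labels (the "spam"/"ham" values are purely illustrative):

```python
# Hypothetical actual vs. predicted class labels for five examples.
actual    = ["spam", "ham", "spam", "ham", "spam"]
predicted = ["spam", "ham", "ham",  "ham", "spam"]

# Accuracy = number of correct predictions / total predictions.
correct = sum(1 for a, p in zip(actual, predicted) if a == p)
accuracy = correct / len(actual)

print(f"Accuracy: {accuracy:.0%}")  # 4 of 5 correct -> Accuracy: 80%
```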
3
Q: How do regression models measure error?
A: By calculating the difference between predicted and actual values.

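Two common ways to aggregate those differences are mean squared error and mean absolute error; a minimal sketch with hypothetical values:

```python
# Hypothetical actual and predicted values for a regression model.
actual    = [3.0, 5.0, 7.0, 9.0]
predicted = [2.5, 5.5, 6.0, 9.5]

# Per-point differences between predicted and actual values.
errors = [p - a for p, a in zip(predicted, actual)]

mse = sum(e ** 2 for e in errors) / len(errors)   # mean squared error
mae = sum(abs(e) for e in errors) / len(errors)   # mean absolute error

print(f"MSE: {mse}, MAE: {mae}")  # MSE: 0.4375, MAE: 0.625
```

Squaring (MSE) penalizes large misses more heavily than MAE does, which is why the two summaries can rank models differently.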
4
Q: Why is evaluating unsupervised learning models difficult?
A: We don’t have labeled data, so we don’t know the correct number of groups or classifications.

5
Q: What is the purpose of a train-test split in machine learning?
A: To evaluate how well a model generalizes to unseen data and prevent overly optimistic projections.

6
Q: What percentage of data is typically used for training vs. testing?
A: 80% for training, 20% for testing.

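An 80/20 split can be sketched with the standard library alone (the toy dataset below is hypothetical; libraries such as scikit-learn provide a ready-made `train_test_split` as well):

```python
import random

# Hypothetical dataset of (feature, label) pairs.
data = [(x, 2 * x) for x in range(100)]

random.seed(42)       # fix the shuffle so the split is reproducible
random.shuffle(data)  # shuffle first so the split isn't biased by ordering

split = int(len(data) * 0.8)           # 80% train / 20% test
train, test = data[:split], data[split:]

print(len(train), len(test))  # 80 20
```

Shuffling before splitting matters: if the data were sorted (say, by date or by label), a plain slice would give train and test sets with different distributions.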
7
Q: What happens at deployment after the train-test split?
A: The model is retrained on 100% of the data, applying what was learned during model evaluation.

8
Q: What is the formula for model error?
A: Model Error = Irreducible error + Bias² + Variance

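The decomposition can be checked with a small Monte Carlo sketch (all values hypothetical): repeatedly refit a deliberately simple model on fresh noisy training sets, then compare the average squared error against irreducible noise + bias² + variance.

```python
import random
import statistics

random.seed(0)
SIGMA = 1.0                      # std dev of the irreducible noise
f = lambda x: 2 * x              # hypothetical "true" relationship
xs = [0.0, 1.0, 2.0, 3.0, 4.0]  # training inputs
x0 = 4.0                         # point where we evaluate the model

preds, sq_errors = [], []
for _ in range(20000):
    # Draw a fresh noisy training set and "fit" a deliberately simple
    # model: predict the mean training y everywhere (high bias).
    ys = [f(x) + random.gauss(0, SIGMA) for x in xs]
    y_hat = statistics.fmean(ys)
    preds.append(y_hat)
    # Measure squared error against a fresh noisy observation at x0.
    sq_errors.append((y_hat - (f(x0) + random.gauss(0, SIGMA))) ** 2)

bias = statistics.fmean(preds) - f(x0)   # systematic offset from the truth
variance = statistics.pvariance(preds)   # spread of predictions across refits
total = statistics.fmean(sq_errors)      # average squared error

# The decomposition predicts: total ≈ SIGMA**2 + bias**2 + variance
print(total, SIGMA**2 + bias**2 + variance)
```

Here the constant model is badly biased (it ignores x entirely), so the bias² term dominates; a more flexible model would shrink that term while inflating the variance term.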
9
Q: What is irreducible error in a model?
A: Noise in the data or insufficient data to fully represent patterns, which cannot be reduced by training models.

10
Q: How is bias measured in a model?
A: By how far predictions are from actual values; high bias means the model oversimplifies patterns.

11
Q: How is variance measured in a model?
A: By how much predictions fluctuate around their mean when the model is trained on different data.

12
Q: What is underfitting?
A: When a model hasn’t sufficiently learned patterns from the data, leading to high bias and low variance.

13
Q: What is overfitting?
A: When a model fits too tightly to the training data, causing low bias and high variance.

14
Q: What is the ideal bias-variance trade-off?
A: Low bias and low variance: a model complex enough to capture patterns but not overly sensitive to small changes in the data.
15
Q: How do training error and variance relate?
A: Lower training error generally comes from higher model complexity, which in turn leads to higher variance.
16
Q: How do error rates indicate underfitting and overfitting?
A:
  • Underfit model: high error on both training and testing data.
  • Overfit model: low error on training data but high error on testing data.
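That diagnostic pattern can be seen with two deliberately extreme models on hypothetical noisy data: a constant predictor (underfit) and a 1-nearest-neighbour memorizer of the training set (overfit).

```python
import random
import statistics

random.seed(1)

def sample(n):
    """Hypothetical noisy data: y = 3x + Gaussian noise."""
    pts = []
    for _ in range(n):
        x = random.uniform(0, 10)
        pts.append((x, 3 * x + random.gauss(0, 1)))
    return pts

train, test = sample(50), sample(50)

def mse(model, data):
    return statistics.fmean((model(x) - y) ** 2 for x, y in data)

# Underfit model: ignores x entirely and predicts the mean training y.
mean_y = statistics.fmean(y for _, y in train)
underfit = lambda x: mean_y

# Overfit model: a 1-nearest-neighbour memorizer of the training set.
overfit = lambda x: min(train, key=lambda p: abs(p[0] - x))[1]

print(f"underfit train/test MSE: {mse(underfit, train):.1f} / {mse(underfit, test):.1f}")
print(f"overfit  train/test MSE: {mse(overfit, train):.1f} / {mse(overfit, test):.1f}")
```

The underfit model shows high error on both sets, while the memorizer reproduces its training data exactly (zero training error) yet does worse on the test set than on training, which is the overfitting signature from the card above.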