Model Evaluation Flashcards

1
Q

Formula for Accuracy

A

(True Positive + True Negative) / Number of Items

2
Q

Formula for Precision (p)

A

p = TP / (TP + FP)

3
Q

Formula for Recall (r)

A

r = TP / (TP + FN)

4
Q

Formula for F-Measure

A

Precision = p
Recall = r

fm = 2rp / (r + p)

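The four formulas above can be sketched directly from the confusion-matrix counts. The counts below (40 TP, 10 FP, 5 FN, 45 TN) are made up for illustration:

```python
def accuracy(tp, tn, fp, fn):
    # (True Positive + True Negative) / number of items
    return (tp + tn) / (tp + tn + fp + fn)

def precision(tp, fp):
    # p = TP / (TP + FP)
    return tp / (tp + fp)

def recall(tp, fn):
    # r = TP / (TP + FN)
    return tp / (tp + fn)

def f_measure(p, r):
    # fm = 2rp / (r + p), the harmonic mean of precision and recall
    return 2 * r * p / (r + p)

# illustrative counts: 40 TP, 10 FP, 5 FN, 45 TN
p, r = precision(40, 10), recall(40, 5)
print(accuracy(40, 45, 10, 5))   # 0.85
print(f_measure(p, r))           # 16/19, about 0.842
```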
5
Q

compares the accuracy of the classifier with that of a random classifier.

A

Kappa Statistics

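A minimal sketch of the kappa statistic (Cohen's kappa), assuming a binary confusion matrix: it rescales the observed accuracy p_o against the chance accuracy p_e of a random classifier that matches the class marginals.

```python
def kappa(tp, fp, fn, tn):
    # kappa = (p_o - p_e) / (1 - p_e)
    n = tp + fp + fn + tn
    p_o = (tp + tn) / n                      # observed accuracy
    # chance agreement computed from the predicted/actual marginals
    p_e = ((tp + fp) / n) * ((tp + fn) / n) \
        + ((fn + tn) / n) * ((fp + tn) / n)
    return (p_o - p_e) / (1 - p_e)

print(kappa(40, 10, 5, 45))  # about 0.7: well above chance
```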
6
Q

was developed in the 1950s for signal detection theory.

Works only for binary classification.

A

Receiver Operating Characteristic (ROC)

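A ROC curve can be traced by sweeping a decision threshold over the classifier's scores and recording the false-positive and true-positive rates at each step. The scores below are made up; a perfectly separating scorer walks up the left edge and then across the top:

```python
def roc_points(scores, labels):
    # labels are 0/1; sweep thresholds from high to low and record
    # one (FPR, TPR) point per threshold, starting from the origin
    pos = sum(labels)
    neg = len(labels) - pos
    points = [(0.0, 0.0)]
    for t in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= t and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= t and y == 0)
        points.append((fp / neg, tp / pos))
    return points

print(roc_points([0.9, 0.8, 0.4, 0.3], [1, 1, 0, 0]))
```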
7
Q

to estimate the performance of a classifier on previously unseen data.

A

Purpose of Model Evaluation

8
Q

reserve k% of the data for training and (100 - k)% for testing.

A

Holdout

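A holdout split takes a few lines; the 70/30 ratio and fixed seed below are arbitrary choices for the example:

```python
import random

def holdout_split(data, k=70, seed=0):
    # reserve k% of the records for training, (100 - k)% for testing
    rng = random.Random(seed)       # fixed seed for reproducibility
    shuffled = data[:]
    rng.shuffle(shuffled)
    cut = len(shuffled) * k // 100
    return shuffled[:cut], shuffled[cut:]

train, test = holdout_split(list(range(100)), k=70)
print(len(train), len(test))  # 70 30
```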
9
Q

partition the data into k disjoint subsets.

A

Cross Validation

10
Q

train on k - 1 partitions, test on the remaining one.

A

K-Fold

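The two cards above fit together: cross-validation partitions the data into k disjoint folds, and each k-fold round trains on k - 1 of them and tests on the one held out. A minimal sketch:

```python
def k_fold_splits(data, k):
    # partition the data into k disjoint folds
    folds = [data[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]                                   # held-out fold
        train = [x for j in range(k) if j != i for x in folds[j]]
        yield train, test

for train, test in k_fold_splits(list(range(6)), k=3):
    print(train, test)
```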
11
Q

shows how accuracy on unseen examples changes with varying training sample size.

A

Learning Curve

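A learning curve can be traced by refitting on growing prefixes of the training data and scoring each model on a fixed test set. The majority-label "model" and the data below are toy assumptions, just enough to make the curve computable:

```python
from collections import Counter

def fit_majority(labels):
    # toy model: always predict the most common training label
    return Counter(labels).most_common(1)[0][0]

def accuracy(prediction, test_labels):
    return sum(1 for y in test_labels if y == prediction) / len(test_labels)

def learning_curve(train_labels, test_labels, sizes):
    # accuracy on unseen examples as the training sample size varies
    return [(n, accuracy(fit_majority(train_labels[:n]), test_labels))
            for n in sizes]

# small samples are dominated by the minority class, so accuracy improves
print(learning_curve([0, 0, 1, 1, 1, 1, 1, 1], [1, 1, 1, 1], [2, 4, 8]))
```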
12
Q

settings that many algorithms expose to control learning; they are chosen before training rather than learned from the data.

A

Hyperparameters

13
Q

3 STEPS IN TRAINING THE MODEL

A
  1. Train
  2. Model Selection
  3. Test
14
Q

learn models on the training data using different hyperparameters.

A

Train

15
Q

evaluate the models using the validation data and choose the hyperparameters with the best accuracy.

A

Model Selection

16
Q

test the final model using the test data.

A

Test
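The three steps (Train, Model Selection, Test) can be sketched end to end. The threshold "model" and the data here are illustrative assumptions; only the train/validation/test protocol is the point:

```python
def fit_threshold(train, h):
    # toy model: predict 1 when the feature reaches threshold h
    # (a real learner would use `train`; this toy ignores it)
    return lambda x: 1 if x >= h else 0

def accuracy(model, data):
    return sum(1 for x, y in data if model(x) == y) / len(data)

def train_select_test(train, val, test, hyperparams):
    # 1. Train: learn one model per hyperparameter value
    models = {h: fit_threshold(train, h) for h in hyperparams}
    # 2. Model selection: pick the value with best validation accuracy
    best = max(hyperparams, key=lambda h: accuracy(models[h], val))
    # 3. Test: evaluate the chosen model once on the test data
    return best, accuracy(models[best], test)

val = [(0.1, 0), (0.4, 0), (0.8, 1), (0.9, 1)]
test = [(0.2, 0), (0.7, 1)]
print(train_select_test([], val, test, hyperparams=[0.3, 0.5, 0.95]))
```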

17
Q

3 TYPES OF CLASSIFICATION ERRORS

A
  • Training Errors
  • Test Errors
  • Generalization Errors
18
Q

errors committed on the training set.

A

Training Errors

19
Q

errors committed on the test set.

A

Test Errors

20
Q

expected error of a model over a random selection of records from the same distribution.

A

Generalization Errors

21
Q

when the model is too simple, both training and test errors are large.

A

Underfitting

22
Q

when the model is too complex, training error is small but test error is large.

A

Overfitting

23
Q

2 REASONS FOR OVERFITTING

A
  1. Not enough training data
  2. High model complexity
24
Q

2 MODEL SELECTION FOR DECISION TREE

A
  1. Pre-Pruning (Early Stopping Rule)
  2. Post-Pruning
25
Q

stops the algorithm before the tree is fully grown.

A

Pre-Pruning (Early Stopping Rule)

26
Q

grow the decision tree to its entirety, then trim its nodes in a bottom-up fashion.

A

Post-Pruning
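The pre-pruning rule is easiest to see in a recursive tree builder: stop and emit a leaf when the node is pure, too small, or too deep. The median split and single numeric feature below are simplifying assumptions for the sketch:

```python
def build_tree(records, depth=0, max_depth=2, min_size=2):
    # records: list of (feature_value, label) pairs
    labels = [y for _, y in records]
    # pre-pruning (early stopping rule): halt before the tree is
    # fully grown if the node is pure, too small, or too deep
    if len(set(labels)) == 1 or len(records) < min_size or depth >= max_depth:
        return max(set(labels), key=labels.count)   # leaf: majority label
    split = sorted(x for x, _ in records)[len(records) // 2]  # median split
    left = [r for r in records if r[0] < split]
    right = [r for r in records if r[0] >= split]
    if not left or not right:                       # degenerate split
        return max(set(labels), key=labels.count)
    return (split,
            build_tree(left, depth + 1, max_depth, min_size),
            build_tree(right, depth + 1, max_depth, min_size))

def predict(tree, x):
    # internal nodes are (split, left, right) tuples; leaves are labels
    while isinstance(tree, tuple):
        split, left, right = tree
        tree = left if x < split else right
    return tree

tree = build_tree([(1, 0), (2, 0), (3, 0), (7, 1), (8, 1), (9, 1)])
print(predict(tree, 2), predict(tree, 8))  # 0 1
```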