B09 Evaluating Performance Flashcards

Question 1

Q

The goal of evaluating a ___________ is to have a better understanding of how its performance will _______ to future cases.

Answer

A

Classification Model Extrapolate

Question 2

Q

A ____________ is a table that categorizes predictions according to whether they match the ________.

Answer

A

Confusion Matrix Actual Value

Question 3

Q

The class of interest is known as the ________ class, while all others are known as _______.

Answer

A

Positive Negative

Question 4

Q

The ____________ adjusts accuracy by accounting for the possibility of a correct prediction by chance alone.

Answer

A

kappa statistic

Question 5

Q

Kappa values range from ___ to _____

Answer

A

0 (poor) - 1 (good)

Question 6

Q

Kappa can also be though of as ______

Answer

A

Proportion of all instances where the predicted and actual values match.

Question 7

Q

The _________ is defined as the proportion of positive predictions that are truly positive. A model with high _______ is trustworthy.

Answer

A

Precision

Question 8

Q

The ________is a measure of the completeness of the results. A model with high _______has wide breadth.

Question 9

Q

The _______ combines precision and recall into a single number using the harmonic mean. It provides a convenient way to compare several models side by side.

Answer

A

F-Measure

Question 10

Q

The _______ of a model (also called the __________) measures the proportion of positive examples that were correctly classified.

Answer

A

Sensitivity; True Positive Rate

Question 11

Q

The __________ of a model (also called the ____________) measures the proportion of negative examples that were correctly classified.

Answer

A

Specificity; True Negative Rate

Question 12

Q

The __________________ is commonly used to examine the trade-off between the detection of true positives, while avoiding the false positives.

Answer

A

Receiver Operating Characteristic (ROC) curve

Question 13

Q

The _______ treats the ROC diagram as a twodimensional square and measures the total area under the ROC curve.

Answer

A

Area Under the Curve (AUC)

Question 14

Q

Most learners present performance measures during training. This is known as the __________ This metric is overly optimistic and cannot reliably measure future performance.

Answer

A

Resubstitution error

Question 15

Q

The _______ method splits data into a ______ and ____ partition. At no time should the performance on the ____ dataset be allowed to influence the model

Answer

A

Holdout

Training

Test

Question 16

Q

Problems with the Holdout Method?

Answer

Study These Flashcards

A

Each partition may have a larger or smaller proportion of some classes. This could lead to a class being omitted from the training data (resolved by stratified random sampling).
Some samples may have too many or few difficult cases, easy-to-predict cases, or outliers.
Substantial portions of data must be reserved to test and validate the model.

Question 17

Q

A technique known as _________ is sometimes used to mitigate the problems with the holdout method. It uses the average result from several random holdout samples to evaluate a model’s performance.

Answer

Study These Flashcards

A

Repeated Holdout

Question 18

Q

_____ cross-validation randomly divides the data into ___ completely separate random partitions called ____

Answer

Study These Flashcards

A

K Fold

K

Folds

Question 19

Q

At the end of K-Fold Cross Validation, the __________ across all the folds is reported.

Answer

Study These Flashcards

A

Average performance

Question 20

Q

Some other Cross Validation techniques include:

Answer

Study These Flashcards

A

5 Fold Cross Validation

Leave-one-out cross-validation

Random cross-validation

Stratified cross-validation

Question 21

Q

Using sampling with replacement to form training set, _________ presents an alternative to cross-validation

Answer

Study These Flashcards

A

bootstrapping

Question 22

Q

Bootstrapping typically uses less data than crossvalidation for training, therefore, its test error will be _________

Answer

Study These Flashcards

A

rather pessimistic

B09 Evaluating Performance Flashcards

Exam Prep (22 cards)