Model Evaluation Flashcards
Accuracy
Measures the proportion of predictions the model classified correctly.
Accuracy = (TP+TN)/(TP+TN+FN+FP)
Sensitivity / Recall
True positive rate (TPR)
TPR = TP/(TP+FN)
Looking at all the actual positives, how many of them were correctly classified as positive?
Specificity
True negative rate (TNR)
TNR = TN/(TN+FP)
Looking at all the actual negatives, how many of them were correctly classified as negative?
Precision
Precision = TP/(TP+FP)
Looking at all the predicted positive cases, how many of them were correctly classified as positive?
“If I predict something negative and it’s wrong, that’s fine, but let the ones predicted positive be good!”
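The four confusion-matrix metrics above can be sketched in a few lines of Python; the counts are made-up illustration values:

```python
# Confusion-matrix metrics from raw counts.
# TP, TN, FP, FN are made-up illustration values.
TP, TN, FP, FN = 40, 45, 5, 10

accuracy    = (TP + TN) / (TP + TN + FN + FP)  # 0.85
sensitivity = TP / (TP + FN)                   # recall / TPR = 0.8
specificity = TN / (TN + FP)                   # TNR = 0.9
precision   = TP / (TP + FP)                   # ~0.889

print(accuracy, sensitivity, specificity, precision)
```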
ROC Curve
The confusion matrix is the basis of the Receiver Operating Characteristic (ROC) curve.
The ROC curve allows us to compare different classification models. Each point on the ROC curve corresponds to a different cut-off threshold of the model.
X axis - false positive rate (false positives amongst all actual negatives) = 1 - specificity → indicator of how well the model classifies negative cases.
Y axis - true positive rate (true positives amongst all actual positives) = sensitivity → indicator of how well the model classifies positive cases.
Depending on what your priorities are, you might choose a model that allows more or less of either.
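A minimal sketch of how the ROC points arise, assuming made-up predicted scores and labels: each candidate threshold is tried in turn, and the (FPR, TPR) pair it produces is one point on the curve.

```python
# Build ROC points by sweeping the decision threshold over the scores.
# Scores and labels below are made-up illustration values.
scores = [0.9, 0.8, 0.7, 0.6, 0.55, 0.4, 0.3, 0.2]
labels = [1,   1,   0,   1,   0,    1,   0,   0  ]  # 1 = actual positive

P = sum(labels)           # number of actual positives
N = len(labels) - P       # number of actual negatives

def roc_point(threshold):
    tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 1)
    fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 0)
    return fp / N, tp / P  # (FPR, TPR) = (1 - specificity, sensitivity)

# One ROC point per candidate threshold, from strictest to loosest:
points = [roc_point(t) for t in sorted(set(scores), reverse=True)]
print(points)  # the loosest threshold always gives (1.0, 1.0)
```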
AUC
Sometimes the curves on the chart do not clearly show which model performs better. The AUC (Area Under the Curve) metric can help.
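One way to compute AUC is the trapezoidal rule over the ROC points; the (FPR, TPR) points below are made-up illustration values (a real ROC curve always runs from (0, 0) to (1, 1)).

```python
# AUC as the area under a piecewise-linear ROC curve (trapezoidal rule).
# The (FPR, TPR) points are made-up illustration values.
points = [(0.0, 0.0), (0.0, 0.5), (0.25, 0.75), (0.5, 1.0), (1.0, 1.0)]

auc = sum((x2 - x1) * (y1 + y2) / 2
          for (x1, y1), (x2, y2) in zip(points, points[1:]))
print(auc)  # 0.875
```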
Gain Chart
We have a population of customers; on average, 10% of them end up buying the product. We want to select a subgroup of the population to send marketing emails to, 60% of whom will buy the product. Hence we want our model to predict the people who will buy the product, i.e. we want a high true positive rate (sensitivity).
The gain chart shows how the model’s target metric (sensitivity) changes as the sample size grows, i.e. as we send more marketing emails.
The greater the distance between the gain curve and the baseline (random selection), the better the model.
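A minimal sketch of a cumulative gain computation, assuming made-up scores and labels: customers are ranked by model score, and for each fraction of the population contacted we record the share of all buyers reached.

```python
# Cumulative gain: rank customers by model score, then track what share of
# all buyers is reached as more of the population is contacted.
# Scores and labels are made-up illustration values (1 = buyer).
scored = [(0.95, 1), (0.9, 1), (0.8, 0), (0.7, 1), (0.6, 0),
          (0.5, 1), (0.4, 0), (0.3, 0), (0.2, 0), (0.1, 0)]
scored.sort(key=lambda t: t[0], reverse=True)

total_buyers = sum(y for _, y in scored)
captured = 0
for i, (_, y) in enumerate(scored, start=1):
    captured += y
    # fraction of population contacted -> fraction of buyers reached
    print(f"{i / len(scored):.0%} contacted -> {captured / total_buyers:.0%} of buyers")
```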
Lift Chart
The lift chart shows how many times the response rate in the sample selected by the model exceeds that of a random sample, as the percentage of the population contacted increases.
Lift is calculated as the ratio between the results in the target obtained with and without the model.
Example: if we contact only 10% of the population and the customers contacted are chosen with the model, the response rate is 35%; with random sampling it is 10%. The lift is therefore 35% / 10% = 3.5.
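The example above translates directly into the lift formula:

```python
# Lift from the example: response rate with the model vs random sampling
# when contacting 10% of the population.
model_response_rate  = 0.35
random_response_rate = 0.10

lift = model_response_rate / random_response_rate  # 35% / 10% = 3.5
print(round(lift, 6))
```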
Kolmogorov-Smirnov graph
A measure of the degree of separation between the positive and negative distributions.
K-S value = 100: the population is divided into two completely separate groups, one containing all the positives, the other all the negatives.
K-S value = 0: the model is unable to differentiate between positives and negatives; it works like random selection.
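A minimal sketch of the K-S statistic, assuming made-up model scores for the two groups: it is the largest gap between the cumulative score distributions of actual positives and actual negatives (computed here on a 0-1 scale rather than 0-100).

```python
# K-S statistic: max gap between the cumulative score distributions of
# actual positives and actual negatives. Scores are made-up values in
# which the two groups happen to be perfectly separated.
pos_scores = [0.9, 0.8, 0.7, 0.6]
neg_scores = [0.5, 0.4, 0.3, 0.2]

def cdf(sample, x):
    """Empirical CDF: fraction of the sample with score <= x."""
    return sum(1 for s in sample if s <= x) / len(sample)

thresholds = sorted(pos_scores + neg_scores)
ks = max(abs(cdf(pos_scores, t) - cdf(neg_scores, t)) for t in thresholds)
print(ks)  # 1.0 -> complete separation (100 on a percentage scale)
```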
Akaike Information Criterion
Combines the goodness of the model with its complexity, measured by the number of independent variables.
AIC = -2\ln(\hat{L}) + 2(k+1)
where \hat{L} is the maximum of the likelihood function and k is the number of independent variables.
The lower the better!
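A worked example of the formula, with an assumed maximized log-likelihood and variable count:

```python
# AIC = -2*ln(L-hat) + 2*(k+1), with made-up illustration values.
log_likelihood = -120.0  # ln(L-hat), the maximized log-likelihood (assumed)
k = 3                    # number of independent variables (assumed)

aic = -2 * log_likelihood + 2 * (k + 1)
print(aic)  # 248.0 -- lower is better when comparing models
```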
McFadden’s R2
R^2_{McF} = 1 - LL_{full model} / LL_{intercept}
Compares the LL of the model with the LL of a model containing only the intercept. The closer to 1, the better. Values tend to be small; values around 0.2 to 0.4 can be considered satisfactory.
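A worked example, with assumed log-likelihood values:

```python
# McFadden's pseudo-R^2, with made-up log-likelihoods.
ll_full      = -80.0   # log-likelihood of the fitted model (assumed)
ll_intercept = -120.0  # log-likelihood of the intercept-only model (assumed)

r2_mcfadden = 1 - ll_full / ll_intercept  # ~0.333, a satisfactory value
print(r2_mcfadden)
```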
R2 of Cox and Snell
R^2_{CS} = 1 - \exp[-(2/n)(LL(B) - LL(0))]
Also takes the size of the data sample, n, into account.
Nagelkerke R2
R^2_{N} = R^2_{CS} / R^2_{MAX}
where
R^2_{MAX} = 1 - \exp[(2/n) \, LL(0)]
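Both pseudo-R2 formulas can be sketched together, with assumed log-likelihoods and sample size:

```python
import math

# Cox & Snell and Nagelkerke pseudo-R^2, with made-up illustration values.
n = 100            # sample size (assumed)
ll_model = -80.0   # LL(B), log-likelihood of the fitted model (assumed)
ll_null  = -120.0  # LL(0), log-likelihood of the null model (assumed)

r2_cs  = 1 - math.exp(-2 / n * (ll_model - ll_null))  # Cox & Snell
r2_max = 1 - math.exp(2 / n * ll_null)                # upper bound of r2_cs
r2_nagelkerke = r2_cs / r2_max                        # rescaled to reach 1

print(r2_cs, r2_nagelkerke)
```

Nagelkerke's correction rescales the Cox & Snell value, whose maximum is below 1, so that a perfect model can score exactly 1.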
Measures for Quantitative target model evaluation
MAE - Mean Absolute Error
MSE - Mean Squared Error
MPE - Mean Percentage Error
MAPE - Mean Absolute Percentage Error
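The four quantitative-target metrics can be sketched on made-up data:

```python
# MAE, MSE, MPE and MAPE for a quantitative target.
# Actual and predicted values are made-up illustration data.
actual    = [100.0, 200.0, 300.0, 400.0]
predicted = [110.0, 190.0, 330.0, 360.0]
n = len(actual)

errors = [a - p for a, p in zip(actual, predicted)]

mae  = sum(abs(e) for e in errors) / n                       # mean absolute error
mse  = sum(e * e for e in errors) / n                        # mean squared error
mpe  = sum(e / a for e, a in zip(errors, actual)) / n * 100  # signed, in %
mape = sum(abs(e) / a for e, a in zip(errors, actual)) / n * 100  # unsigned, in %

print(mae, mse, mpe, mape)  # MPE can hide errors that cancel out; MAPE cannot
```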