W07 Validation and Evaluation Flashcards
Forecast Evaluation
Two possibilities
- compare to actual observations (beware of self-fulfilling prophecies)
- compare to a naive forecast
Error Measures
Absolute
Percentage
Scaled
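A minimal sketch of the three measure families, assuming NumPy arrays of actuals and forecasts (function names are illustrative, not from the slides); the scaled measure also illustrates the comparison against a naive one-step forecast.

```python
import numpy as np

def mae(actual, forecast):
    """Absolute measure: mean absolute error, in the units of the series."""
    return np.mean(np.abs(actual - forecast))

def mape(actual, forecast):
    """Percentage measure: mean absolute percentage error (scale-free, undefined at 0)."""
    return np.mean(np.abs((actual - forecast) / actual)) * 100

def mase(actual, forecast):
    """Scaled measure: MAE scaled by the MAE of a naive one-step forecast."""
    naive_mae = np.mean(np.abs(actual[1:] - actual[:-1]))
    return mae(actual, forecast) / naive_mae

actual   = np.array([100.0, 110.0, 105.0, 120.0, 115.0])
forecast = np.array([ 98.0, 112.0, 100.0, 118.0, 119.0])
print(mae(actual, forecast), mape(actual, forecast), mase(actual, forecast))
```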
Self-fulfilling forecast - negative consequence
buy-down
Classification Performance
Recall (Sensitivity)
correctly assigned / actually in class
Precision
actually in class and assigned / assigned to class
Specificity
correctly not assigned / actually not in class (share of not-selected instances that are truly negative)
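A small sketch computing the three rates from confusion-matrix counts (the counts used here are hypothetical).

```python
def classification_metrics(tp, fp, fn, tn):
    """Recall, precision, and specificity from confusion-matrix counts."""
    recall      = tp / (tp + fn)   # correctly assigned / actually in class
    precision   = tp / (tp + fp)   # actually in class and assigned / assigned to class
    specificity = tn / (tn + fp)   # correctly not assigned / actually not in class
    return recall, precision, specificity

# Hypothetical counts for a single class
print(classification_metrics(tp=40, fp=10, fn=5, tn=45))
```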
Error rate across categories
average or weighted average or importance-weighted
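A brief sketch of the three ways to combine per-class error rates (the helper name and the example weights are illustrative assumptions).

```python
def combined_error_rate(error_rates, class_sizes=None, importance=None):
    """Combine per-class error rates: plain average, size-weighted, or importance-weighted."""
    if importance is not None:
        weights = importance
    elif class_sizes is not None:
        weights = class_sizes
    else:
        weights = [1] * len(error_rates)
    return sum(e * w for e, w in zip(error_rates, weights)) / sum(weights)

rates = [0.05, 0.20, 0.10]                                       # per-class error rates
print(combined_error_rate(rates))                                # simple average
print(combined_error_rate(rates, class_sizes=[800, 150, 50]))    # weighted by class size
print(combined_error_rate(rates, importance=[1, 5, 2]))          # weighted by importance
```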
Comparing error rates
training vs validation vs test set
expected error vs observed error vs benchmark approach
Possible Benchmarks
statistically expected error rate
naive rules
expert assignment
Benchmark Factors beyond accuracy
effort
reliability
acceptance
Data Set split
Training Set: build tree
Validation Set: prune tree
Test Set: evaluate tree’s predictions
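A minimal sketch of a random three-way split, assuming a NumPy array and illustrative 60/20/20 proportions (the function name and ratios are assumptions, not from the slides).

```python
import numpy as np

def three_way_split(data, train=0.6, validation=0.2, seed=0):
    """Shuffle and split into training (build tree), validation (prune tree), test (evaluate)."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(data))
    n_train = int(train * len(data))
    n_val   = int(validation * len(data))
    return data[idx[:n_train]], data[idx[n_train:n_train + n_val]], data[idx[n_train + n_val:]]

data = np.arange(100)
train_set, val_set, test_set = three_way_split(data)
print(len(train_set), len(val_set), len(test_set))   # 60 20 20
```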
Testing: Hold out 1
k-fold cross validation
1 split the data into k partitions of equal size
2 use k-1 partitions for training
3 use the remaining partition for evaluation
4 repeat k times, so every partition serves once as the evaluation set
5 average the results
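A sketch of the five steps above, assuming NumPy arrays and a caller-supplied train-and-score function (the majority-class "model" in the usage is a deliberately trivial stand-in).

```python
import numpy as np

def k_fold_cv(data, labels, k, train_and_score):
    """k-fold cross-validation: each partition serves once as the evaluation set."""
    idx = np.random.permutation(len(data))
    folds = np.array_split(idx, k)                  # 1. split into k partitions of (almost) equal size
    scores = []
    for i in range(k):                              # 4. repeat k times
        test_idx  = folds[i]                        # 3. the i-th partition is used for evaluation
        train_idx = np.concatenate([folds[j] for j in range(k) if j != i])  # 2. k-1 for training
        scores.append(train_and_score(data[train_idx], labels[train_idx],
                                      data[test_idx],  labels[test_idx]))
    return np.mean(scores)                          # 5. average the results

# Hypothetical usage with a trivial majority-class "model"
data, labels = np.arange(100).reshape(-1, 1), np.array([0, 1] * 50)
def majority_score(Xtr, ytr, Xte, yte):
    majority = np.bincount(ytr).argmax()
    return np.mean(yte == majority)                 # accuracy of always predicting the majority class
print(k_fold_cv(data, labels, k=5, train_and_score=majority_score))
```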
Testing: Hold out 2
Bootstrap
an alternative to cross-validation, suited to small data sets
n is the original data set size
draw n instances with replacement (the same instance can be drawn multiple times)
These form the training set.
Instances that were never drawn form the test set.
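A small sketch of the sampling step (the helper name is an assumption); with sampling with replacement, roughly 1/e ≈ 36.8% of instances are never drawn and end up in the test set.

```python
import numpy as np

def bootstrap_split(n, seed=0):
    """Bootstrap sampling: draw n indices with replacement for training;
    indices never drawn form the test set."""
    rng = np.random.default_rng(seed)
    train_idx = rng.integers(0, n, size=n)              # same instance can be drawn multiple times
    test_idx  = np.setdiff1d(np.arange(n), train_idx)   # never-drawn instances
    return train_idx, test_idx

train_idx, test_idx = bootstrap_split(1000)
print(len(test_idx) / 1000)   # roughly 0.368 of instances land in the test set
```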
Lift Factor
What increase in accuracy does my prediction promise?
Gives a ratio, not absolute values; helpful for cost-benefit analysis.
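A minimal illustration of the ratio, using a hypothetical direct-mailing example (the numbers are invented for the sketch).

```python
def lift_factor(response_rate_targeted, response_rate_overall):
    """Ratio of the success rate in the model-selected subset to the overall base rate."""
    return response_rate_targeted / response_rate_overall

# Hypothetical mailing campaign: 10% respond in the model-selected group vs 2% overall
print(lift_factor(0.10, 0.02))   # lift of 5: targeting beats random selection by a factor of 5
```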
Lift Chart
used when classification is probabilistic
compute the lift factor for increasing sample sizes, possibly comparing it to the increase in cost caused by the larger sample
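A sketch of the chart's data points, assuming predicted class probabilities and true labels as NumPy arrays (the helper name and the toy scores are assumptions); each point pairs a sample fraction with the lift achieved when taking the most confident predictions first, which can then be set against the cost of the larger sample.

```python
import numpy as np

def lift_chart_points(probabilities, labels, steps=10):
    """Cumulative lift at increasing sample sizes, ranking instances by predicted probability."""
    order = np.argsort(probabilities)[::-1]          # most confident predictions first
    labels = np.asarray(labels)[order]
    base_rate = labels.mean()
    points = []
    for frac in np.linspace(0.1, 1.0, steps):
        n = max(1, int(frac * len(labels)))
        lift = labels[:n].mean() / base_rate         # lift factor at this sample size
        points.append((frac, lift))
    return points

# Hypothetical scores and true classes
probs = np.array([0.9, 0.8, 0.75, 0.6, 0.55, 0.4, 0.3, 0.2, 0.15, 0.05])
truth = np.array([1,   1,   0,    1,   0,    0,   1,   0,   0,    0  ])
print(lift_chart_points(probs, truth, steps=5))
```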