W07 Validation and Evaluation Flashcards

1
Q

Forecast Evaluation

2 possibilities

A
  • compare to actual observations (beware of self fulfilling prophecies)
  • compare to naive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Error Measures

A

Absolute
Percentage
Scaled

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Self-fulfilling forecast - negative consequence

A

buy-down

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Classification Performance

A

Recall (Sensitivity)
correctly assigned / actually in class

Precision
actually in class and assigned / assigned to class
Specificity
how many (share of) not selected are actually not true?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Error rate across categories

A

average or weighted average or importance-weighted

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Comparing error rates

A

training vs validation vs test set

expected error vs observed error vs benchmark approach

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Possible Benchmarks

A

statistically expected error rate
naive rules
expert assignment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Benchmark Factors beyond accuracy

A

effort
reliability
acceptance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Data Set split

A

Training Set: build tree
Validation Set: prune tree
Test Set: evaluate tree’s predictions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Testing: Hold out 1

A

k-fold cross validation

1 split data into k partitions of equal size

2 use k-1 for training
3 use k for evaluation

4 repeat k times
5 average the results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Testing: Hold out 2

A

Bootstrap
alternative to cross validation, for small data sets

n is original data set size

draw n instances with replacement (same can be drawn multiple times)

This is the training set.

Never drawn instances are test set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Lift Factor

A

What increase in accuracy does my prediction promise?

Gives ratio, not absolutes. Helpful for cost-benefit analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Lift Chart

A

when classification is probabilistic

compute lift factor when increasing sample size, possibly comparing to increase in cost caused by increasing the sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly