Topic 6: Machine Learning: Performance Evaluation, Backtesting & False Discoveries Flashcards

Question 1

Q

Describe a ranking classifier

Answer

A

a classifier that gives scores to instances (classifier + threshold = single confusion matrix)
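
The "classifier + threshold = single confusion matrix" idea can be sketched in a few lines of Python. This is only an illustration: the function name, the scores, the labels and the 0.5 threshold are all made-up assumptions, not part of the flashcard.

```python
def confusion_at_threshold(scores, labels, threshold):
    """Count TP, FP, TN, FN when instances scoring >= threshold are called positive."""
    tp = fp = tn = fn = 0
    for score, label in zip(scores, labels):
        predicted_positive = score >= threshold
        if predicted_positive and label == 1:
            tp += 1
        elif predicted_positive and label == 0:
            fp += 1
        elif not predicted_positive and label == 0:
            tn += 1
        else:
            fn += 1
    return {"TP": tp, "FP": fp, "TN": tn, "FN": fn}


# Hypothetical scores from a ranking classifier and the true labels (1 = positive).
scores = [0.9, 0.8, 0.6, 0.4, 0.2]
labels = [1, 1, 0, 1, 0]

matrix = confusion_at_threshold(scores, labels, threshold=0.5)
stricter = confusion_at_threshold(scores, labels, threshold=0.85)
```

Changing the threshold changes the confusion matrix, which is why one ranking classifier corresponds to a whole family of classifiers.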

Question 2

Q

Describe the ROC graph

Answer

A

Two-dimensional plot with the false positive rate on the x-axis and the true positive rate on the y-axis

Question 3

Q

Describe the ROC graph.

Answer

A

The y-axis shows the true positive rate (sensitivity) and the x-axis shows the false positive rate (1 - specificity)

sensitivity = TP / (TP + FN)

specificity = TN / (TN + FP), so 1 - specificity = FP / (FP + TN)
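
As a rough sketch of how the points on a ROC graph arise, the Python below lowers the threshold one instance at a time and records the (false positive rate, true positive rate) pair after each step. The function name and data are hypothetical, and the sketch assumes no tied scores and at least one instance of each class.

```python
def roc_points(scores, labels):
    """Return (FPR, TPR) pairs obtained by lowering the threshold one instance at a time."""
    positives = sum(labels)
    negatives = len(labels) - positives
    # Rank instances from highest to lowest score.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    tp = fp = 0
    points = [(0.0, 0.0)]  # threshold above every score: nothing is called positive
    for _, label in ranked:
        if label == 1:
            tp += 1
        else:
            fp += 1
        points.append((fp / negatives, tp / positives))
    return points


# Hypothetical scores and true labels (1 = positive).
scores = [0.9, 0.8, 0.6, 0.4, 0.2]
labels = [1, 1, 0, 1, 0]
points = roc_points(scores, labels)
```

The curve starts at (0, 0), where nothing is classified as positive, and ends at (1, 1), where everything is.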

Question 4

Q

Describe the four corners and the diagonal of the ROC graph.

Answer

A

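Bottom left (0, 0): conservative (only makes positive classifications with strong evidence, so few false positives but also few true positives)

Upper right (1, 1): permissive (makes positive classifications even with weak evidence, so most positives are caught but the false positive rate is high)

Upper left (0, 1): perfect classification (all positives caught, no false alarms)

Lower right (1, 0): worse than random (every classification is wrong)

Diagonal: random guessing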

Question 5

Q

Define the hit rate and false alarm rate.

Answer

A

hit rate = percentage of positives correctly classified, i.e. the true positive rate (TP/(TP+FN))

false alarm rate = FP/(FP+TN)

Question 6

Q

Define the AUC measure.

Answer

A

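The area under the ROC curve (AUC) summarizes a model's detection performance in a single number that does not depend on the detection threshold. An AUC of 1.0 indicates a perfect ranking; 0.5 corresponds to random guessing.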
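
One way to see why the AUC does not depend on a threshold: it equals the probability that a randomly chosen positive instance is scored above a randomly chosen negative one. The Python below is a minimal sketch of that pairwise interpretation, with a hypothetical function name and made-up data. It is not how a library would compute AUC on large data sets.

```python
def auc_pairwise(scores, labels):
    """AUC as the share of (positive, negative) pairs ranked correctly; ties count half."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical scores and true labels (1 = positive).
scores = [0.9, 0.8, 0.6, 0.4, 0.2]
labels = [1, 1, 0, 1, 0]
auc = auc_pairwise(scores, labels)  # 5 of the 6 positive/negative pairs are ordered correctly

perfect = auc_pairwise([0.9, 0.8, 0.3, 0.1], [1, 1, 0, 0])
```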

Question 7

Q

Describe the cumulative response curve, also known as the lift curve.

Answer

A

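The cumulative response (lift) curve plots the hit rate (percentage of positives correctly classified) as a function of the percentage of the population that is targeted.

(e.g. targeting the top-scored 20% of test instances captures 60% of the positives)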
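
A cumulative response curve can be sketched by ranking instances by score and tracking how many positives have been captured as more of the population is targeted. The function name and data below are illustrative assumptions only.

```python
def cumulative_response(scores, labels):
    """Return (fraction of population targeted, fraction of positives captured) pairs."""
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    total_positives = sum(labels)
    captured = 0
    curve = []
    for i, (_, label) in enumerate(ranked, start=1):
        captured += label  # label is 1 for a positive, 0 otherwise
        curve.append((i / len(ranked), captured / total_positives))
    return curve


# Hypothetical scores and true labels (1 = positive).
scores = [0.9, 0.8, 0.6, 0.4, 0.2]
labels = [1, 1, 0, 1, 0]
curve = cumulative_response(scores, labels)
```

In this toy data, targeting the top 40% of instances captures two of the three positives. A random ranking would capture about 40% of them.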

Question 8

Q

Describe why standard statistical tools, such as p-values and t-statistics,
can lead to false discoveries in the presence of multiple tests.

Answer

A

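When a large number of tests is run, some will appear significant purely by chance (false positives), so conventional cutoffs such as a p-value below 0.05 are too lenient and a tougher standard is needed.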
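
To get a feel for the size of the problem, the sketch below computes the probability of at least one false positive across several tests when every null hypothesis is true. It assumes the tests are independent, which is a simplifying assumption added here and not something the flashcard states. The function name is hypothetical.

```python
def prob_at_least_one_false_discovery(num_tests, alpha=0.05):
    """P(at least one false positive) across independent tests when all nulls are true."""
    return 1.0 - (1.0 - alpha) ** num_tests


single = prob_at_least_one_false_discovery(1)   # equals alpha itself
twenty = prob_at_least_one_false_discovery(20)  # roughly 0.64
```

With 20 independent tests at the 5% level, a spurious "discovery" is more likely than not.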

Question 9

Q

Calculate the t-statistic based on the reported Sharpe ratio for testing a
single trading strategy.

Answer

A

t-statistic = annualized Sharpe ratio × √(number of years)
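
A small worked example of the formula, with made-up inputs (a Sharpe ratio of 0.5 over 16 years). The two-sided p-value uses a normal approximation to the t-distribution, which is an assumption of this sketch and not part of the flashcard.

```python
import math


def t_stat_from_sharpe(annual_sharpe, years):
    """t-statistic implied by an annualized Sharpe ratio observed over `years` years."""
    return annual_sharpe * math.sqrt(years)


def two_sided_p_value_normal(t_stat):
    """Two-sided p-value under a standard normal approximation (an assumption here)."""
    return math.erfc(abs(t_stat) / math.sqrt(2))


t_stat = t_stat_from_sharpe(annual_sharpe=0.5, years=16)  # 0.5 * 4 = 2.0
p_value = two_sided_p_value_normal(t_stat)                # roughly 0.046
```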

Question 10

Q

Describe and apply Bonferroni tests in the context of the family-wise error rate
(FWER) approach to adjusting p-values for multiple tests.

Answer

A

Approaches to the multiple testing problem in statistics:

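The Bonferroni test is a family-wise error rate (FWER) method: it controls the probability of making even one false discovery across all tests. Each p-value is compared with a hurdle of 0.05 / number of tests (equivalently, each p-value is multiplied by the number of tests and compared with 0.05).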
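
A minimal sketch of applying the Bonferroni hurdle in Python. The function name and the five p-values are hypothetical, and 0.05 is used as the significance level to match the card.

```python
def bonferroni(p_values, alpha=0.05):
    """Return the Bonferroni hurdle, the adjusted p-values, and the reject decisions."""
    m = len(p_values)
    hurdle = alpha / m
    adjusted = [min(p * m, 1.0) for p in p_values]  # adjusted p-values are capped at 1
    rejected = [p < hurdle for p in p_values]
    return hurdle, adjusted, rejected


# Hypothetical p-values from five strategy tests.
p_values = [0.04, 0.001, 0.03, 0.012, 0.2]
hurdle, adjusted, rejected = bonferroni(p_values)
```

With five tests the hurdle is 0.05 / 5 = 0.01, so only the test with p = 0.001 survives.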

Question 11

Q

Recognize and apply the Holm function to calculate adjusted p-values

Answer

A

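Order the p-values from smallest (k = 1) to largest (k = M, the total number of tests). The Holm hurdle for the k-th p-value is 0.05 / (M + 1 - k). Compare each p-value with its own hurdle.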
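
The Holm hurdles for a given number of tests can be listed with a one-line helper. The function name is hypothetical and the choice of five tests is just an example.

```python
def holm_hurdles(num_tests, alpha=0.05):
    """Hurdle for the k-th smallest p-value, k = 1..num_tests: alpha / (num_tests + 1 - k)."""
    return [alpha / (num_tests + 1 - k) for k in range(1, num_tests + 1)]


hurdles = holm_hurdles(5)  # 0.05/5, 0.05/4, 0.05/3, 0.05/2, 0.05/1
```

The first hurdle equals the Bonferroni hurdle, and the hurdles then loosen step by step until the last one equals the unadjusted 0.05.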

Question 12

Q

Describe the Holm method in the context of the false discovery rate (FDR) approach to adjusting p-values for multiple tests.

Answer

A

The Holm method is less stringent than the Bonferroni method (both control the FWER); the false discovery rate (FDR) approach is less stringent than both of them.

Question 13

Q

Describe the process of accepting and rejecting tests using the Holm method.

Answer

A

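Starting with the smallest p-value, a test is declared significant if its p-value is less than its Holm hurdle. The first time a p-value fails to clear its hurdle, that test and all tests with larger p-values are declared not significant.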
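
The step-down procedure can be sketched as follows. The function name and p-values are hypothetical. The sketch uses a strict "less than" comparison and a 0.05 significance level.

```python
def holm_reject(p_values, alpha=0.05):
    """Return a list of booleans: True where the Holm procedure declares significance."""
    m = len(p_values)
    # Indices of the tests ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    rejected = [False] * m
    for k, idx in enumerate(order, start=1):
        hurdle = alpha / (m + 1 - k)
        if p_values[idx] < hurdle:
            rejected[idx] = True
        else:
            break  # stop at the first failure; all larger p-values also fail
    return rejected


# Hypothetical p-values from five strategy tests.
p_values = [0.04, 0.001, 0.03, 0.012, 0.2]
decisions = holm_reject(p_values)
```

Here p = 0.001 clears the first hurdle (0.01) and p = 0.012 clears the second (0.0125), but p = 0.03 fails the third (about 0.0167), so the procedure stops. A plain 0.05 / 5 = 0.01 Bonferroni hurdle would have passed only the first of these.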

Question 14

Q

Explain the relationship between avoiding false discoveries and missing
profitable opportunities.

Answer

A

Raising the hurdle when performing multiple tests decreases type I errors (false discoveries) but increases type II errors (missed discoveries).

Question 15

Q

Define specificity and sensitivity.

Answer

A

specificity = TN / (TN + FP)

sensitivity = TP / (TP + FN)
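
Both definitions translate directly into code. The counts below are invented for illustration, and the function names are hypothetical.

```python
def sensitivity(tp, fn):
    """Share of actual positives that are classified as positive (true positive rate)."""
    return tp / (tp + fn)


def specificity(tn, fp):
    """Share of actual negatives that are classified as negative (true negative rate)."""
    return tn / (tn + fp)


# Hypothetical confusion-matrix counts.
tp, fn, tn, fp = 80, 20, 855, 45

sens = sensitivity(tp, fn)               # 80 / 100
spec = specificity(tn, fp)               # 855 / 900
false_alarm_rate = 1.0 - spec            # same as FP / (FP + TN)
```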

Question 16

Q

Describe the false discovery rate with the help of a tree diagram

Answer

A

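In the tree diagram, the top branch (cases that are truly positive) splits according to sensitivity and the bottom branch (cases that are truly negative) splits according to specificity. The false discovery rate is the share of all positive results that come from the bottom branch.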
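
The tree diagram can be walked through numerically. In this sketch every input is a made-up assumption: 1,000 strategies tested, 10% of them truly profitable, a sensitivity of 0.80 and a specificity of 0.95. The function name is hypothetical.

```python
def tree_diagram(n_tests, share_true, sensitivity, specificity):
    """Split a population down a two-level tree and return expected counts plus the FDR."""
    n_true = n_tests * share_true   # top branch: truly positive cases
    n_false = n_tests - n_true      # bottom branch: truly negative cases
    tp = n_true * sensitivity       # top branch, test says positive
    fn = n_true - tp                # top branch, test says negative
    tn = n_false * specificity      # bottom branch, test says negative
    fp = n_false - tn               # bottom branch, test says positive
    fdr = fp / (fp + tp)            # share of positive results that are false
    return {"TP": tp, "FN": fn, "TN": tn, "FP": fp, "FDR": fdr}


result = tree_diagram(n_tests=1000, share_true=0.10, sensitivity=0.80, specificity=0.95)
```

Under these assumed inputs there are about 80 true positives and 45 false positives, so roughly 36% of the apparent discoveries are false even though the test has 95% specificity.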

Question 17

Q

Calculate the false discovery rate.

Answer

A

FP / (FP + TP)

(17 cards)