9. A Protocol in the Era of ML Flashcards

Question 1

Q

Answer

A

(Important bc large out-of-the-sample tests not possible in ML)

Question 2

Q

Answer

A

( 2-sigma is not necessarily meaningful, and a strategy working by chance is not necessarily significant )

Question 3

Q

Answer

A

Test sample defined and justified ex-Ante
–> Justify in advance and never change sample
Winsorization (truncation at certain threshhold) defined and justified ex-Ante
Pre-processing to ensure data quality
Pre-processing choices regarding Data transformation or Outlier exclusion need to be justified and documented.

Question 4

Q

Answer

A

( in Trading only out-of-sample data is live trading data )

Question 5

Q

Answer

A

Question 6

Q

Answer

A

Beware of dimensionality curse
( more predictor variables == more data needed )
Aim for simplicity and Regularization and interpretability ( ML application should not be a black box )

Question 7

Q

Answer

A

Establish a Research Culture That Rewards Quality Science
Be Careful with Delegated Research (assistants have incentive to support supervisor hypothesis)

Question 8

Q

Protocol Summary

Answer

A

(8 cards)