Hypothesis Testing in AI Flashcards
What are the 4 steps in hypothesis testing?
- State the hypothesis
- Design the statisitcal test
- Calculate the test statistic
- Reject or fail to reject the null hypothesis
State, Design,Calcuate, R or A
What is a test statstic?
It is a function of the observed values of the random variables of interest
What is meant by the p-value?
The probability in the tail
The probabability of observing a sample estimate as extreme as the one observed, assuming the null hypothesis is true.
What is meant by the level of significance?
The probability of making a Type I error
What is Type I error?
What is Type II error?
There are 2 types of errors,
Type I: Wrongly rejecting a true null hypothesis
Type II; Accept (Fail to reject) an unture null hypothesis.
The probability of a Type I error is the level of significance.
What are the reasons for unrepresentative datasets?
- Selection bias (leaving certain obervations out);
- Survivorship bias (leave out funds no longer in existence);
- Self selection bias (Fund Managers leave out certain funds)
The 3 S’s
Data mining
Data dredging (Data snooping)
Data mining; vigorously testing until valid relationships are established
Data dredging: Overuse of statistical tests or Running hundreds of statistical tests to identify significant relationship without regard to economic rationale.
Backtesting, overfitting and Backfilling
- Backtesting: Apply models on historical data to determine how well the model would have explained the actuals results: Good sometimes
- Overfitting;When many parameters are used to a model in histotical data;Not good
- Backfilling: Updating databases by inserting returns that pre-date the date of entry in the database. Backfill bias is also known as instant history bias. Generally not good
Cherry picking and chumming
- Cherry picking ; Selectivley reporting results;
- Chumming: If enough predictions are made, some will be correct
Why are cumulative return charts deceptive?
How is deception avoided?
Because of compounding effects. Gap widens with an early period advantage.
Deception is avoided by using cumulative log chart becase log returns are additive, not multiplicative.