Quant: Hypothesis Testing Flashcards
What is a hypothesis?
A statement about the value of a population parameter developed for the purpose of testing a theory or belief.
Steps in Hypothesis Testing Procedure =
State the hypothesis
Choose appropriate test statistic
Specify the level of significance
State the decision rule regarding the hypothesis
Take a sample, calculate sample stats
Make a decision regarding the hypothesis
Make a decision based on the results of the test
Null hypothesis =
what we want to REJECT.
It is what is ACTUALLY tested, and is the BASIS for the test statistic chosen
always includes the “equal to” condition
Alternative hypothesis =
what we actually want to be TRUE (most of the time)
we conclude this if there is sufficient evidence to REJECT the null hypothesis
One tailed vs Two tailed tests =
one sided will be used to show that a value is greater than a single value.
a two tailed test will have 2 critical values/rejection points.
Test Statistic =
is a RANDOM VARIABLE that follows a T, Z, CHI-SQUARE or F distribution
calculated by comparing the point estimate of the population parameter and the hypothesized value of the parameter.
We then compare the TEST STATISTIC to the CRITICAL VALUE of the test statistic.
population
sample
sample statistic
WHAT IS THE CONNECTION?
we want to draw an inference regarding the parameter of a POPULATION. We take a SAMPLE of the population. We compute a SAMPLE STATISTIC. We draw our INFERENCE.
HOWEVER there is a chance that this SAMPLE is not representative of the popuilation–> ERROR!
when drawing inferences from a population -TYPE I ERROR =
rejection of the null hypothesis when it is actually true.
THE SIGNIFICANCE LEVEL is the chance of making a type 1 error - ie 5% significance level, means there is a 5% chance of rejecting the null when it should not be.
when drawing inferences from a population -TYPE II ERROR =
failure to reject the null hypothesis when it is actually false
chance of type II error INCREASES/power of test DECREASES with
- decrease in level of significance (ie from 5% to 1%)
- decrease in sample size
the POWER of the test =
1- P(making a TYPE II ERROR)
where a type II error is failing to reject the null when it is false.
We can use the power of the test to compare two different test statistics (and typically choose the one with THE GREATEST POWERS)
The power of the test increases whenyou reduce the chance of a type two error. This happens when you increase the significance level (ie. 1% to 5%), which increases the chance of making a type 1 error.
Seems we can take P.O.T as “the usefulness of the test in rejecting the null’
CHEAT CARD, T1 & 2 ERRORS, SIGNIFICANCE LEVELS AND POWER OF THE TEST=
Confidence interval =
A range of values within which the researcher believed the true population parameter may lie.
A 95% confidence interval uses a critical value associated with a particular distribution at a 5% level of significance.
A hypothesis test would compare a test statistic to a critical value at a 5% level of significance.
NB. standard error = s/√n
Statistical results vs economically meaningful results =
a statisitically significant result does not necessarrily mean an economically meaningful one: transactions costs, taxes and risk are not normally considered in the statistical test.
“Any of these factors could make committing funds to a strategy unattractive, even though the statistical evidence of positive returns is highly significant. By the nature of statistical tests, a very large sample size can result in highly (statistically) significant results that are quite small in absolute terms.”
P-value =
probability of obtaining a test statistic that would lead to the rejection of the null hypothesis when it is actually TRUE. Smallest level of significance for which the null hypothesis can be rejected.
take the test statistic, find the corresponding significance level from the distributiont (multiply by two if a two tailed test)
P- VALUE WILL SHOW THE LOWEST LEVEL OF SIGNIFICANCE FOR WHICH WE WOULD ACCEPT/REJECT THE NULL (THE THRESHOLD) - WHAT THE PROBABILITY IS THAT A VALUE WOULD BE GREATER THAN THE TEST STAT
t-test =
Use t-test when you could use a t distribution (DO YOU KNOW WHAT THE NECESSARY CONDITIONS ARE?)