Part 6. Hypothesis Testing Flashcards
Hypothesis testing
The part of statistical inference, the process of making judgments about a larger group (a population) based on a smaller group of observations (sample).
Definition:
We test to see whether a sample statistic is likely to come from a population with the hypothesized value of the population parameter.
Statistical inference
- Estimation
2. Hypothesis testing
Hypothesis
A statement about one or more populations that we test using sample statistics.
The process of hypothesis testing
- State the hypothesis.
- Identify the appropriate test statistic.
- Specify the level of significance.
- State the decision rule.
- Collect data and calculate the test statistics.
- Make a decision.
Null hypothesis (H0)
A statement concerning a population parameter or parameters considered to be true, unless the sample we use to conduct the hypothesis test gives convincing evidence the null hypothesis is false.
The statement we want to reject, in favour of alternative hypothesis, Ha.
Level of significance
The probability of Type I error in testing hypothesis denoted by alpha, a.
i.e. probability of incorrectly rejecting the true H0.
Confidence level
The complement of level of significance; 1-a.
i.e. 5% probability of rejecting a true H0, corresponds to 95% confidence level.
Type 1 and 2 error dilemma
- Both errors involve a trade off
- If we decrease the probability of Type 1 error by specifying a smaller significance level (1% instead of 5%), we increase the probability of making a Type 2 error as will reject H0 less frequently.
- To reduce the probability of both types of errors simultaneously is to increase sample size, n.
Power of test
The probability of correctly rejecting the null, the probability of rejecting the null when it is false.
The complement of Type 2 error.
Critical values
The action of comparing calculated test statistic with specified value or values.
Statistically significant
Finding result of calculated value of test statistic is more extreme than critical value/values, we reject H0.
Collecting data considerations:
- Ensure that the sampling procedure does not include biases, such as sample selection or time bias.
- We need to cleanse the data, checking inaccuracies and other measurement errors in data.
Once assured sample is unbiased and accurate, the sample info is used to calculate the appropriate test statistic.
Make a decision
- Statistical decision:
- consider a test of mean risk premium, comparing population mean with zero using bounds. - Economic decision:
- Considers statistical decision, but all pertinent economic issues, i.e. reject H0 that risk premium is zero for greater than zero. This is economically meaningful that investor commit funds to US equities.
- Non-statistical considerations such as investors tolerance for risk and financial position.
- Statistical significance, not economic?
- The smaller the standard error of mean, the larger value of t-stat and greater chance H0 rejected all else equal.
- Standard error decreases as sample size, n, increases, so that for large samples we can reject H0 for small departures from it.
- There is a statistically positive mean return, results may not be economically significant when accounting for transaction costs, taxes, and risk.
P-Value
The area in the probability distribution outside the calculated test statistic, for a 2-sided test this is the area outside +- the calculated test statistic.
For one sided test, this is the area outside the calculated test statistic on the appropriate side of the probability distribution.
The p value is the smallest level of significance at which the null hypothesis can be rejected.
P-value
The smallest level of significance at which null hypothesis can be rejected.
The area in the probability distribution outside the calculated test statistic for a two-sided test, the area outside +- the calculated test statistic.
For one sided test, this is the area outside the calculated test statistic on the appropriate side of probability distribution.