Hypothesis Testing Flashcards
What is the null hypothesis?
There is no statistical significant difference between specified population, any observed difference is due to sampling or experimental error (chance).
What do we ask when we conduct a hypothesis test?
1) is the value of the test statistic extreme enough for us to reject the null hypothesis?
2) what is our test statistic
3) what should the distribution of the test statistic be if the null hypothesis is true.
What is the p-value?
The probability of obtaining a particular value of the test statistic or a more extreme value if the null hypothesis is true.
What is alpha?
This is a pre determined level of significance which enables us to decide whether to reject the null hypothesis.
If the p-value is <= alpha we regard it as significant band reject the null hypothesis.
If the p-value is >= alpha we fail to reject the null hypothesis
What is a Type I error?
This is when the null hypothesis is actually true but we reject it.
What is a Type II error?
This is when the null hypothesis is false but we fail to reject it.
What is the probability of making a Type I error?
This is equal to the value of alpha (significance threshold)
Write a null hypothesis and a hypothesis for a one sample T-test?
Null hyp: the mean of this sample is equal to the hypothesised value
Hyp: The mean of this sample is different from the hypothesised value.
In a T-test in which the null hypothesis is true where should the value of t come from?
The value of t should come from a t- distribution with n-1 degrees of freedom
When would you use a Paired T-test?
When there is a pair of measurements per subject and you are measuring a before and after change.
(We still only have n independent data points even if we have 2n numbers)
What distribution is tested in a paired T-test
Difference between pairs of values are calculated and then distribution of differences is tested to see if it is equal to 0.
(It’s a one sample T-test where the hypothesised value is always zero)
What is a general null hypothesis and hypothesis for a 2 sample T-test.
Null hyp: the means of these two samples are the same
Hyp: the means of these two samples are different
(We are looking at the difference between two mean)
(Difference between two means also has a sampling distribution)
What does a 2 sample T-test assume?
1) The data is normally distributed
2) The variances of the two groups are equal
If the null hypothesis is true where should the t value come from?
1) For a single sample T-test
2) For a 2 sample T-test
The t-value should come from a t- distribution with n-2 degrees of freedom (where n is the total number of data points from both groups)
1) When is a 2 sample T-test robust?
2) What are the exceptions?
1) When sample sizes of both groups are >= 30 (provided groups show equal variances)
2) if each group being compared have similar numbers in each group then it’s ok to use the test even when the standard deviations in the two groups differ by up to 3 fold
What is a non parametric substitute for the 2 sample T-test?
Mann Whitney Wilcoxon test