L8 - Statistical Power P1 Flashcards
Statistical Significance Testing requires that we have a hypothesis about something we are interested in.
What does hypothesis refer to in this sense?
A hypothesis is an assertion about some population/property parameter that we are interested in in the world
At it’s core level, what is significance testing?
The traditional means of deciding whether to reject an hypothesis or not
Why is significance testing necessary to accept/reject a hypothesis?
This is because we test our hypothesis by sampling from the population
We cannot test the entire population.
In significance testing, when would we reject the hypothesis?
If our sample statistic differs significantly from that specified by the hypothesis, we reject the hypothesis.
Otherwise, we continue to entertain the hypothesis
What is the type of hypothesis that we conventionally use for significance testing?
The Null Hypothesis
The null hypothesis (H0) states that there is ___ effect
No effect
If the null hypothesis is rejected then the _______ hypothesis is supported
Alternative (H1) hypothesis
The alternative hypothesis is strictly NOT H0
How do you frame the null hypothesis so you can test it?
You identify a set of values you would expect if the null hypothesis were true.
You test the data against this distribution of values.
Once seeing your results, you can reject or retain the null hypothesis.
Rejecting the null hypothesis means that your hypothesis is supported.
True or False
False
When you reject the null, you are supporting the alternative hypothesis.
The alternative hypothesis is strictly every possible hypothesis that is not the null.
Not what you happen to think is the way of the world.
What is the sampling distribution of the mean?
If you were to do your test about your hypothesis over and over, the test results all vary in some ways.
What you are left with in your results is a normal distribution of your samples.
What are some of the features of the sampling distribution of the mean?
Looks like a normal distribution
Symmetrical
It’s mean is the estimate of the mean of the population
It’s SD is referred to as the “standard error of the mean”
We can know the proportions of values we would expect at any part of the curve.
How do we use sampling distribution of the mean to accept or reject the null hypothesis?
Because we know what the null should look like, if our mean sampling distribution looks like the null, then we retain the null.
Values that are close to the mean have high probability. Values at the ends of the distribution have low probability. If our values fall in the very low category, chances are it’s the alternative hypothesis.
What is the critical region in NHST?
The cutoff for rejecting the null hypothesis
Typically 2.5% on each side (p = .05)
A 5% level of significance means that if our data falls within that 5% critical region (2.5% on each side) our results are…
Statistically significant, we reject the null hypothesis
A z-score of 1 means…
The results are 1 standard deviation away from the mean.
z-scores represent amount of standard deviation away from the mean
When setting up null-hypothesis significance testing, are we trying to prove our hypothesis?
No, we are comparing our results to what we would expect if there were no effect.
If the results differ, we reject the null. This does not mean our conclusions are accurate, only that there is an effect somewhere.
How do we determine the “critical region” where we reject the null hypothesis?
The p value.
It’s arbitrary we decide before doing the study.
In NHST there are 4 outcomes of whether we are right or wrong in our conclusions (2 right and 2 wrong).
What are they?
- We reject the null hypothesis when the null hypothesis is false, then we have drawn the correct conclusion.
- We fail to reject the null when the null hypothesis is true, then we have drawn the correct conclusion
- The null is true but we reject the null. Type 1 error
- The null is false, but we retain the null. Type 2 error
Why are type 1 errors seen as worse than type 2 errors typically?
Type 1 errors mean that you believe there is an effect when there is none, so future researchers will be operating on false pretenses.
Type 2 errors are still bad, as we believe that there is no effect when there is one (e.g. trying to cure a disease, but we believe the treatment doesn’t work but it does)