Data Analysis week 4 Flashcards
What is the difference between a point hypothesis and an interval hypothesis
A point hypothesis is of the form statistic = value. An interval hypothesis is of the form statistic is in range …
What is a condition for a hypothesis to be testable
It needs to be quantified
What does hypothesis testing quantify
It quantifies how unusual the data is, assuming the null hypothesis is true
What are the steps in hypothesis testing
You propose a number. You get a number as result of testing with a sample. You compare the proposed number to the resulted number out of tests. You search for evidence that the proposed number is not correct.
What is the difference between the null hypothesis and the alternative hypothesis
The null hypothesis is often a point hypothesis and is interesting to reject. The alternative hypothesis is often the interval of everything but the value of the null hypothesis.
What is a test statistic
A number calculated from the sample data used to compare test results with the expected result (null hypothesis).
What happens when the data of the test results is consistent with the null hypothesis
Then you say “we failed to reject the null hypothesis”.
What is the null distribution
The sampling distribution of the outcomes of a test statistic under the assumption that the null hypothesis is true.
What is a p-value and what does it depend on
A p-value is the probability that a value at least as extreme as x is observed if the null hypothesis is true. It is the strength of the evidence that the null hypothesis is false. It is also a level of surprise about how surprised we are about the test results. The p-value is dependent on the null hypothesis and an observed sample statistic x.
What does at least as extreme as mean (in a two sided test)
It means at least as different as the value of x from the value of the null hypothesis
What is the significance level (alpha)
The criteria for rejecting the null hypothesis.
When is the null hypothesis rejected
If the p-value is smaller than the significance level.
What do you need to calculate the p-value
The null distribution
How do you calculate the p-value
Create the null distribution by shifting the dataset so the null hypothesis becomes true. Create the sampling distribution of this dataset.