Chapter 13 - Inferential Statistics Flashcards
Parameters
Corresponding values in the population
Sampling error
The random variability in a statistic from sample to sample
Note that the term error here refers to random variability and does not imply that anyone has made a mistake. No one “commits a sampling error.”)
Interpreting Possibilities of statistical relationships in a sample
In fact, any statistical relationship in a sample can be interpreted in two ways:
There is a relationship in the population, and the relationship in the sample reflects this.
There is no relationship in the population, and the relationship in the sample reflects only sampling error.
The purpose of null hypothesis testing is simply to help researchers decide between these two interpretations.
NHST
A formal approach to deciding between two interpretations of a statistical relationship in a sample.
Null Hypothesis
The idea that there is no relationship in the population and that the relationship in the sample reflects only sampling error (often symbolized H0 and read as “H-zero”).
Alternative Hypothesis
An alternative to the null hypothesis (often symbolized as H1), this hypothesis proposes that thereisa relationship in the population and that the relationship in the sample reflects this relationship in the population.
Steps of NHST
Although there are many specific null hypothesis testing techniques, they are all based on the same general logic. The steps are as follows:
Assume for the moment that the null hypothesis is true. There is no relationship between the variables in the population.
Determine how likely the sample relationship would be if the null hypothesis were true.
If the sample relationship would be extremely unlikely, thenrejectthenullhypothesisin favor of the alternative hypothesis. If it would not be extremely unlikely, thenretainthenullhypothesis.
P value
A crucial step in null hypothesis testing is finding the probability of the sample result or a more extreme result if the null hypothesis were true (Lakens, 2017).[1]This probability is called thepvalue. A lowpvalue means that the sample or more extreme result would be unlikely if the null hypothesis were true and leads to the rejection of the null hypothesis. Apvalue that is not low means that the sample or more extreme result would be likely if the null hypothesis were true and leads to the retention of the null hypothesis.
How low does a p value have to be before sample result is considered unlikely enough to reject null hypothesis?
But how low must thepvalue criterion be before the sample result is considered unlikely enough to reject the null hypothesis? In null hypothesis testing, this criterion is calledα(alpha)and is almost always set to .05. If there is a 5% chance or less of a result at least as extreme as the sample result if the null hypothesis were true, then the null hypothesis is rejected. When this happens, the result is said to bestatistically significant. If there is greater than a 5% chance of a result as extreme as the sample result when the null hypothesis is true, then the null hypothesis is retained.
Misunderstanding the P value
Thepvalue is one of the most misunderstood quantities in psychological research (Cohen, 1994)[2]. Even professional researchers misinterpret it, and it is not unusual for such misinterpretations to appear in statistics textbooks!
The most common misinterpretation is that thepvalue is the probability that the null hypothesis is true—that the sample result occurred by chance. For example, a misguided researcher might say that because thepvalue is .02, there is only a 2% chance that the result is due to chance and a 98% chance that it reflects a real relationship in the population. But this isincorrect. Thepvalue is really the probability of a result at least as extreme as the sample resultifthe null hypothesisweretrue. So apvalue of .02 means that if the null hypothesis were true, a sample result this extreme would occur only 2% of the time.
You can avoid this misunderstanding by remembering that thepvalue is not the probability that any particularhypothesisis true or false. Instead, it is the probability of obtaining thesample resultif the null hypothesis were true.
clarifying what it means to test the null hypothesis
Recall that null hypothesis testing involves answering the question, “If the null hypothesis were true, what is the probability of a sample result as extreme as this one?” In other words, “What is thepvalue?”
Interpretive skill
weak relationships based on medium or small samples are never statistically significant and that strong relationships based on medium or larger samples are always statistically significant. If you keep this lesson in mind, you will often know whether a result is statistically significant based on the descriptive statistics alone. It is extremely useful to be able to develop this kind of intuitive judgment. One reason is that it allows you to develop expectations about how your formal null hypothesis tests are going to come out, which in turn allows you to detect problems in your analyses. For example, if your sample relationship is strong and your sample is medium, then you would expect to reject the null hypothesis.
A statistically _____ result is not necessarily a ____ one.
significant, strong
practical significance
Practicalsignificancerefers to the importance or usefulness of the result in some real-world context. Many sex differences are statistically significant—and may even be interesting for purely scientific reasons—but they are not practically significant. In clinical practice, this same concept is often referred to as “clinical significance.”
T test
The most common null hypothesis test for this type of statistical relationship is thet-test.
The t-test is a test that involves looking at the difference between two means.
One Sample t-test
Used to compare a sample mean (M) with a hypothetical population mean (μ0) that provides some interesting standard of comparison
Test Statistic
A statistic (e.g.,F,t, etc.) that is computed to compare against what is expected in the null hypothesis, and thus helps find the p value.
Critical Value
The absolute value that a test statistic (e.g.,F,t, etc.) must exceed to be considered statistically significant.