NHST & Sampling Flashcards
Define sampling error.
The difference between the population value of interest and the sample value. Can be any quality or property of the data, like variance or mean; occurs bc the sample only represents an estimation of the actual population data.
Define sampling distributions
The distribution of a sample statistic (e.g., a mean) when sampled under known sampling conditions from a known population. Effectively the same statistic overlaid across several different trials.
Define the null hypothesis
Any difference b/w sample and population statistic is due to sampling error. The sample and population both represent the same quantity; there is unlikely to be any real difference.
Define the alternative hypothesis
Any difference b/w sample and population statistic is probably not the result of sampling error. The sample and population do not represent the same quantity.
Define NHST
Significance tests are a broad set of quantitative techniques for evaluating the probability of observing the data under the assumption that the null hypothesis is true. Lets us decide if the null hypothesis is more probable than the alternative hypothesis.
Define statistical power
The probability of rejecting the null hypothesis when it is false, or correctly rejecting the null hypothesis.
Define a p-value
A probability value used to determine how likely it is to observe certain values based on sample error alone.
Define alpha value
The probability of rejecting the null hypothesis when it is true; called a significance level. Effectively the cutoff percentage for the risk of erroneously rejecting the null.
Explain why the mean is an unbiased statistic and the variance is a biased statistic
- The mean is an unbiased statistic bc the typical sample mean is equal to the mean of the population. Any sample mean that differs from the population mean is equally likely to be arbitrarily high or low.
- The variance is a biased statistic bc the expected sample variance is usually smaller than the population variance. It does not capture the same value as population statistic.
Explain why sampling error occurs
Sampling error occurs bc the sample only represents an estimation of the actual population data. The sample could have different properties from the population, or misrepresent it.
Explain what two problems sampling error causes in psychological research
1.) Our sample values might not be equal to the population values.
2.) Because of this obfuscation, we can run into a number of difficulties testing scientific hypotheses.
Explain what the sampling distribution for the t-test is based on
T-test distribution: the sampling distribution is based on drawing random samples with known parameters. The means are then compared in relation to the assumed population mean; this lets us find the difference b/w the expected sample mean of the distribution and of the population when a sampling error is made.
Explain what the sampling distribution for the ANOVA is based on
ANOVA distribution: the sampling distribution is based on the ratio of the population variance as estimated between groups vs. within groups.
Explain what the standard error of the mean is conceptually. What does the formula tell you about the relationship between sample size and sampling error?
Conceptually, the SEM is the standard deviation of a sampling distribution. The equation for SEM tells you that sampling error decreases as sample size increases.
Explain the basic logic of NHST.
If we make certain assumptions about the population (e.g., mu = 3) and the sampling process (e.g., random sampling, N= 25), we can determine:
a. ) the expected sample mean.
b. ) the expected difference between an observed sample mean and the population mean when a sampling error is made.
This means that we are evaluating a mean difference (Z-test), relative to how much we would expect means to differ on average (SEM).