week 3 Sampling distributions and Hypothesis testing Flashcards
Sampling Error
Variability due to chance. Amount of deviation between a sample mean and a population mean. The Sampling Error is a concept.
Sampling distributions
identifies the degree of variability between samples. The more random samples drawn, the closer the sample means will approximate a normal distribution (if is normally distributed).
n.b. “Distribution” refers to the set of values obtained from any set of observations, whereas “Sampling distribution” is reserved for the distribution of outcomes of a sample statistic.
An example of a sample distribution is if have many samples, and a mean for each. And if we plot all the sample means, we will generate a sampling distribution.
Standard Error
Standard error=the standard deviation of a set of means. The degree to which a sample mean varies from the population mean. Standard error=S/ (square root of N)
where S here is the standard deviation of a set of means
Note that larger samples have less standard error than smaller ones.
The Standard Error can be calculated to estimate how much difference we might expect between a sample and population mean, purely due to chance. The Standard Error is a precise calculation.
Hypothesis testing
Use sample distribution of the mean to test whether there is a statistically significant difference between the sample mean and the population mean.
A hypothesis test either tells us if there is a significant difference or not, or, whether a relationship is reliable.
Null Hypothesis
Ho
The null hypothesis states that there is no satistically significant difference between groups or sample mean and population mean. Assess data and determine if we reject the null hypothesis or if we fail to reject the null hypothesis.
Null hypothesis decision using Z-score
Only applicable for a normal distribution.
- Convert to z-score.
- consult table, when z=… smaller portion table tells us…
- At alpha level 0.05, if smaller portion value is =/< 0.05, then the null hypothesis is rejected.
rejection level
if set alpha or rejection level at 0.05 then this means there is a 5% degree of error. This is the probability of saying there is a difference, when in fact there is not.
Standardly, a one-tail test is to reject or accept at one tail of the distribution, and has alpha of 0.05. A direction is predicted in a one-tail test hypothesis eg there will be a higher than population mean in the sample mean, due to….
A 2-tail test does not predict direction, merely that there is a difference. But if we set alpha at 0.05 at both ends of a normal distribution, this results in a total of 0.1 error or 10% which is considered to gtreat. Therefore normally set 0.025 rejection at each end of the test.
ie usually if p= 0.05, the null hypothesis is rejected.
Type 1 error
when we erroneously reject the null hypothesis. (controlled by where we set rejection level)
Type 2 Error
When we fail to reject the null hypothesis and the null hypothesis is false (ie it should be rejected).
This relates to how powerful the hypothesis testing is.