Chapters 7&8 Flashcards
Sampling Error
The natural discrepancy, or amount of error, between a sample statistic and its corresponding population parameter.
• a random sample may not be representative of the entire population
- extreme scores are not likely included in the sample
• difference between the sample statistic and population parameter is sampling error
• two samples from the same population will be different
- different individuals, scores, means
Distribution of Sample Means
• a collection of all sample means (M) for all possible random samples of a specific size (n) from a population
- an example of a sampling distribution
- sampling distribution of M
- primary use is to find the probability associated with any specific sample
• to calculate the distribution we need to know all the possible samples
• How would we construct a distribution of sample means?
- select a random sample of a specific size (n), and calculate the mean
- place the sample mean in a frequency distribution
- repeat again, and again, and again, until you have the complete set of all the possible random samples
Sampling Distribution
A distribution of statistics obtained by selecting all of the possible samples of a specific size from a population.
Sample Means: Characteristics
(1) The sample means should pile up around the population mean.
(2) The pile of sample means should tend to form a normal-shaped distribution.
(3) In general, the larger the sample size, the closer the sample means should be to the population mean.
Central Limit Theorum
For any population with mean (μ) and standard deviation (σ), the distribution of sample means for sample size (n) will have a mean of μ and a standard deviation of σ/ √n and will approach a normal distribution as n approaches infinity.
• For any population with a mean μ and a standard deviation σ
- expected value of M
- the sample mean distribution will be a normal distribution if: n ≥ 30 and/or the population has a normal distribution
- standard error of M
• The sample mean M will unlikely be equal to the population mean μ
- standard error = average distance between M and μ
• the larger the sample (n), the more likely M = μ
- the law of large numbers
• the larger the sample (n), the smaller the standard error
Distribution of sample means is almost perfectly normal if either of the following two conditions is satisfied:
(1) The population from which the samples are selected is a normal distribution.
(2) The number of scores (n) in each sample is relatively large, around 30 or more.
Expected Value of M
The mean of the distribution of sample means is equal to the mean of the population of scores, μ.
- distribution of sample means will have a mean of μ
Sample Means are an example of:
an unbiased statistic
Standard Error of M
σm = σ/√n = √σ2/√n = square root of σ2/n
- distribution of sample means will have a standard deviation of σ/ n
The standard error provides a measure of how much distance is expected on average between a sample mean (M) and the population mean (μ). - The σ indicates that this value is a standard deviation, and the subscript m indicates that it is the standard deviation for the distribution of sample means.
- Is an extremely valuable measure because it specifies precisely how well a sample mean estimates its population mean—that is, how much error you
should expect, on the average, between M and μ.
The standard error serves the same two purposes for the distribution of sample means:
(1) The standard error describes the distribution of sample means.It provides a measure of how much difference is expected from one sample to another. When the standard error is small, then all of the sample means are close together and have similar values. If the standard error is large, then the sample means are
scattered over a wide range and there are big differences from one sample to another.
(2) Standard error measures how well an individual sample mean represents the entire distribution. Specifically, it provides a measure of how much distance is reasonable to expect between a sample mean and the overall mean for the distribution
of sample means. However, because the overall mean is equal to μ, the standard error also provides a measure of how much distance to expect between a sample mean (M) and the population mean (μ).
The magnitude of the standard error is determined by two factors:
(1) the size of the sample
- law of large numbers
(2) the standard deviation of the population from which the sample is selected.
- When n = 1, σm = σ (standard error = standard deviation).
Law of Large Numbers
The larger the sample size (n), the more probable it is that the sample mean is close to the population mean.
- the error between the sample mean and the population mean should decrease
A z-Score for Sample Means
(1) we are finding the location for a sample mean (M) rather than a score (X).
(2) the standard deviation for the distribution of sample means is the standard error, σm.
With these changes, the z-score formula for locating a sample mean is:
z = M - μ / σm
Hypothesis Test
A statistical method that uses sample data to evaluate a hypothesis about a population.
- inferential process
The Unknown Population
The researcher begins with a known population. This is the set of individuals as they exist before treatment.
The unknown population, after treatment, is the focus of the research question.
Null Hypothesis
The null hypothesis (H0) states that in the general population there is no change, no difference, or no relationship.
In the context of an experiment, H0 predicts that the independent variable (treatment) has no effect on the dependent variable (scores) for the population.
Alternative Hypoethesis
The alternative hypothesis (H1) states that there is a change, a difference, or a relationship for the general population.
In the context of an experiment, H1 predicts that the independent variable (treatment) does have an effect on the dependent variable.
The distribution of sample means is then divided into two sections:
(1) Sample means that are likely to be obtained if H0 is true; that is, sample means that are close to the null hypothesis
(2) Sample means that are very unlikely to be obtained if H0 is true; that is, sample means that are very different from the null hypothesis
Alpha Level / Level of Significance
α
The alpha level, or the level of significance, is a probability value that is used to define the concept of “very unlikely” in a hypothesis test.
- The alpha level for a hypothesis test is the probability that the test will lead to a Type I error if the null hypothesis is true.
Critical Region
The critical region is composed of the extreme sample values that are very unlikely (as defined by the alpha level) to be obtained if the null hypothesis is true.
The boundaries for the critical region are determined by the alpha level.
If sample data fall in the critical region, the null hypothesis is rejected.
Can also define the critical region as sample values that provide convincing evidence that the treatment really does have an effect.
The Boundaries of the Critical Region
To determine the exact location for the boundaries that define the critical region, we use the alpha-level probability and the unit normal table.
In most cases, the distribution of sample means is normal, and the unit normal table provides the precise z-score location for the critical region boundaries.
The z-score formula for a sample mean
In the formula, the value of the sample mean (M) is obtained from the sample data, and the value of m is obtained from the null hypothesis. Thus, the z-score formula can be expressed in words as follows:
z = sample mean hypothesized - population mean / standard error between M and μ
a.k.a.
z = M - μ / σm
Collect data and compute sample statistics
(1) The data are collected after the researcher has stated the hypotheses and established the criteria for a decision.
- helps to ensure that a researcher makes an honest, objective evaluation of the data and does not tamper with the decision criteria after the experimental outcome is known.
(2) Next, the raw data from the sample are summarized with the appropriate statistics
- Now it is possible for the researcher to compare the sample mean (the data) with the null hypothesis.
- The comparison is accomplished by computing a z-score that describes exactly where the sample mean is located relative to the hypothesized population mean from H0.