Chapters 7&8 Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Sampling Error

A

The natural discrepancy, or amount of error, between a sample statistic and its corresponding population parameter.
• a random sample may not be representative of the entire population
- extreme scores are not likely included in the sample
• difference between the sample statistic and population parameter is sampling error
• two samples from the same population will be different
- different individuals, scores, means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Distribution of Sample Means

A

• a collection of all sample means (M) for all possible random samples of a specific size (n) from a population
- an example of a sampling distribution
- sampling distribution of M
- primary use is to find the probability associated with any specific sample
• to calculate the distribution we need to know all the possible samples
• How would we construct a distribution of sample means?
- select a random sample of a specific size (n), and calculate the mean
- place the sample mean in a frequency distribution
- repeat again, and again, and again, until you have the complete set of all the possible random samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Sampling Distribution

A

A distribution of statistics obtained by selecting all of the possible samples of a specific size from a population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sample Means: Characteristics

A

(1) The sample means should pile up around the population mean.
(2) The pile of sample means should tend to form a normal-shaped distribution.
(3) In general, the larger the sample size, the closer the sample means should be to the population mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Central Limit Theorum

A

For any population with mean (μ) and standard deviation (σ), the distribution of sample means for sample size (n) will have a mean of μ and a standard deviation of σ/ √n and will approach a normal distribution as n approaches infinity.
• For any population with a mean μ and a standard deviation σ
- expected value of M
- the sample mean distribution will be a normal distribution if: n ≥ 30 and/or the population has a normal distribution
- standard error of M
• The sample mean M will unlikely be equal to the population mean μ
- standard error = average distance between M and μ
• the larger the sample (n), the more likely M = μ
- the law of large numbers
• the larger the sample (n), the smaller the standard error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Distribution of sample means is almost perfectly normal if either of the following two conditions is satisfied:

A

(1) The population from which the samples are selected is a normal distribution.
(2) The number of scores (n) in each sample is relatively large, around 30 or more.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Expected Value of M

A

The mean of the distribution of sample means is equal to the mean of the population of scores, μ.
- distribution of sample means will have a mean of μ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Sample Means are an example of:

A

an unbiased statistic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Standard Error of M

A

σm = σ/√n = √σ2/√n = square root of σ2/n

  • distribution of sample means will have a standard deviation of σ/ n
    The standard error provides a measure of how much distance is expected on average between a sample mean (M) and the population mean (μ).
  • The σ indicates that this value is a standard deviation, and the subscript m indicates that it is the standard deviation for the distribution of sample means.
  • Is an extremely valuable measure because it specifies precisely how well a sample mean estimates its population mean—that is, how much error you
    should expect, on the average, between M and μ.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

The standard error serves the same two purposes for the distribution of sample means:

A

(1) The standard error describes the distribution of sample means.It provides a measure of how much difference is expected from one sample to another. When the standard error is small, then all of the sample means are close together and have similar values. If the standard error is large, then the sample means are
scattered over a wide range and there are big differences from one sample to another.
(2) Standard error measures how well an individual sample mean represents the entire distribution. Specifically, it provides a measure of how much distance is reasonable to expect between a sample mean and the overall mean for the distribution
of sample means. However, because the overall mean is equal to μ, the standard error also provides a measure of how much distance to expect between a sample mean (M) and the population mean (μ).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The magnitude of the standard error is determined by two factors:

A

(1) the size of the sample
- law of large numbers
(2) the standard deviation of the population from which the sample is selected.
- When n = 1, σm = σ (standard error = standard deviation).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Law of Large Numbers

A

The larger the sample size (n), the more probable it is that the sample mean is close to the population mean.
- the error between the sample mean and the population mean should decrease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

A z-Score for Sample Means

A

(1) we are finding the location for a sample mean (M) rather than a score (X).
(2) the standard deviation for the distribution of sample means is the standard error, σm.
With these changes, the z-score formula for locating a sample mean is:
z = M - μ / σm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Hypothesis Test

A

A statistical method that uses sample data to evaluate a hypothesis about a population.
- inferential process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

The Unknown Population

A

The researcher begins with a known population. This is the set of individuals as they exist before treatment.
The unknown population, after treatment, is the focus of the research question.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Null Hypothesis

A

The null hypothesis (H0) states that in the general population there is no change, no difference, or no relationship.
In the context of an experiment, H0 predicts that the independent variable (treatment) has no effect on the dependent variable (scores) for the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Alternative Hypoethesis

A

The alternative hypothesis (H1) states that there is a change, a difference, or a relationship for the general population.
In the context of an experiment, H1 predicts that the independent variable (treatment) does have an effect on the dependent variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

The distribution of sample means is then divided into two sections:

A

(1) Sample means that are likely to be obtained if H0 is true; that is, sample means that are close to the null hypothesis
(2) Sample means that are very unlikely to be obtained if H0 is true; that is, sample means that are very different from the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Alpha Level / Level of Significance

A

α
The alpha level, or the level of significance, is a probability value that is used to define the concept of “very unlikely” in a hypothesis test.
- The alpha level for a hypothesis test is the probability that the test will lead to a Type I error if the null hypothesis is true.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Critical Region

A

The critical region is composed of the extreme sample values that are very unlikely (as defined by the alpha level) to be obtained if the null hypothesis is true.
The boundaries for the critical region are determined by the alpha level.
If sample data fall in the critical region, the null hypothesis is rejected.
Can also define the critical region as sample values that provide convincing evidence that the treatment really does have an effect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

The Boundaries of the Critical Region

A

To determine the exact location for the boundaries that define the critical region, we use the alpha-level probability and the unit normal table.
In most cases, the distribution of sample means is normal, and the unit normal table provides the precise z-score location for the critical region boundaries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

The z-score formula for a sample mean

A

In the formula, the value of the sample mean (M) is obtained from the sample data, and the value of m is obtained from the null hypothesis. Thus, the z-score formula can be expressed in words as follows:
z = sample mean hypothesized - population mean / standard error between M and μ
a.k.a.
z = M - μ / σm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Collect data and compute sample statistics

A

(1) The data are collected after the researcher has stated the hypotheses and established the criteria for a decision.
- helps to ensure that a researcher makes an honest, objective evaluation of the data and does not tamper with the decision criteria after the experimental outcome is known.
(2) Next, the raw data from the sample are summarized with the appropriate statistics
- Now it is possible for the researcher to compare the sample mean (the data) with the null hypothesis.
- The comparison is accomplished by computing a z-score that describes exactly where the sample mean is located relative to the hypothesized population mean from H0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Making a Decision

A

Comparing our treated sample with the distribution of sample means that would be obtained for untreated samples

25
Q

Possible Outcomes of Hypothesis Testing

A

(1) The sample data are located in the critical region.
- The sample data provide sufficient evidence to reject the null hypothesis and conclude that the treatment has an effect.
(2) The sample data are not in the critical region.
- The sample data do not provide enough evidence to reject the null hypothesis. In this case, you fail to reject H0 and conclude that the treatment does not appear to have an effect.

26
Q

Type 1 Error

A

(α)
• reject null hypothesis when it is actually true
• incorrectly conclude the treatment has an effect
• selecting an extreme sample by chance
• the alpha level determines the probability of this error occurring

27
Q

Type II Error

A

(ß)
• retain the null hypothesis when it is actually false
• the treatment effect really exists, but wasn’t detected
• sample mean not in the critical region even though treatment had an effect

28
Q

Primary Concern of Selecting an Alpha Level

A

Primary concern is to minimize the risk of a Type I error. Thus, alpha levels tend to be very small probability values.

29
Q

When you select a value for alpha at the beginning of a hypothesis test, your decision influences both of these functions:

A

(1) alpha level helps to determine the boundaries for the critical region by defining the concept of “very unlikely” outcomes.
(2) alpha level helps to determine the probability of a Type I error if the null hypothesis is true.

30
Q

Steps of Hypothesis Testing: Step 1

A
(1) State the two possible hypothesis
Null Hypothesis (H0): the treatment has no effect (no change)
- µ = 80 (no change)
Alternative Hypothesis (H1): the treatment has an effect
- µ ≠ 80 (treatment causes a change)
31
Q

Statistically Significant

A

A result is said to be significant, or statistically significant, if it is very unlikely to occur when the null hypothesis is true. That is, the result is sufficient to reject the null hypothesis. Thus, a treatment has a significant effect if the decision from the hypothesis test is to reject H0.

32
Q

The assumptions for hypothesis tests with z-scores

A

• Random Sampling
• Independent Observation
- two events (or observations) are independent if the occurrence of the first event has no effect on the probability of the second event.
• The value of σ is unchanged by the treatment
- we assume that the standard deviation for the unknown population (after treatment) is the same as it was for the population
> before treatment.
the effect of the treatment is to add a constant amount to (or subtract a constant amount from) every score in the population.
• Normal sampling distribution
- To evaluate hypotheses with z-scores, we have used the unit normal table to identify the critical region (only if distribution is normal)

33
Q

Factors that influence a hypothesis test

A

(1) The variability of the scores, which is measured by either the standard deviation or the variance. The variability influences the size of the standard error in the denominator of the z-score.
(2) The number of scores in the sample. This value also influences the size of the standard error in the denominator.

34
Q

Directional (One-Tailed) Hypothesis Test

A

In a directional hypothesis test, or a one-tailed test, the statistical hypotheses (H0 and H1) specify either an increase or a decrease in the population mean. That is, they make a statement about the direction of the effect.
• the two hypotheses would state:
H0: Test scores are not increased. (The treatment does not work.)
H1: Test scores are increased. (The treatment works as predicted.)

35
Q

A directional (one-tailed) test requires changes in the first two steps of the step-by-step hypothesis-testing procedure:

A

(1) In the first step, the directional prediction is included in the statement of the hypotheses.
(2) In the second step of the process, the critical region is located entirely in one tail of the distribution.

36
Q

One-Tailed VS Two-Tailed Tests

A
  • The major distinction between one-tailed and two-tailed tests is the criteria that they use for rejecting H0.
  • A one-tailed test allows you to reject the null hypothesis when the difference between the sample and the population is relatively small, provided that the difference is in the specified direction.
  • A two-tailed test, on the other hand, requires a relatively large difference independent of direction.
37
Q

Concerns about Hypothesis Testing

A

There are two serious limitations with using a hypothesis test to establish the significance of a treatment effect.
Concerns:
(1) that the focus of a hypothesis test is on the data rather than the hypothesis. Specifically, when the null hypothesis is rejected, we are actually making a strong probability statement about the sample data, not about the null hypothesis.
(2) that demonstrating a significant treatment effect does not necessarily indicate a substantial treatment effect. In particular, statistical significance doesn’t provide any real information about the absolute size of a treatment effect. Instead, the hypothesis test has simply established that the results obtained in the research study are very unlikely to have occurred if there is no treatment effect.

38
Q

Effect Size

A

A measure of effect size is intended to provide a measurement of the absolute magnitude of a treatment effect, independent of the size of the sample(s) being used.

39
Q

Cohen’s d

A

One of the simplest and most direct methods for measuring effect size is Cohen’s d.
Cohen (1988) recommended that effect size can be standardized by measuring the mean difference in terms of the standard deviation. The resulting measure of effect size is computed as
Cohen’s d = mean difference / standard deviation = μ treatment - μ no treatment / σ

40
Q

estimated Cohen’s d

A

estimated Cohen’s d = mean difference / standard deviation = M treatment - μ no treatment / σ
• One of the simplest and most direct methods for measuring effect size
• Cohen (1988) recommended that effect size can be standardized by measuring the mean difference in terms of the standard deviation.
• The sample mean is expected to be representative of the population mean and provides the best measure of the treatment effect. , thus it is the estimated Cohen’s d
• Cohen’s d measures the distance between two means and is typically reported as a positive number even when the formula produces a negative value.

41
Q

Statistical Power

A

The power of a statistical test is the probability that the test will correctly reject a false null hypothesis. That is, power is the probability that the test will identify a treatment effect if one really exists.
• usually calculate the power of a hypothesis test before they actually conduct the research study. In this way, they can determine the probability that the results will be significant (reject H0) before investing time and effort in the actual research.

42
Q

Power and Effect Size

A

As the effect size increases, the probability of rejecting H0 also increases, which means that the power of the test increases.
Thus, measures of effect size such as Cohen’s d and measures of power both provide an indication of the strength or magnitude of a treatment effect.

43
Q

Factors that Affect Power

A

(1) Effect Size
(2) Sample Size
(3) Alpha Level
(4) One-Tailed versus Two-Tailed Tests

44
Q

Power and Sample Size

A

one of the primary reasons for computing power is to determine what sample size is necessary to achieve a reasonable probability for a successful research study.
If the probability (power) is too small, they always have the option of increasing sample size to increase power.

45
Q

Power and Alpha Level

A

• Reducing the alpha level for a hypothesis test also reduces the power of the test.
• Moving the critical boundary to the right means that a smaller portion of the treatment distribution (the distribution on the right-hand side) will be in the critical region.
- Thus, there would be a lower probability of rejecting the null hypothesis and a lower value for the power of the test.

46
Q

Power and One-Tailed versus Two-Tailed Tests

A
  • Changing from a regular two-tailed test to a one-tailed test increases the power of the hypothesis test.
  • Moving the boundary to the left would cause a larger proportion of the treatment distribution to be in the critical region and, therefore, would increase the power of the test.
47
Q

Increasing Sample Size

A
  • increases the likelihood of rejecting the null hypothesis
  • has no effect on Cohen’s d
  • increases the power of the test
  • produces a smaller standard error in the denominator, which produces a larger z-score.
48
Q

To Include in a Report Statement

A

(i) independent variable
(ii) significant effect (H1) vs no significant effect (H0)
(iii) dependent variable
(iv) sample
(v) calculated mean
(vi) calculated z-score
(vii) proportion
(vii) effect size

(i) The blueberry supplements had a (ii) significant effect on the (iii) cognitive function of (iv) adults (v) (M = 92) (iv) over 65 years of age, (vi) z = +3.00, (vii) p < 0.05. (viii) The effect size was small, d=0.22.

49
Q

Most Likely to Have a Significant Effect if

A

sample number is high
alpha level is high
have a one-tailed test

50
Q

Steps of Hypothesis Testing: Step 2

A

2) Set the criteria for the Alternative Hypothesis
• Null Hypothesis (H0): the treatment has no effect (no change)
- sample mean should be near M = 80
• Alternative Hypothesis (H1): the treatment has an effect
- sample mean should be very different from 80 (M ≠ 80)
- determine this exact value using the alpha value or level of significance
> critical region: z critical i.e. zc= +- (over top of each other) 1.96
> We determine the alpha value: α = 0.05 or 5%; α = 0.01 or 1%; α = 0.001 or .01%
> extreme 5% of sample means:
- the 5% is split between the two tails of the distribution (2.5% each tail)
- look up in table z-scores for the critical region (2.5% of each tail)

51
Q

Steps of Hypothesis Testing: Step 3

A

Collect data and compute sample statistics
• calculate the sample mean
• calculate a z-score to determine where the sample mean is located in the distribution of sample means

52
Q

Steps of Hypothesis Testing: Step 4

A

4) Make a decision

• reject the null hypothesis or fail to reject the null hypothesis

53
Q

Steps of Hypothesis Testing: Step 5

A

Report the results of the statistical test

54
Q

What happens to the population after a treatment is given?

A
  • treatment is adding/subtracting a constant to each individual scores
  • the mean changes
  • the shape of the distribution does not change
  • the standard deviation does not change
55
Q

Factors that Influence Whether to Reject the Null Hypothesis

A

(1) the size of the z-score
• the larger the z-score, the further M is from µ
(2) the variability of the individual scores
• variability influences the size of the standard error
(3) the size of the sample
• the larger the sample (n), the smaller the standard error

56
Q

Assumptions for Hypothesis Testing

A

• Subjects are obtained using random sampling
- the sample must be representative of the population
• Independent Observations
- each measurement is not influenced by any other measurement
- typically satisfied using random sampling
• The treatment does not influence the variance
• The shape of the distribution is normal
- use z-scores only with a normal distribution of sample mean

57
Q

Directional Hypothesis Testing

A

• the alternative hypothesis (H1) specifies the direction of the treatment
• predicts either an increase or decrease
• one-tailed test
- the prediction is in one tail of the distribution of sample means
• allows for rejection of H0 when the difference between M and µ are small

58
Q

Measuring Effect

A
  • statistical significance does not describe the size of the effect
  • if a sample is large enough, any treatment effect will appear significant
  • Cohen’s d
59
Q

If The Alternative Hypothesis Proves Correct, What Does the Proportion Look Like?
What about if the Null Hypothesis is Correct?

A
Alternative Hypothesis (H1):
p < \_\_\_\_\_ (alpha value)
Null Hypothesis (H0):
p > \_\_\_\_\_ (alpha value)