Chapter 18: Sampling Distribution Models Flashcards

1
Q

Define ‘Sampling distribution model’.

A

Different random samples give different values for a statistic. The sampling distribution model shows the behaviour of the satistic over all the possible samples for the same size n.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define ‘Sampling variability (sampling error)’.

A

The variability we expect to see from one random sample to another. It is sometimes called sampling error, but sampling variability is the better term.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define ‘Sampling distribution model for a proportion’.

A

If assumptions of independence and random sampling are met, and we expect at least 10 successes and 10 failures, then the sampling distribution is modelled by a Normal model with a mean equal to the true proportion value, p, and a standard deviation equal to
sqrt( pq/n ).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define ‘Sampling distribution model for a mean’.

A

If assumptions of independence and random sampling are met, and the sample size is large enough, the sampling distribution of the sample mean is modelled by a Normal model with a mean equal to the population mean, μ, and a standard deviation equal to
σ / sqrt(n).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define ‘Central Limit Theorem (CLT)’.

A

States that the sampling distribution model of the sample mean (and proportion) from a random sample is approximately Normal for large n, regardless of the distribution of the population, as long as the observations are independent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Usually the mean of a sampling distribution is the value of…?

A

The parameter estimated. I.e, for the sampling distribution of p̂ the mean is p and for the sampling distribution of ȳ the mean is μ.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The sampling distribution of the mean is Normal, no matter what the underlying distribution of the data is, however…?

A

The CLT says that this happens in the limit, as the sample size grows. The Normal model applies sooner when sampling from a unimodal, symmetric population and more gradually when the population is very non-Normal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Populations with a true proportion, p, close to 0 or 1 can be a problem. What happens in these cases.

A

When p is close to 0 the distribution is skewed to the right and when it is close to 1, it is skewed to the left.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the two assumptions needed to use the Normal sampling distribution model for a sample proportion?

A
  1. The Independence Assumption (Randomization Condition)
  2. The Sample Size Assumption (10% Condition - sample size must be no larger than 10% of population, and Success/Failure Condition - at least 10 successes and 10 failures)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

The Success/Failure Condition wants sufficient data. How much depends on p. Explain.

A

If p is near 0.5, we need a sample of only 20 or so. If p is only 0.01, however, we’d need a sample of 1000. How about if p was 0.99?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the two assumptions needed to use the Normal sampling distribution model for a sample mean(CLT)?

A
  1. The Independence Assumption (Randomization Condition)
  2. The Sample Size Assumption (Large Enough Sample Condition - does not say, likely dependent mostly on data distribution, i.e. skewed vs. normal)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When we have categorical data, do we utilize sample mean or proportion? For quantitative data?

A
Categorical = Proportion
Quantitative = Mean
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Sampling distributions arise because …?

A

Samples vary.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

The CLT quantifies…?

A

Sampling error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

The denominator in the variability of sample means shows that it decreases as the sample size increases. What’s the catch?

A

The standard deviation decreases of the sampling distribution declines only with the square root of the sample size and not, for example, with 1/n. This limits how much we can make a sample tell about the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What happens when we sample more than 10% of the population for sampling distributions of the mean?

A

Mean = The SD formula overestimates the true SD.

For the Proportion?