SAMPLING VARIATION AND CONFIDENCE INTERVALS - LEARNING OUTCOMES Flashcards

1
Q

What is the purpose of using random sampling?

A

Random sampling is the best way to ensure that the sample we are taking is representative of the whole population. This is because every individual in the population has an equal chance of being in the sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a sampling frame?

A

The list of people that you can choose or sample from is known as the sampling frame. For example for school children the sampling frame may be a collection of school registers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is systematic sampling?

A

Systematic sampling is a type of probability sampling method in which sample members from a larger population are selected according to a random starting point and a fixed, periodic interval.

You have to be careful with this sort of sampling because it is only valid if the order of subjects in the population is effectively random.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a simple random sample?

A

‘Simple Random Sample’ - A subset of a statistical population in which each member of the subset has an equal probability of being chosen. A simple random sample is meant to be an unbiased representation of a group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How might you obtain a random sample for a group?

A

First you identify your sampling frame. You then need to select from the sampling frame. You can do this by numbering the list from 1 to N and using random number tables to decide who is in the random sample.

Alternatively, you can use a suitable computer programme, such as SPSS. If you have all the names listed in SPSS you can select a random sample of a certain size from among them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

List two types of random sampling.

A
  1. Simple random sampling

2. Systematic sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is meant by the term ‘sampling error’?

A

Sampling error is defined as error in statistical analysis arising from the unrepresentativeness of the sample taken.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why is it unlikely that for any given sample we are unlikely to obtain the true mean for the actual population from that sample?

A

Because our estimates are subject to sampling error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does the central limit theorem state?

A

The central limit theorem states that the sampling distribution of the mean approaches a normal distribution as the sample size increases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What statistic do can we use to tell us how representative our sample is likely to be compared to the overall population?

A

This statistic is called the standard error of the mean, and it is simply the standard deviation of the sampling distribution.

SE = s / sqrt(N)

where s is the sample standard deviation, and N is the sample size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does a large value of standard error tell us?

A

A large value will tell us that the sample means can be quite different to each other, and therefore a particular sample may not be particularly representative of the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does a small value of standard error tell us?

A

Small values for the standard error tell us that the sample means will be fairly similar, and therefore our particular sample is likely to be a fair reflection of the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Other than standard error, what other measures can we use to assess sampling error?

A

We could also use confidence intervals to define the limits between which we would expect most of the sample means to fall. Most studies will use 95% confidence intervals, although you may sometimes see 99% confidence intervals reported.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

If the mean of your sample represents the data well what will the confidence intervals look like?

A

If the mean of your sample represents the data well, then the confidence intervals of that mean should be small (much like the standard error).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

If the mean of your sample represents the data poorly what will the confidence intervals look like?

A

If the sample mean is a bad representation of the population, then the confidence interval will be wider, indicating that a different sample might produce a mean quite different to that from our original sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How are confidence intervals calculated?

A

The confidence intervals can be calculated once the standard error is known. The 95% confidence interval is simply the mean ± 1.96 multiplied by the standard error.

mean ± 1.96 x SE

17
Q

How many standard deviations away from the mean encompasses 95% of the population in a normal distribution?

A

1.96

18
Q

What adaptation of the normal distribution do we make when sample size is small and why?

A

When sample size is small, we should allow for the fact that even our estimate of the standard deviation (SD) will have some error in it. We do this by using a slight adaptation of the normal distribution, called a ‘t-distribution’ and we use the equivalent 95% cut off from this distribution instead.

19
Q

What measure gives an indication of how well the sample mean reflects the (unknown) population mean?

A

The standard error.

20
Q

What determines the accuracy of the sample mean?

A
  1. Sample size

2. Variability of the measurement

21
Q

How may confidence intervals be interpreted?

A
  1. In terms of plausible range of values (e.g. the plausible range of values is between 500 and 512).
  2. In terms of repeated experiments - i.e. if you went back and repeated the experiment 100 times and calculated a mean for each of those samples 99% of the CIs calculated during this experiment would contain the true mean value.