Chapter 12: Sample Surveys Flashcards

1
Q

What are the 3 ideas of sampling?

A
  1. Examine a part of the whole: A sample can give information about the population.
  2. Randomize to make the sample representative.
  3. The sample size is what matters. It’s the size of the sample - and not its fraction of the larger population - that determines the precision of the statistics it yields.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are some sampling methods?

A
  1. Simple random sample (SRS)
  2. Stratified samples
  3. Cluster samples
  4. Systematic samples
  5. Multistage samples
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are some causes of bias?

A
  1. Voluntary response
  2. Convenience samples
  3. Bad sampling frames
  4. Undercover age
  5. Nonresponse bias
  6. Response bias
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define ‘Population’.

A

The entire group of individuals or instances about whom we hope to learn.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define ‘Sample’.

A

A (representative) subset of a population, examines in hope of learning about the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define ‘Sample survey’.

A

A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population. Polls taken to assess voter preferences are common sample surveys.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define ‘Bias’.

A

Any systematic failure of a sampling method to represent its population. It is almost impossible to recover from bias, so efforts to avoid it are well spent. Common errors include relying on voluntary response, undercoverage of the population, nonresponse bias and response bias.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define ‘Randomization’.

A

The best defense against bias is randomization, in which each individual is given a fair, random chance of selection.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define ‘Sample size’.

A

The number of individuals in a sample. The sample size determines how well the sample represents the population, not the fraction of the population sampled.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define ‘Census’.

A

A sample that consists of the entire population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define ‘Population parameter’.

A

A numerically values attribute of a model for a population. We rarely expect to know the true value of a population parameter, but we do hope to estimate it from sampled data. For example, the mean income of all employed people in the country is a population parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Define ‘Statistic, sample statistic’.

A

Values calculated for samples data. Those that correspond to, and thus estimate, a population parameter are of particular interest. For example, the mean income of all employed people in a representative sample can provide a good estimate of the corresponding population parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Define ‘Representative’.

A

A sample is said to be representative if the statistics computed from it accurately reflect the corresponding population parameters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define ‘Simple random sample (SRS)’.

A

A simple random sample of sample size n is a sample in which each set of n elements in the population has an equal chance of selection.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Define ‘Sampling frame’.

A

A list of individuals from who the sample is drawn. Individuals who may be in the population of interest, but who are not in the sampling frame, cannot be included in any sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Define ‘Sampling variability’.

A

The natural tendency of randomly drawn samples to differ, one from another. Sometimes - unfortunately - called sampling error, sampling variability is no error at all, but just the natural result of random sampling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Define ‘Stratified random sample’.

A

A sampling design in which the population is divided into several subpopulations, or strata, and random samples are then drawn from each stratum. If the strata are homogenous, but are different from each other, a stratified sample may yield more consistent results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Define ‘Cluster sample’.

A

A sampling design in which entire groups, or clusters, are chosen at random. Cluster sampling is usually selected as a matter of convenience, practicality or cost.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Define ‘Multistage sample’.

A

A sampling scheme that involves multiple stages of random sampling, where at each successive stage, we sample from lists of ever smaller clusters (hierarchal in nature).

20
Q

Define ‘Systematic sample’.

A

A sample drawn by selecting individuals systematically from a sampling frame. When there is no relationship between the order of the sampling frame and the variables of interest, a systematic sample can be representative.

21
Q

Define ‘Pilot’.

A

A small trial run of a survey to check whether questions are clear. A pilot study can reduce errors due to ambiguous questions.

22
Q

Define ‘Voluntary response bias’.

A

Bias introduced to a sample when individuals can choose on their own whether to participate in the sample. Samples based on voluntary response are always invalid and cannot be recovered, no matter how large the sample size.

23
Q

Define ‘Convenience sample’.

A

A sample of individuals who are conveniently available. Convenience samples often fail to be representative because every individual in the population is not equally convenient to sample.

24
Q

Define ‘Undercoverage’.

A

A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population suffers from undercoverage.

25
Q

Define ‘Nonresponse bias’.

A

Bias introduced when a large fraction of those samples fails to respond. Those who do respond are likely to not represent the entire population. Voluntary response bias is a form of nonresponse bias. but nonresponse bias may occur for other reasons. For example, those who work during the day won’t respond to a telephone survey during working hours.

26
Q

Define ‘Response bias’.

A

Anything in a survey design that influences responses. One typical response bias arises from the wording of questions, which may suggest a favoured response.

27
Q

Explain the difference between a population, a sampling frame, and a sample

A

Pop.-Entire group of indv.
Sampling Frame-List of indv. from whom sample is drawn
Sample-Represents a pop.

28
Q

What does it mean for a sample to be representative of a population

A

Small sample basically covers what entire population thinks or does

29
Q

What is meant by a biased sample

A

Fails to represent its population accuracy

30
Q

What is the role of randomization in selecting a sample

A

protects influences of all features of a population

31
Q

What is meant by a census? Why is a census often impractical?

A

Census- special sample, everyone included, responses from entire pop.
-impractical because pop. is constantly changing

32
Q

Explain the difference between a parameter and statistic

A

A parameter is something we hope to estimate from data

33
Q

A Simple Random Sample (SRS) must satisfy what two conditions?

A

Every subject/unit/etc. must have an equal chance for being selected and each combo of subject/unit/etc. must have equal chance of being selected.

34
Q

What is meant by sampling variability

A

differences between each randomly chosen sample

35
Q

When is stratified random sampling useful

A

When two or more diff. groups may bias your results. Split them and analyze separately

36
Q

When is cluster sampling useful

A

When sample size is too large

37
Q

What is meant by a multistage sampling

A

combining several sampling methods together

38
Q

When is systematic sampling appropriate

A

When there is no relationship between order of sampling frame and variables of interest

39
Q

In what way are voluntary response samples often biased

A

Usually biased towards those with strong opinions or strongly motivated

40
Q

Why is convenience sampling unreliable

A

Only including individuals convenient to you isnot necessarily representative of population

41
Q

What is meant by under coverage? Give an example

A

Proportion of pop. not sample at all or has small representation in sample than it has in pop.
ex: telephone survey and you eat out, less likely to answer telephone and be surveyed

42
Q

Explain the difference between non-response bias and response bias

A

non response bias-lack of response bias results, impossible what non respondents have said
response bias-refers to anything in survey design that influences responses

43
Q

parameter

A

numbers in model that have to be chosen to explicitly determine value of model

44
Q

statistic

A

any summary found from the data

45
Q

response bias

A

Preconceived notions of a person answering [a survey] which may alter the experiments purpose. One typical example of this arises from the wording of questions, which may suggest a favored response. Voters, for example, are more likely to express support of “the president” than support of the particular person holding that office at the moment

46
Q

non response bias

A

bias introduced when a large fraction of those sampled fails to respond. Those who do respond are likely to not represent the entire population.Voluntary response bias is one form of this, but can occur for other reasons. For example, those who are at work during the day won’t respond to a telephone survey conducted only during working hours