Chapter 12: Sample Surveys Flashcards

Question 1

Q

What are the 3 ideas of sampling?

Answer

A

Examine a part of the whole: A sample can give information about the population.
Randomize to make the sample representative.
The sample size is what matters. It’s the size of the sample - and not its fraction of the larger population - that determines the precision of the statistics it yields.

Question 2

Q

What are some sampling methods?

Answer

A

Simple random sample (SRS)
Stratified samples
Cluster samples
Systematic samples
Multistage samples

Question 3

Q

What are some causes of bias?

Answer

A

Voluntary response
Convenience samples
Bad sampling frames
Undercover age
Nonresponse bias
Response bias

Question 4

Q

Define ‘Population’.

Answer

A

The entire group of individuals or instances about whom we hope to learn.

Question 5

Q

Define ‘Sample’.

Answer

A

A (representative) subset of a population, examines in hope of learning about the population.

Question 6

Q

Define ‘Sample survey’.

Answer

A

A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population. Polls taken to assess voter preferences are common sample surveys.

Question 7

Q

Define ‘Bias’.

Answer

A

Any systematic failure of a sampling method to represent its population. It is almost impossible to recover from bias, so efforts to avoid it are well spent. Common errors include relying on voluntary response, undercoverage of the population, nonresponse bias and response bias.

Question 8

Q

Define ‘Randomization’.

Answer

A

The best defense against bias is randomization, in which each individual is given a fair, random chance of selection.

Question 9

Q

Define ‘Sample size’.

Answer

A

The number of individuals in a sample. The sample size determines how well the sample represents the population, not the fraction of the population sampled.

Question 10

Q

Define ‘Census’.

Answer

A

A sample that consists of the entire population.

Question 11

Q

Define ‘Population parameter’.

Answer

A

A numerically values attribute of a model for a population. We rarely expect to know the true value of a population parameter, but we do hope to estimate it from sampled data. For example, the mean income of all employed people in the country is a population parameter.

Question 12

Q

Define ‘Statistic, sample statistic’.

Answer

A

Values calculated for samples data. Those that correspond to, and thus estimate, a population parameter are of particular interest. For example, the mean income of all employed people in a representative sample can provide a good estimate of the corresponding population parameter.

Question 13

Q

Define ‘Representative’.

Answer

A

A sample is said to be representative if the statistics computed from it accurately reflect the corresponding population parameters.

Question 14

Q

Define ‘Simple random sample (SRS)’.

Answer

A

A simple random sample of sample size n is a sample in which each set of n elements in the population has an equal chance of selection.

Question 15

Q

Define ‘Sampling frame’.

Answer

A

A list of individuals from who the sample is drawn. Individuals who may be in the population of interest, but who are not in the sampling frame, cannot be included in any sample.

Question 16

Q

Define ‘Sampling variability’.

Answer

A

The natural tendency of randomly drawn samples to differ, one from another. Sometimes - unfortunately - called sampling error, sampling variability is no error at all, but just the natural result of random sampling.

Question 17

Q

Define ‘Stratified random sample’.

Answer

A

A sampling design in which the population is divided into several subpopulations, or strata, and random samples are then drawn from each stratum. If the strata are homogenous, but are different from each other, a stratified sample may yield more consistent results.

Question 18

Q

Define ‘Cluster sample’.

Answer

A

A sampling design in which entire groups, or clusters, are chosen at random. Cluster sampling is usually selected as a matter of convenience, practicality or cost.

Question 19

Q

Define ‘Multistage sample’.

Answer

A

A sampling scheme that involves multiple stages of random sampling, where at each successive stage, we sample from lists of ever smaller clusters (hierarchal in nature).

Question 20

Q

Define ‘Systematic sample’.

Answer

A

A sample drawn by selecting individuals systematically from a sampling frame. When there is no relationship between the order of the sampling frame and the variables of interest, a systematic sample can be representative.

Question 21

Q

Define ‘Pilot’.

Answer

A

A small trial run of a survey to check whether questions are clear. A pilot study can reduce errors due to ambiguous questions.

Question 22

Q

Define ‘Voluntary response bias’.

Answer

A

Bias introduced to a sample when individuals can choose on their own whether to participate in the sample. Samples based on voluntary response are always invalid and cannot be recovered, no matter how large the sample size.

Question 23

Q

Define ‘Convenience sample’.

Answer

A

A sample of individuals who are conveniently available. Convenience samples often fail to be representative because every individual in the population is not equally convenient to sample.

Question 24

Q

Define ‘Undercoverage’.

Answer

A

A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population suffers from undercoverage.

Question 25

Q

Define ‘Nonresponse bias’.

Answer

A

Bias introduced when a large fraction of those samples fails to respond. Those who do respond are likely to not represent the entire population. Voluntary response bias is a form of nonresponse bias. but nonresponse bias may occur for other reasons. For example, those who work during the day won’t respond to a telephone survey during working hours.

Question 26

Q

Define ‘Response bias’.

Answer

A

Anything in a survey design that influences responses. One typical response bias arises from the wording of questions, which may suggest a favoured response.

Question 27

Q

Explain the difference between a population, a sampling frame, and a sample

Answer

A

Pop.-Entire group of indv.
Sampling Frame-List of indv. from whom sample is drawn
Sample-Represents a pop.

Question 28

Q

What does it mean for a sample to be representative of a population

Answer

A

Small sample basically covers what entire population thinks or does

Question 29

Q

What is meant by a biased sample

Answer

A

Fails to represent its population accuracy

Question 30

Q

What is the role of randomization in selecting a sample

Answer

A

protects influences of all features of a population

Question 31

Q

What is meant by a census? Why is a census often impractical?

Answer

A

Census- special sample, everyone included, responses from entire pop.
-impractical because pop. is constantly changing

Question 32

Q

Explain the difference between a parameter and statistic

Answer

A

A parameter is something we hope to estimate from data

Question 33

Q

A Simple Random Sample (SRS) must satisfy what two conditions?

Answer

A

Every subject/unit/etc. must have an equal chance for being selected and each combo of subject/unit/etc. must have equal chance of being selected.

Question 34

Q

What is meant by sampling variability

Answer

A

differences between each randomly chosen sample

Question 35

Q

When is stratified random sampling useful

Answer

A

When two or more diff. groups may bias your results. Split them and analyze separately

Question 36

Q

When is cluster sampling useful

Answer

A

When sample size is too large

Question 37

Q

What is meant by a multistage sampling

Answer

A

combining several sampling methods together

Question 38

Q

When is systematic sampling appropriate

Answer

A

When there is no relationship between order of sampling frame and variables of interest

Question 39

Q

In what way are voluntary response samples often biased

Answer

A

Usually biased towards those with strong opinions or strongly motivated

Question 40

Q

Why is convenience sampling unreliable

Answer

A

Only including individuals convenient to you isnot necessarily representative of population

Question 41

Q

What is meant by under coverage? Give an example

Answer

A

Proportion of pop. not sample at all or has small representation in sample than it has in pop.
ex: telephone survey and you eat out, less likely to answer telephone and be surveyed

Question 42

Q

Explain the difference between non-response bias and response bias

Answer

A

non response bias-lack of response bias results, impossible what non respondents have said
response bias-refers to anything in survey design that influences responses

Question 43

Q

parameter

Answer

A

numbers in model that have to be chosen to explicitly determine value of model

Question 44

Q

statistic

Answer

A

any summary found from the data

Question 45

Q

response bias

Answer

A

Preconceived notions of a person answering [a survey] which may alter the experiments purpose. One typical example of this arises from the wording of questions, which may suggest a favored response. Voters, for example, are more likely to express support of “the president” than support of the particular person holding that office at the moment

Question 46

Q

non response bias

Answer

A

bias introduced when a large fraction of those sampled fails to respond. Those who do respond are likely to not represent the entire population.Voluntary response bias is one form of this, but can occur for other reasons. For example, those who are at work during the day won’t respond to a telephone survey conducted only during working hours