Stat 354 Flashcards

Question

very important step in methods

Answer 1

test questionnaire on a small scale (pilot study, pre-test) | improve and re-assess

Answer 2

- train people in goals and methods - early quality checking - plan for non-response

Answer 3

- edit questionnaire, record errors - methods for handling non-response - different estimation methods - estimation of precision

Answer 4

some elements of sample fail to provide responses to survey

Answer 5

if non-responders have different opinions/measurements from responders, bias occurs

Answer 6

non-response rate is high

Answer 7

some units are more likely to be included in the sample than others | cannot be overcome by increasing n

Answer 8

collection of sampling units drawn from sampling frame (single or multiple frames)

Answer 9

predicted 57% for Landon | highest response in history: 2.4 million | Roosevelt won with 62%

Answer 10

SRS from phone books and club membership lists -- selection bias (only the wealthier 1/4 of the pop. had phones)

Answer 11

when selection procedure is biased, no size of n will help
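
A minimal sketch of why a larger n does not repair a biased selection procedure. The population figures are hypothetical and chosen only for illustration (they are not the 1936 poll data): a quarter of voters own phones, and owners support the candidate less often than non-owners. The names `phone_owners`, `non_owners`, and `biased_estimate` are assumptions.

```python
import random

# Hypothetical population (NOT the 1936 data): 25% of voters own a phone.
phone_owners = [1] * 10_000 + [0] * 15_000   # 40% support among owners
non_owners = [1] * 52_500 + [0] * 22_500     # 70% support among non-owners
population = phone_owners + non_owners
true_support = sum(population) / len(population)

rng = random.Random(1936)  # fixed seed so the run is repeatable


def biased_estimate(n):
    """Estimate support from a sample drawn only from phone owners."""
    sample = rng.sample(phone_owners, n)
    return sum(sample) / n


small = biased_estimate(100)
large = biased_estimate(20_000)
print(true_support, small, large)
```

Both estimates settle near the phone owners' 40% rather than the population value; the larger sample only estimates the wrong quantity more precisely.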

Answer 12

personal ca. 65% | mailed ca. 25%

Answer 13

ask how it was taken

Answer 14

George Gallup | n = 50,000 people | predicted Roosevelt victory (56% vs. truth 62%) | predicted Digest results (44% vs. truth 43%)

Answer 15

- interviewer assigned fixed number (quota) of subjects to interview - #s w/i categories are fixed

Answer 16

residence, age, sex, economic status

Answer 17

aims to be representative based on census data | ex. design sampling based on % men vs women in population

Answer 18

- while the sample controls for certain variables, it does not control for the one of interest (ex. can't control for Republican vs. Democratic) - interviewers are free to choose whom they want within the quota

Answer 19

Errors of non-observation | Errors of observation

Answer 20

sampling error, coverage error, non-response

Answer 21

deviation between sample estimate and true population value

Answer 22

sampling frame does not match perfectly w/ target population

Answer 23

interviewers | respondents

Answer 24

affect the response of the respondent in some way

Answer 25

body language

Answer 26

- sampling design - sample size - investigator

Answer 27

people who are unlisted in telephone book

Answer 28

differ in their ability and motivation to answer correctly | response error

Answer 29

recall bias, prestige bias, intentional deception, incorrect measurement

Answer 30

different responders recall differently

Answer 31

exaggerate to appear more prestigious

Answer 32

exaggerate income

Answer 33

don't want to admit to breaking the law

Answer 34

respondent doesn't understand measurement units | ex. report on cm vs m; cups of coffee vs travel mugs

Answer 35

reward for responding | inform ahead of time | shortened, concise, focused questionnaire | callback, persistence | marketing - train interviewers to 'sell it' | data cleaning - check for errors

Answer 36

distribution of values of ȳ over repeated samples of same size

Answer 37

- mean = µ - standard deviation = σ/√n - approximately bell-shaped - assumes population is infinite
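
A small simulation sketch of these properties, assuming a normal population with made-up values µ = 10 and σ = 4 and samples of size n = 25. The function name `simulate_sample_means` is hypothetical. Over many repeated samples, the mean of the ȳ values should sit near µ and their standard deviation near σ/√n.

```python
import math
import random
import statistics


def simulate_sample_means(mu, sigma, n, reps, seed=354):
    """Draw `reps` samples of size n and return the mean of each one."""
    rng = random.Random(seed)
    return [
        statistics.fmean([rng.gauss(mu, sigma) for _ in range(n)])
        for _ in range(reps)
    ]


mu, sigma, n = 10.0, 4.0, 25           # illustrative values only
means = simulate_sample_means(mu, sigma, n, reps=5000)

mean_of_means = statistics.fmean(means)   # should be close to mu
sd_of_means = statistics.stdev(means)     # should be close to sigma / sqrt(n)
theoretical_sd = sigma / math.sqrt(n)

print(mean_of_means, sd_of_means, theoretical_sd)
```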

Answer 38

shorter tails than normal | truncated | non-normal

Answer 39

large |Cov(y1, y2)| = greater dependence between y1 and y2 | depends on scale of measurement (units) | standardize by using correlation
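
A short sketch of the unit dependence, using made-up height and weight values. Converting heights from metres to centimetres multiplies the covariance by 100 but leaves the correlation unchanged. The helper names `covariance` and `correlation` are hypothetical and written out by hand so the arithmetic is visible.

```python
import math


def covariance(x, y):
    """Sample covariance with the usual n - 1 divisor."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    return sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)


def correlation(x, y):
    """Covariance standardized by both standard deviations."""
    return covariance(x, y) / math.sqrt(covariance(x, x) * covariance(y, y))


# Illustrative data only.
heights_m = [1.60, 1.70, 1.75, 1.80, 1.90]
weights_kg = [55, 65, 70, 72, 85]
heights_cm = [h * 100 for h in heights_m]

cov_m = covariance(heights_m, weights_kg)
cov_cm = covariance(heights_cm, weights_kg)
corr_m = correlation(heights_m, weights_kg)
corr_cm = correlation(heights_cm, weights_kg)

print(cov_m, cov_cm)    # covariance changes with the units
print(corr_m, corr_cm)  # correlation does not
```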

Answer 40

n independent samples of size 1 | may include duplicates

Answer 41

every possible subset of n from N equally likely to be chosen

Answer 42

1/ (N choose n)

Answer 43

N! / (n! (N-n)!)

Answer 44

product of all positive integers less than or equal to n | ex. 5! = 5 × 4 × 3 × 2 × 1 = 120
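
The counting on the last three cards can be checked with Python's standard `math` module. This is a sketch with an arbitrary small case (N = 5, n = 2), comparing the built-in binomial coefficient with the factorial formula.

```python
import math

N, n = 5, 2  # arbitrary small example

n_samples = math.comb(N, n)  # N choose n
by_formula = math.factorial(N) // (math.factorial(n) * math.factorial(N - n))

# Under SRS, each particular sample has probability 1 / (N choose n).
prob_one_sample = 1 / n_samples

print(math.factorial(5), n_samples, by_formula, prob_one_sample)
```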

Answer 45

n/N | P(ith unit in sample) = n/N = πi

Answer 46

samples that contain i / total number of possible samples
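
A sketch that verifies the inclusion probability by brute-force enumeration for a tiny population (N = 5, n = 2, unit i = 1; all values are arbitrary). It counts the samples that contain unit i and divides by the total number of possible samples, which should equal n/N.

```python
from fractions import Fraction
from itertools import combinations

N, n, i = 5, 2, 1  # arbitrary small example; units are labelled 1..N

all_samples = list(combinations(range(1, N + 1), n))
containing_i = [s for s in all_samples if i in s]

# Exact ratio: samples that contain i / total number of possible samples.
pi_i = Fraction(len(containing_i), len(all_samples))

print(len(containing_i), len(all_samples), pi_i, Fraction(n, N))
```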

Answer 47

- haphazard sampling - list all (N choose n) subsets, choose at random - random number generator - blind sampling - draw elements at random, include if not duplicates
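
A minimal sketch of the last two ideas on this card: drawing with a random number generator and keeping a unit only if it is not a duplicate. The function name `srs_by_rejection` and the population size are assumptions. Python's `random.sample` does the same job in one call.

```python
import random


def srs_by_rejection(N, n, rng):
    """Draw unit labels 1..N at random, keeping a draw only if it is new."""
    chosen = []
    seen = set()
    while len(chosen) < n:
        unit = rng.randrange(1, N + 1)
        if unit not in seen:
            seen.add(unit)
            chosen.append(unit)
    return chosen


rng = random.Random(354)  # fixed seed so the draw is repeatable
sample_a = srs_by_rejection(100, 10, rng)
sample_b = rng.sample(range(1, 101), 10)  # built-in draw without replacement

print(sample_a)
print(sample_b)
```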

Answer 48

using own judgement to draw a sample | ≠ random sample

Answer 49

finite population correction | 1 - (n/N)

Answer 50

ca. 1 | 1 - (n/N) = 1 - (ca. 0)
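
A quick numerical sketch of the correction. The helper names `fpc` and `var_of_mean_srs` are hypothetical, and the variance formula (1 - n/N) s²/n is the standard SRS result, which these cards do not state, so treat it as an assumption here.

```python
import math


def fpc(n, N):
    """Finite population correction: 1 - n/N."""
    return 1 - n / N


def var_of_mean_srs(s2, n, N):
    """Estimated variance of the sample mean under SRS: (1 - n/N) * s^2 / n."""
    return fpc(n, N) * s2 / n


small_population = fpc(100, 400)        # sample is a quarter of the population
large_population = fpc(100, 1_000_000)  # n/N is close to 0, so fpc is close to 1

print(small_population, large_population)
print(var_of_mean_srs(25.0, 100, 400))
```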

Answer 51

n --> ∞, N --> ∞, n/N --> c < 1 | n, N, N-n must be 'sufficiently large' | n ≥ 50 usually ok

Answer 52

blocking (analogous to stratification)

Answer 53

division of population into a number of non-overlapping groups

Answer 54

SRS drawn from each stratum
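
A sketch of the idea with two made-up strata: an independent SRS is drawn within each stratum. The stratum names, sizes, and the function `stratified_sample` are all hypothetical.

```python
import random


def stratified_sample(strata, sizes, seed=354):
    """Draw an independent SRS (without replacement) from each stratum."""
    rng = random.Random(seed)
    return {name: rng.sample(units, sizes[name]) for name, units in strata.items()}


# Illustrative population of 100 unit labels split into two strata.
strata = {"urban": list(range(0, 60)), "rural": list(range(60, 100))}
sizes = {"urban": 6, "rural": 4}

sample = stratified_sample(strata, sizes)
print(sample)
```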

Answer 55

- if means differ between sub-populations, estimates may be more precise - administrative advantages - can obtain separate estimates of each parameter for each stratum

Answer 56

proportion sampled in each stratum

Answer 57

small variance | lowest cost

Answer 58

Ni (# of elements in each stratum) | Si^2 (variability in each stratum) | cost of obtaining an observation in each stratum

Answer 59

larger sample sizes to strata w/ larger pop.'s | larger sample sizes to strata w/ larger variability | smaller sample sizes if costs are high

Answer 60

Optimal allocation | Neyman allocation | Proportional allocation

Answer 61

most information for least cost | choose ni to minimize V(yst) for a fixed C, or minimize C for a fixed V(yst) | C = c0 + Σ ci ni
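
A sketch of the fixed-budget case, assuming the standard textbook solution ni ∝ Ni Si / √ci (a formula not written on the card). The stratum sizes, standard deviations, costs, and budget are made-up numbers, and `optimal_allocation` is a hypothetical helper. Sizes are rounded down so the total cost stays within the budget.

```python
import math


def optimal_allocation(N, S, c, budget, c0=0.0):
    """Allocate n_i proportional to N_i * S_i / sqrt(c_i) within a fixed budget.

    Solves c0 + sum(c_i * n_i) = budget, then rounds each n_i DOWN so the
    total cost cannot exceed the budget.
    """
    denom = sum(Ni * Si * math.sqrt(ci) for Ni, Si, ci in zip(N, S, c))
    k = (budget - c0) / denom
    return [math.floor(k * Ni * Si / math.sqrt(ci)) for Ni, Si, ci in zip(N, S, c)]


# Illustrative inputs only.
N = [155, 62, 93]   # stratum sizes
S = [5, 15, 10]     # stratum standard deviations
c = [9, 9, 16]      # cost per observation in each stratum

alloc = optimal_allocation(N, S, c, budget=1000)
total_cost = sum(ci * ni for ci, ni in zip(c, alloc))
print(alloc, total_cost)
```

The second stratum is the smallest but gets the largest sample because it is the most variable, while the third is penalized for its higher cost per observation.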

Answer 62

special case of optimal allocation | used when costs are equal in all strata
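
With equal costs the cost term drops out, leaving ni ∝ Ni Si. The sketch below assumes that standard form of Neyman allocation (the formula is not written on the card) and uses made-up stratum values; `neyman_allocation` is a hypothetical helper. Sizes are rounded up here because no budget constraint is involved.

```python
import math


def neyman_allocation(N, S, n):
    """Split a total sample size n across strata in proportion to N_i * S_i."""
    weights = [Ni * Si for Ni, Si in zip(N, S)]
    total = sum(weights)
    return [n * w / total for w in weights]


# Illustrative inputs only.
N = [155, 62, 93]   # stratum sizes
S = [5, 15, 10]     # stratum standard deviations

exact = neyman_allocation(N, S, n=40)
rounded = [math.ceil(x) for x in exact]
print(exact, rounded)
```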

Answer 63

split sample into strata w/ same proportion as population | ni/n = Ni/N | the stratified estimator (yst) is the average of all observations
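
A sketch of both claims with made-up numbers: the allocation ni = n Ni/N, and a check that the stratified estimator equals the plain average of all observations when ni/n = Ni/N. The helper `proportional_allocation` and every data value are hypothetical.

```python
import math
from statistics import fmean


def proportional_allocation(N, n):
    """n_i = n * N_i / N, rounded up to a whole unit."""
    total = sum(N)
    return [math.ceil(n * Ni / total) for Ni in N]


alloc = proportional_allocation([155, 62, 93], n=40)
print(alloc)

# Two strata sampled in proportion: 4/6 = 20/30 and 2/6 = 10/30.
N_small = [20, 10]
samples = [[2, 4, 6, 8], [10, 20]]

# Stratified estimator: stratum means weighted by N_i / N.
y_st = sum(Ni / sum(N_small) * fmean(ys) for Ni, ys in zip(N_small, samples))
# Plain average of every observation, ignoring strata.
pooled = fmean([y for ys in samples for y in ys])
print(y_st, pooled)
```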

Answer 64

always round up for n, except for optimal allocation (don't cross budget)

(88 cards)