Topic 8 - Sample Surveys Flashcards

1
Q

L.O.

A

LO6 Use the box model to describe chance and chance variability, including sample surveys and the Central Limit Theorem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Limitations of a census

A

Census:
- collect data from the whole population

Limitations:
- Time consuming
- Resources
- Cost
- DIfficult to ensure participation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Parameters and estimates

A

Parameter:
- numerical fact about population, usually what we want to know

Estimate:
- A calculation of sample values best predicts the parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Types of bias

A

Selection bias:
- exclude/ include one type of person

Non-Response bias:
- People who dont want to participate

Interview bias:
- Interviewer has to make a choice of participants in the survey

Measurement bias:
- When the form of questions in survey affects answers

Recall bias:
- People forget things

Social desireability bias:
- People lie

Lack of clarity:
- misinterpret question

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Bias and sample size relationship

A

Larger sample sizes may amplify the bias, not reduce

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Types of samples

A

Simple random sampling:
- Random individuals from population taken without replacement
- Equally random chance

Multi-stage Cluster sampling:
- Probability sampling technique, taking samples in stages and simple random sampling in each stage

Convenience Sampling:
- Not good
- non-probability sampling technique
- Subjects selected due to convenience
- NOT representative

Quota sampling:
- Not good
- non-probability sampling technique
- Assessbled sample has the same proportions as individuals as the entire population with respect to known characteristics

eg. 1500 males, 1000 females
20% from each = 300 males, 200 females

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Modelling a sample survey

A

Can use a survey as a box model

Box = population/ true parameter
Every sample = New survey

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Chance errors in Sample Surveys

A

Always contain chance error from population
Box model can be used to quantify size of chance error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What affects accuracy?

A

Standard Error (SE) will be smaller in larger samples of the population
SE = SD/ root(n)

Surveys drawn without replacement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Correction factor

A
  • Adjusts the SE from the box model to get exact SE
  • Involves both sample size and population sample

As the population gets very BIG compared to the sample size, the correction factor is almost 1

[heft]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Confidence interval

A

How precisely does the survey measure the population

  • Use sample to find an estimate

68%, 95%, 99.7%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Simulating confidence intervals

A

[heft]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Boot strapping

A
  • The computational process that allows us to estimate the properties of the population, by using the properties of a particular sample
  • Useful when there is no information about the population or where the assumptions about the population are questionable

Sample from population, then resample from original sample, then using means etc. create a histogram
[heft]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Potential problem: Normal assumption

A
  • If sample is large enough, the central limit theorum may apply
  • Box model may grow past sensible results (110%)
  • Instead of using a box model, bootstrapping can be sued
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Example of Boot strapping procedure

A
  1. Assume original sample best represents population
  2. Take a large amount of resamples of size n from the original sample with replacement
  3. For each resample, calculate the proportions of 1’s, so there are n simulated proportions
  4. Plot n proportions to investigate the true distribution of proportions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Box Model Vs Boot strapping

A

Boot strap doesn’t exceed 1 as proportion cannot be over 1
Box Model = For population
Bootstrap = For sample

Spreads are similar