Sampling Flashcards
What is Sampling?
Sampling is a process of selecting a sufficient number of elements from a population of interest so that by studying the sample, we may generalize the results back to the population from which these elements were chosen. (Sekaran, 1992)
Why do we sample?
- Research projects usually have budget constraints
- Information can be gathered quickly - could be days, weeks, hours (or long-term like census)
- Samples (if properly selected) are sufficiently accurate
Sample may even be more accurate
– If you did research for the whole population, so first results would be invalid by the time you finish gathering info
– Should do fieldwork as quickly as possible
– Higher sample = higher probability of mistakes
You may not need to sample at all; may not make sense
To choose a sample representing the whole population, you’d think about:
- Not practical or ethical to ask very old people
- Highest age is usually 75, 65, sometimes 79
- Depends on target group - if children, your range might be 3-10
- Usually restricted by budget, time, number of questions, practicalities, etc.
- Qualitative research is never “representative”
Size doesn’t matter
- If you choose only two balls from the basket, you will likely not get representative data, so you might need 10
- If you have two pots of soup, one big one small, you would take the same spoonful to test each one
What does sample size depend on?
Research purpose
Type of data analysis
On population characteristics
If population is homogeneous, the smaller sample would be ok
If population has 50 different regions, you would need a bigger sample
What is sample size influenced by?
Judgement about how typical the same may be of total population
Degree of required accuracy (95% is a commonly used figure)
Difficulty and cost of using a large sample
Number of categories in which you intend to subdivide the data
Number of anticipated non-responses
– If it’s not enough, you have to draw another sample
Sampling Element
Case or unit of analysis of the population that can be selected for a sample ( person, organization, group, company etc.)
Sampling Frame
“That list or quasi list of units composing a population from which a sample is selected. If the sample is to be representative of the population, it is essential that the sampling frame include all (or nearly all) members of the population. See Chapter 5.”
Specific list of sampling elements in the theoretical population
– Perfect = List of 10,000 names with name and email
Population Parameter
Characteristic of entire population that you estimate from a sample
Sampling Ratio
“The proportion of elements in the population that are selected to be in a sample. See Chapter 5.”
Ratio of the sample size to the size of target population
Theoretical Population
The group you wish to generalize to (AKA target population)
Study Population
“That aggregation of elements from which a sample is actually selected. See Chapter 5.”
Part of the theoretical population that is accessible for the research (AKA accessible p, survey p, or reachable p)
Sample
Group of people who you select to be in the study
The Problem - From All to Some
Variable ↓ Statistic ↓ Parameter
- Must be very careful about interpreting findings
- The result is always the result just from the sample but not always representative of the whole population; not really the findings
- Research estimates rather than research findings
- Based on this number you only estimate the percent of the whole population; an assumption/estimation
Stages in Selection of a Sample (6 steps)
1. Define the theoretical and study populations ↓ 2. Identify a sampling frame ↓ 3. Determine sampling method ↓ 4. Determine sample size ↓ 5. Select actual sampling units ↓ 6. Collect data
Defining the Theoretical and Study Populations
1st stage of selecting a sample
- Group of specific population elements
- Concerns questions about critical characteristics of the population
Identifying a Sampling Frame
2nd stage of selecting a sample
- List of elements from which a sample may be drawn (ex: phone book)
- Ideally, the source should be representative of the population
- Source should not bias the results (Sampling Frame Error occurs when certain sample elements are not represented in the study population) (ex: people not listed in a phone book)
- RDD - Random Digit Dialing
- – you’ll still only get people with phones
- – one number might be shared or one person might have multiple numbers