Sampling and population surveys Flashcards
Probability vs non-probability sampling
Probability: involves random selection
+ random sampling to make inferences from the statistics,
- quite difficult in reality
Non: more realistic but not random.
+Usually cost-effective and time efficient
- High risk of sampling bias
Random Sample
- S_mple ra_dom sa_ple
- S_stemat_c
- Mul_i-st_ge
- Simple random sample: Obtain a list of names (marked w/ a number) then randomly choose/sample
- Systematic sample: Fixed interval, e.g. every 5th person
- Multi-stage - Different stages
=> first sample unis
=> second sample students at random within those unis
Stratified random sample
- St_uctured
- d_spr_portionate
- Cl_ster
Stratified sample structured, proportion of participants match the population
1. divide the pop. into strata (smaller subgroups) based on characteristics => uni level
2. take a random sample that is proportional => 50 first years, 30 second years, etc…
3. pool the subsets to form a random sample
Disproportionate: oversample some groups, usually the minority, to ensure the sample size is sufficient and balance the ss of different groups
Cluster sample: use geographical areas (eg district neighborhoods), randomly select entire clusters
1. define the population
2. cluster them
3. randomly select
4. collect data
Note: more risk of error, could have substantial differences between clusters bc ppl within one are likely to be more similar to one another, therefore less representative
Non probability sampling
- Qu_ta sampling
- C_nven_ence
- Sn_wb_l_
- Pur_osi__
- The_re_c__
- Quota: participants fir one of the specified categories
- usually marketing researching - Convenience: due to easily recruited
- usually quantitative research in psychology - ## Snowball: asked to suggest other SIMILAR PEOPLE to participate
- Purposive: select participants as they are of theoretical interest
- Theoretical: select further participants to “test” as ideas develop
- occurs after some data has been collected, analysis formulated so that further recruits to the study may inform/challenge the developing theory
Socio-demographic info
- abt the nature of the sample studied
- characteristics described depend on the kind of participants and purpose of the study
Considerations when carrying out research (sample size)
- population size
- number of people you can contact, proportion that will participate
- variability of their responses
- how confidence (sig level) for the results
- how accurate for the estimate to be compared with actual pop.
How to determine whether our sample is representative of the population?
- check sample against other available data abt the pop.
- compare it with census/national data on characteristics eg marital status
Sampling error
- occurs due to a sample being a subset of the population
- may not precisely represent it: difference between mean of sample and mean of population
How to reduce sampling error?
- use proper sampling methods
- make the sample size larger so there is more variability
- know your population: ensure only target the sample that matters
Important characteristics for sample and population
- inclu_ion cri_eria
- repre_entativ_ness
- state of constant fl_x + ti_e and c_st
inclusion criteria
- specific what population to focus on
- all possible cases that meet the criteria (not possible cuz pop. always changing)
representativeness
- sample mirrors the pop. in important respects for generalizability
State of constant flux + time and cost
- turn a sample, not pop. (portion of population)
What is sampling frame?
- a tool
- set of info abt the acessible units in a sample
- a list of ALL possible cases within a pop. that can be sampled
(e.g telephone directory, school lists, employment records, etc…)
What is the purpose of sampling?
- to approximate the characteristics that are relevant to the RQ abt a larger population
Note: sample must be representative so that researchers can make “accurate” inferences abt the larger population
Descriptive vs Inferential statistics
- summarize or describe (data distributions from samples and populations)
- statistic = characteristic of sample
- parament = of the pop.
=> statistic is used to infer the parameter(s)