Sampling Statistics Flashcards
What is a population?
- All people (or items, locations, etc.) of interest
- Who you want your results be relevant for, generalize to
- Can be large (i.e. all 4-year-old children who are English-Spanish bilinguals) or relatively small (i.e. all children in a particular education center)
What is a sample?
- The individuals actually in your study
- Representative of the population (equal chance of people selected; intended vs. accessible population)
- Use sample statistics to make inferences about population parameters
What are parameters?
- Numbers used to describe a population
- EX: mu = population mean
What are statistics?
- Numbers used too describe a sample
- EX: x bar = sample mean
What occurs in a census?
-Population = Sample
What is sampling bias?
- Failure to identify/examine all members of a population
- Sources: samples of convenience, volunteerism
What are the two main types of sampling?
- Probability
- Non-probability
What are the types of probability sampling?
- Simple random sampling
- Systematic random sampling
- Stratified random sampling
- Cluster sampling
- Multistage sampling
What are the types of non-probability sampling?
- Convenience sampling
- Purposive sampling
What is probability sampling?
- Uses some form of random selection, based on probability
- Requires setting up a procedure that assures that the different members of your population have equal probabilities of being chosen
What is simple random sampling?
- Choose such that each sample in the population has an equal chance of being selected (i.e. picking out of a hat)
- Advantages: equal chances of selection, fair, free from sampling bias
- Disadvantages: need to know entire population, not most statistically efficient method, luck of the draw (may not represent subgroups well)
What is systematic random sampling?
- Selecting one member randomly and then choose additional members at evenly spaced intervals
- EX: want a sample of 20/100 students, select 1 every 5th person in the alphabetical class list until you have N= 20
- Disadvantages: you need a complete listing, need to watch out for periodicity in the list
- Advantages: fairly easy to do
What is stratified random sampling?
- Population can be divided into different groups based on criteria (i.e. strata)
- Separate simple random sample from each population stratum
- EX: men vs. women who are ASHA members
- Advantages over simple: assures representation of overall population AND key subgroups, potentially apply results to subgroups
What is cluster sampling?
- Select clusters from population on the basis of simple random sampling, then sample all people in the cluster
- EX: if you want to sample all pre-k kids in MD, take a random sample of MD schools with pre-k programs, sample all those kids in the sampled schools
- Economical, but still susceptible to sampling bias (clusters are intrinsically homogeneous)
What is multistage sampling?
- Combine different methods of probability sampling
- EX: using cluster sampling to select certain schools, and then random sampling within each school
What is non-probability sampling?
- Does not involve random selection
- May or may not represent the population well, hard to know how well (even with large N)
- Susceptible to researcher bias
What is convenience sampling?
- Convenient samples are chosen from a population (what we do most frequently)
- EX: college students, local volunteers
- Disadvantage: no evidence that they are representative of the populations we’re interested in generalizing to (and often would suspect that they are not)
What is purposive sampling?
- Specific, predefined groups that we seek
- Frequent in qualitative research
- Smaller N
- Useful for situations where you need to reach a targeted sample quickly and where sampling for proportionality is not the primary concern
- EX: expert, extreme/deviant cases, criterion sampling (“all white cars”)
Describe how to determine sample size.
- Determine BEFORE you start an experiment (a priori)
- Practical concerns
- Sample size estimates depend on:
- The size of the effect you’re interested in (effect size)
- Variability across the sample (i.e. participants)
- Reliability of your measure