Chapter 7- Sampling- estimating the frequency of behaviors and beliefs Flashcards
External validity
Can the results of a study be generalized to some larger population of interest? Good external validity means that the sample is representative of the population
How is external validity calculated for frequency claims?
To interrogate external validity for frequency claims, we ask whether people who responded to a survey (the sample) can adequately represent the larger population or can be generalized to another setting.
Population
The entire set of people or products in which you are interested.
Sample
A smaller set taken from that population
Census
A census would include surveying every individual in the population- this usually isn’t necessary in research
Population of interest
Researchers have to specify which population they want to generalize- this is the population of interest. Researchers are rarely looking at the world’s entire population. For example, if a sample of people rated a style of shoes on how well they fit, we might be interested in generalizing to the population of people who have worn those shoes
Biased sample
Some members of the population of interest have a much higher probability than other members of being included in the sample.
Unbiased sample
All members of the population have an equal chance of being included in the sample. Only unbiased samples allow us to make inferences about the population of interest
When is a sample considered biased?
In a consumer survey or an online opinion poll, a biased sample would contain too many unusual people. Ex- the students who rate a professor on a website might tend to be the most angry or disgruntled and not represent the group of students. A researcher’s sample might include only one kind of people when the population of interest has more variety. Ex- a sample of only men when the population of interest contains men and women. Sometimes, the population of interest might actually be one specific group, like men. In this case, the researcher has to make sure that the same of men represents the general population of men
Ways a sample might be biased (2)
- Sampling only those who are easy to contact
2. Sampling only those who volunteer
Which validity does a biased sample threaten?
External
Convenience sampling
Using a sample of people who are easy to contact and readily available to participate. College professors conducting studies might tend to use college students, but these samples might not be representative of populations that are less educated, older, or younger. Researchers might also end up with a convenience sample if they are unable to contact a certain subset of people. They might not be able to study those who live far away or those who don’t show up to a study appointment
Self-selection
When a sample is known to contain only people who volunteer to participate. When Internet users choose to rate something, like an Amazon product or a professor on RMP, they are self-selecting when doing so- people who take the time to rate things might have stronger opinions or might be more willing to share ideas with others. We can rule out this bias when participants are invited via random selection
Probability sampling/random sampling
Every member of the population of interest has an equal and known chance of being selected for the sample, regardless of if they are convenient or motivated to volunteer.
Nonprobability sampling
Involves nonrandom sampling and results in a biased sample
Probability sampling techniques (5)
- Simple random sample
- Oversample
- Stratified random sample
- Systematic sample
- Cluster sample or multistage sample
Nonprobability sampling techniques (4)
- Convenience sample
- Quota sample
- Purposive sample
- Snowball sample
Simple random sampling
Some examples- the names of every member of a population are written on a plastic ball, and a machine spits out the number of balls necessary for a sample. A number is assigned to each individual in a population and then certain ones are selected using a table of random numbers- researchers use software to generate random numbers
Systematic sampling
Using a computer or a random number table, the researcher starts by selecting 2 random numbers (4 and 7, for example). If the population of interest is a roomful of students, the researcher would start with the fourth person in the room, counting off every seventh person until the sample is the desired size. The Mehl study is an example of this, since researchers sampled conversations every 12.5 minutes
Why are simple random sampling and systematic sampling not frequently used?
Simple random sampling and systematic sampling can be difficult and time consuming. It can be almost impossible to find and enumerate every member of the population of interest, so the next 3 are variants of the basic technique.
Cluster sampling
An option when people are already divided into arbitrary groups. Clusters of participants within a population of interest are randomly selected, and then all individuals in each selected cluster are used. Example- studying all public high schools (clusters) in a state. A researcher could randomly select 100 of those schools and then select students from each school
Multistage sampling
Two random samples are selected, a random sample of clusters and then a random sample of people within those clusters. High school example- the researcher would select 100 schools randomly, and then select a random sample of students from each of the 100 schools.
Stratified random sampling
Stratified random sampling- the researcher purposefully selects particular demographic categories (strata) and then randomly selects individuals within each of the categories, proportionate to their assumed membership in the population. Example- researchers want to make sure their sample of 1,000 Canadians includes people of South Asian descent in the same proportion as the Canadian population (4%). In a sample of 1,000, they would select 40 people of South Asian descent. All members of the sample would be selected randomly, including the South Asian individuals.
How is stratified random sampling different from cluster sampling? (2)
- Strata are meaningful categories (like ethnic or religious groups), while clusters are more arbitrary.
- The final sample size of the strata reflect their proportion in the population, while clusters aren’t selected with such proportions in mind.