Lecture 3: Small-N Comparative Designs Flashcards
Is causation observable?
Causation is fundamentally unobservable. We can make inferences about causation from the associations that we can observe. We can observe associations only
Confounding variables
Definition:
Confounding variables (or confounders) are external variables in a statistical model that correlate with both the dependent variable and the independent variable.
Example:
In a study assessing the relationship between physical activity and heart disease, age could be a confounding variable. Age is associated with heart disease risk (older individuals may have a higher risk), and it’s also associated with physical activity levels (older individuals may be less physically active). Therefore, without proper control, age could distort the perceived relationship between physical activity and heart disease.
Dependent variable (definition + example)
Definition: The dependent variable is what you measure or observe to see if it changes when you tweak something else.
Example: If you’re seeing how more study hours might improve test scores, the test scores are your dependent variable. It’s the thing you’re curious about changing.
Independent variable (definition + example)
Definition: The independent variable is what you change or control to see if it affects something else.
Example: If you’re seeing how more study hours might improve test scores, the number of study hours is your independent variable. It’s the thing you’re changing to see what happens.
Confounder bias
Definition: Confounder bias happens when you’re not sure if the result of your experiment is because of what you changed, or because of some other thing you didn’t think about.
Example: Imagine you’re studying if eating more ice cream leads to more sunburns. You see a connection, but then realize people eat more ice cream and get more sunburns in the summer. Here, the season (summer) is a confounding variable that creates a bias in your results. You can’t be sure if it’s really the ice cream or just more sun exposure causing the sunburns.
Collider variable
Definition: A collider variable is one that is influenced by two other variables. Not considering this variable can make the relationship between the two other variables misleading.
Example: Imagine you’re studying whether owning a pet influences happiness. You find that people who own pets and people who don’t own pets have similar levels of happiness. But then, you discover that both these groups have a high income. High income is a collider variable here, as it can influence both pet ownership (people with high income are more likely to afford pets) and happiness (high income can contribute to happiness). If you don’t account for income, you might incorrectly conclude that owning a pet has no influence on happiness.
What is the fundamental problem of causal inference?
Since causation is defined counterfactually, it is by definition unobservable
Give five different reasons of association between two variables (X and Y)
Chance
X causes Y
Y causes X
A third variable Z causes both X and Y (confounder)
X and Y both cause a third variable Z (collider, if we condition on Z)
Difference between Confounder Bias and Collider Bias
Confounder Bias: This happens when a third variable, a confounder, influences both variables you’re studying (X and Y). If you don’t consider the confounder, it can trick you into thinking X causes Y, when really, the confounder is influencing both.
Example: You think more chocolate eating causes more cavities. But you forgot about tooth brushing. People who eat more chocolate might also brush their teeth less often. Here, tooth brushing is the confounder, and if you don’t consider it, you might wrongly think chocolate alone causes cavities.
Collider Bias: This happens when both variables you’re studying (X and Y) influence a third variable, a collider. If you consider (or control for) this collider, it can trick you into thinking X and Y are not related, when they really are.
Example: You think people who exercise more (X) are less likely to have heart disease (Y). But both X and Y influence a third variable, being thin (Z). If you only look at thin people (controlling for the collider), you might wrongly think exercise doesn’t protect against heart disease, because among thin people, those who exercise and those who don’t might have similar rates of heart disease.
Importance of Research Design
Research designs are crucial in explanatory research as they help us approximate the counterfactual, which is a hypothetical scenario representing what would have happened if a different choice had been made.
Research designs aim to infer causation from association. They help us understand if one variable (X) influences another variable (Y), based on the observed relationship between them.
These designs also attempt to address the non-causal reasons for observing association. This means they help us consider other factors or variables that might be creating a link between X and Y, ensuring that the inferred causality is as accurate as possible.
Overall, a good research design allows us to make more valid causal inferences, reducing the risk of being misled by confounding or collider biases.
What is Comparative Research and its different forms?
Comparative research involves making comparisons to draw conclusions. It is a fundamental aspect of all types of research:
- Experiments: They compare treatment and control groups.
- Large-N analyses: These make comparisons between many units, like a nationwide survey comparing attitudes across different regions.
- Small-N analyses: These compare a smaller number of units. This is often achieved through careful case selection, like comparing different policies across a few carefully selected countries.
- Case-study research: Here, empirical evidence for the case is compared with expectations from different theories and hypotheses. For example, studying one company’s success and comparing it to predictions made by business theories.
- Unobservable causal effect: This refers to a hypothetical comparison, as in comparing what happened with what would have happened under different circumstances.
What is an Experiment in Research?
An experiment is a type of comparative research that compares treatment and control groups. For example, a medical trial may compare outcomes between a group receiving a new drug (treatment group) and a group receiving a placebo (control group).
What are Large-N Analyses in Research?
Large-N analyses are a form of comparative research that compare many units. For example, a nationwide survey that compares attitudes across different regions or populations would be a large-N analysis. The ‘N’ refers to the number of units being compared, which is large in this case.
What are Small-N Analyses in Research?
Small-N analyses compare a smaller number of units. This often involves careful case selection to ensure meaningful comparison. For example, comparing different policies across a few carefully selected countries is a small-N analysis. The ‘N’ refers to the number of units being compared, which is small in this case.
What is Case-Study Research?
Case-study research involves comparing empirical evidence from a single case or a few cases with expectations from different theories and hypotheses. For instance, studying one company’s success and comparing it to predictions made by different business theories is an example of case-study research.
What is the Unobservable Causal Effect in Research?
The unobservable causal effect refers to a hypothetical comparison in research. This involves comparing what happened with what would have happened under different circumstances. This concept is central to the idea of counterfactuals in research design, which is a crucial part of inferring causal relationships.
What are Small-N Comparative Designs in Research?
Small-N comparative designs are hybrid research designs that combine the in-depth analysis of individual cases (within-case analysis) with the logic of comparing multiple units (as in large-N research).
Key points to consider with small-N designs include:
- Qualitative vs. Quantitative?: While small-N designs often involve qualitative data, they can also be used with quantitative data. The choice depends on the nature of the research question and the available data.
How big/small is ‘n’?: The ‘n’ in ‘small-N’ refers to the number of cases or units being studied. There’s no strict rule for what counts as ‘small’, but it’s typically fewer than would be studied in a large-N design.
The goal is to select a number of cases that allows for in-depth analysis while also enabling meaningful comparisons.
These designs are particularly useful when the researcher wants to understand a phenomenon in great depth but also wants to draw comparisons across cases to generate more general insights.