16; Dummy Variables and Collider Bias Flashcards
What do dummy variables allow you to incorporate into regression analysis?
Nominal and ordinal variables
Examples include sex, county of birth, and church attendance.
How are dummy variables coded?
As binary variables with values of 0 or 1.
What is the coding for male and female in a dummy variable for sex?
Male = 1, Female = 0.
What is a key characteristic of dummy variables when there are more than two categories?
Create multiple dummy variables to cover each category.
In Cinnirella’s paper, how are birth counties represented?
By creating a variable for each county, e.g., Essex = 1 if born in Essex, otherwise 0.
What effect do dummy variables have on the regression line?
They shift the regression line up or down without affecting the slope.
What is the reference category in a regression with dummy variables?
The category against which other categories are compared.
If we regress multiple dummiess witth multtiple categories, we compare against the reference group
e.g. all other regional coeffficient arre in relation to reference group of London
In the regression of height on sex, which category is the reference category?
Female. (m=1, f=0)
Thus coefficient on sex is the amounnt that men are taller than women
What happens to coefficients and statistical significance when changing the reference group?
They will change.
What is the dependent variable in the dummy variable practice exercise?
Infant Mortality Rate (infant deaths per 1000 births).
What do interactions in regression analysis allow you to test?
Whether there is a differential effect.
How do you figure out the reference group
Explain what is going on in these regression tables
First column has ‘Other District’ as the reference point
Second Column has ‘Textile’ as the reference point
Constant is always the intercept point relative to the reference i.e. Predicted IMR for Textile at log(pop. density)=0
What are interactions
While Dummy variables asssume identical effects of other variabnles (doesn’t change slsope)
Interactions allow you to test for a differential effect. Adds on a ‘B3PopDensi * Mining’
Explain how the Mining and Others Equation is being calculated here using interactions
What is a common rule of thumb for T-statistics in terms of significance?
T-statistics above 2 or less than -2 are significant at the 5% level.
How can statistical significance be measured using standard error?
If the coefficient is greater than 2 times the standard error.
What do Ziliak and McCloskey (2008) argue about statistical significance?
Statistical significance does not imply historical or economic significance.
With very large samples, many variables stat. sig.
What does statistical significance indicate? What do we look for as well?
The probability that the mean difference or coefficient is different from zero.
How much X influes Y, this comes from the relative magnitude of the mean difference or coefficient
What do regression techniques allow for, according to the conclusions?
They are more flexible than what first appears.
Fill in the blank: Dummy variables allow us to incorporate _______ and ordinal variables into regressions.
categorical.
True or False: Not everything that is statistically significant is meaningful.
True.
What is Collider Bias
Collider bias occurs when you control for a variable (the “collider”) that is influenced by two other variables — and this opens up a spurious association between them, even if none existed before.
Why is this an example of collider bias
It’s collider bias because Movie Star status is a variable influenced by both beauty and talent, and conditioning on it (i.e., analyzing only movie stars) creates a false negative correlation between beauty and talent — even though they are uncorrelated in the general population.