CHAPTER 9 Why Correlation Doesn’t Imply Causation Flashcards

1
Q

What does correlation not necessarily imply?

A

Causation

Correlation indicates a relationship between variables but does not confirm one causes the other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two key reasons why an observed correlation might be a biased estimate of a causal relationship?

A
  • Confounders
  • Reverse causation

Confounders are external factors that can influence both variables, while reverse causation occurs when the outcome affects the predictor.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why is understanding the distinction between correlation and causation important?

A

It prevents making decisions based on misguided beliefs about how actions will affect outcomes

Misinterpreting correlation as causation can lead to significant errors in decision-making.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the definition of causation?

A

A change in some feature of the world that would result from a change to some other feature of the world

Causation implies that one event is the result of the occurrence of another event.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the main argument behind charter schools?

A

To encourage innovation and choice in education

Charter schools operate independently of the public school system and aim to provide alternatives to traditional education.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What happens when charter schools are oversubscribed?

A

Students are admitted by random lottery

This process ensures a fair chance for all applicants regardless of background.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What correlation exists between charter school attendance and academic performance?

A

Low-income students in charter schools have better educational outcomes than those in traditional public schools

This correlation is based on observed performance metrics such as test scores and graduation rates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the significance of the Preuss School in the discussion of charter schools?

A

It has demonstrated outstanding academic outcomes for its students

The school serves low-income students and has a high rate of college admissions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What critical question arises when evaluating the effectiveness of charter schools?

A

Does attending a charter school improve a child’s educational outcomes compared to attending a local public school?

This question addresses the core of the debate regarding charter schools’ impact.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How can we better assess the causal impact of charter schools?

A

By comparing the performance of lottery winners to lottery losers

This method accounts for the characteristics of students who applied to charter schools.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What was found when comparing lottery winners and losers at the Preuss School?

A

The correlation between charter school attendance and performance disappeared; no performance difference was found

This suggests that prior academic differences, not charter school attendance, may account for performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What conclusion can be drawn from studies on charter school effectiveness?

A

In many cases, there is no evidence that winning a charter school lottery impacts achievement

This highlights the importance of comparing similar groups to draw causal inferences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are potential outcomes in the context of causal relationships?

A

They represent the hypothetical outcomes for individuals based on whether they received a treatment or not

This framework helps clarify the causal impact of interventions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the treatment variable represented by T in causal analysis?

A

Going to the charter school

T = 1 indicates attendance at a charter school, while T = 0 indicates attendance at a public school.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does T = 1 indicate for an individual in the context of school attendance?

A

T = 1 indicates that the individual attended the charter school.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does T = 0 indicate for an individual in the context of school attendance?

A

T = 0 indicates that the individual attended a public school.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are the two potential standardized test scores for an individual based on school attendance?

A

Y1i (if attended charter school) and Y0i (if attended public school).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the effect of going to the charter school on person i’s test scores denoted as?

A

Y1i - Y0i.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the Average Treatment Effect (ATE)?

A

ATE is the average effect of going to a charter school across a population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the population difference in means?

A

The difference in average test scores between charter school students and public school students.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the Average Treatment Effect on the Treated (ATT)?

A

ATT is the average effect of going to the charter school among those who attended the charter school.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the Average Treatment Effect on the Untreated (ATU)?

A

ATU is the average effect of going to the charter school among those who attended public schools.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

True or False: The ATE can be directly observed.

A

False.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What does the bias term indicate when estimating the ATT?

A

It indicates that the two groups have baseline differences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What are baseline differences?

A

Differences between treated and untreated groups that exist even in a counterfactual world.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What does the correlation between test scores and charter school attendance potentially indicate?

A

It may indicate a bias rather than a causal relationship.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is the role of randomization in estimating causal effects?

A

Randomization helps ensure that treated and untreated groups are statistically similar.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What are confounders?

A

Features of the world that affect both treatment status and outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What are the two conditions for a variable to be considered a confounder?

A
  • It affects treatment status. * It affects the outcome independently of treatment status.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What example is given to illustrate the concept of confounders?

A

Academic talent as a confounder affecting both treatment and outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is reverse causality?

A

When the outcome affects treatment status.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What example is provided to illustrate reverse causality?

A

The effect of civil war on a country’s economic prosperity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What should be assessed to determine if a correlation is causal?

A

The presence of confounders and potential reverse causality.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

How does reverse causality create baseline differences?

A

It leads to systematic differences in outcomes that are not due to treatment effects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What does the ATE represent in relation to ATT and ATU?

A

ATE is a weighted average of ATT and ATU.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What is reverse causality?

A

Reverse causality is when the outcome affects treatment status

It raises concerns about interpreting correlations as causal relationships.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What should you check for before interpreting a correlation as causal?

A

Potential sources of reverse causality

Consider if the causal arrow might run from the outcome to the treatment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What are the two key sources of bias for estimating causal relationships mentioned?

A
  • Confounders
  • Reverse causality

These sources can distort the true relationship between treatment and outcome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What is the 10,000-Hour Rule?

A

The idea that great success is achieved by committing to 10,000 hours of deliberate practice

Popularized by Malcolm Gladwell, it suggests that talent is secondary to practice.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Who inspired Dan McLaughlin’s pursuit of professional golf?

A

K. Anders Ericsson

Ericsson’s research focused on the importance of deliberate practice for high performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What distinguishes true masters from good experts according to Ericsson?

A

The amount of time devoted to deliberate practice

Deliberate practice is specifically targeted to improve performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

What was the finding of Ericsson’s study on violinists?

A

The best violinists practiced at least 10,000 hours by age eighteen

The least accomplished practiced about 5,000 hours.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What is a confounder?

A

A variable that affects both the treatment and the outcome

It can distort the apparent relationship between them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

True or False: Innate talent is not a factor in achieving high performance.

A

False

Innate talent can influence the amount of deliberate practice and success.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

What does the correlation between practice and success not necessarily imply?

A

A causal relationship

Other factors, like innate talent, can confound this correlation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

What is the consensus among nutrition experts regarding diet soda?

A

Diet soda is considered bad for health

Linked to obesity, diabetes, and heart attacks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

Fill in the blank: There is a negative correlation between drinking diet soda and _______.

A

health outcomes

People who drink diet soda are more likely to experience health issues.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

What are two potential confounders in the diet soda and health outcomes correlation?

A
  • Snacking
  • Obesity/Diabetes

These factors could influence both soda consumption and health status.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

How can reverse causality be conceptualized in terms of confounders?

A

Anticipated outcomes can act as confounders

For example, beliefs about civil war risk can affect economic conditions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
50
Q

What is a key concern when interpreting the correlation between practice and performance?

A

The role of innate talent as a confounder

Talent may bias the interpretation of the impact of practice on success.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
51
Q

What is the effect of anticipating a future civil war on a country’s economy?

A

It can deter investment, lead to capital flight, and cause brain drain

This creates a weaker economy due to perceived civil war risks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
52
Q

What is reverse causality in the context of civil war risk and economic weakness?

A

It suggests that civil war risk causes economic weakness.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
53
Q

Define confounding in relation to civil war risk.

A

Confounders are factors that lead to the belief that a country is at high risk of civil war, causing economic weakness.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
54
Q

What is the primary reason political candidates spend time raising money for campaigns?

A

They believe campaign dollars are essential for their electoral prospects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
55
Q

Who conducted an influential study on campaign spending in 1978?

A

Gary Jacobson.

56
Q

What did Jacobson conclude about campaign spending for challengers and incumbents?

A

Spending helps challengers but has little benefit for incumbents.

57
Q

How is challenger spending correlated with their electoral success?

A

It is strongly positively correlated with challengers’ vote shares.

58
Q

What correlation does incumbent spending have with their electoral success?

A

It is negatively correlated with incumbents’ vote shares.

59
Q

What might too much spending by an already well-known incumbent do?

A

It might annoy and turn off potential supporters.

60
Q

What is a concern regarding electoral strength in interpreting campaign spending data?

A

Electoral strength may be a confounder affecting both fundraising and electoral success.

61
Q

True or False: Incumbents typically spend more money when they are electorally strong.

62
Q

What do subsequent studies suggest about campaign spending effects?

A

They generally suggest positive effects for both challengers and incumbents.

63
Q

What is often required to meaningfully influence the outcome of a large election through campaign donations?

A

Tens of millions of dollars.

64
Q

What does signing the bias refer to in the context of correlation and causality?

A

It assesses whether the correlation over- or under-estimates the causal effect.

65
Q

If the bias in an estimate is positive, what does this imply about the observed correlation?

A

It is an over-estimate of the true causal effect.

66
Q

If the bias in an estimate is negative, what does this imply about the observed correlation?

A

It is an under-estimate of the true causal effect.

67
Q

What is a confounder in the context of charter school attendance and standardized test scores?

A

Students applying to charter schools may be more academically gifted.

68
Q

Does the confounder of academic talent lead to an over- or under-estimate of the effect of charter school attendance on test scores?

A

Over-estimate.

69
Q

What happens when a confounder has opposite signed effects on treatment and outcome?

A

It creates negative bias, leading to an under-estimate of the true effect.

70
Q

What is the relationship between living in a poor neighborhood and charter school attendance?

A

It has a positive effect on the likelihood of attending a charter school.

71
Q

What does the observed correlation between charter school attendance and test scores reflect when confounders are present?

A

It reflects the influence of confounders, leading to bias in the correlation.

72
Q

How can one estimate the extent of bias if data on confounders is not available?

A

By making guesses about the confounder’s effect on the outcome and its correlation with the treatment.

73
Q

What is the implication of assuming the true effect is zero in evaluating bias?

A

It helps to determine how large the bias would have to be to explain the observed correlation.

74
Q

What is the main reason simple correlations are not preferred for learning about causal relationships?

A

Bias in the data

Simple correlations can lead to misleading conclusions about causality.

75
Q

What is sensitivity analysis?

A

An approach that starts with the assumption that the true effect is zero and assesses how large the bias would need to be to explain an observed correlation

This method helps evaluate the plausibility of observed correlations.

76
Q

What is a confounder?

A

A variable that affects both the treatment and the outcome

Confounders can distort the perceived relationship between the treatment and the outcome.

77
Q

True or False: Correlation necessarily implies causation.

A

False

Correlation does not imply causation due to potential confounding factors.

78
Q

What did the 2012 study in The Lancet Infectious Diseases investigate?

A

The relationship between hormonal contraception use and HIV transmission

The study analyzed data from couples with one HIV-infected partner.

79
Q

What were the two significant findings of the Lancet study regarding hormonal contraception and HIV?

A
  • HIV-negative women using hormonal contraception were twice as likely to acquire HIV
  • HIV-infected women using hormonal contraception were twice as likely to transmit HIV

These findings were reported widely but raised concerns about confounding variables.

80
Q

What is a mechanism in the context of causal relationships?

A

A feature of the world that the treatment affects, which then affects the outcome

Mechanisms explain how a treatment produces its effects.

81
Q

How can sexual activity act as a confounder in the study of hormonal contraception and HIV transmission?

A

More sexually active women are more likely to use hormonal contraception and also at greater risk of HIV transmission

This can lead to an over-estimation of the causal relationship.

82
Q

What is the distinction between confounders and mechanisms?

A

Confounders are pre-treatment covariates affecting both treatment and outcome, while mechanisms are post-treatment covariates affected by treatment and then affecting the outcome

Understanding this distinction is crucial for causal inference.

83
Q

In the context of a study on statins, what could be a confounder?

A

Wealth

Wealth can affect both the likelihood of taking statins and the risk of dying from heart disease.

84
Q

What could be considered a mechanism in the statin study?

A

Lower cholesterol

If cholesterol levels are measured after starting statins, then it is a mechanism of the treatment’s effect.

85
Q

True or False: Democracy can be both a confounder and a mechanism in studies of civil war risk and economic factors.

A

True

Democracy can influence both economic conditions and civil war risk, making its role context-dependent.

86
Q

What should you assess when someone presents a correlation as evidence of a causal relationship?

A
  • Are we actually observing a correlation?
  • Does the estimated correlation reflect a genuine relationship?
  • Is this correlation convincing evidence of a causal relationship?

These questions help critically evaluate the validity of the claimed relationship.

87
Q

What is noise in the context of causal inference?

A

Idiosyncratic factors that affect the estimate, separate from any causal relationship

Noise can arise from sampling variation or other unrelated variations.

88
Q

What are the two kinds of ways an estimated correlation can deviate from the causal effect of interest?

A
  1. Noise 2. Bias

Noise refers to idiosyncratic factors affecting the estimate, while bias involves confounders or reverse causation.

89
Q

Define ‘noise’ in the context of correlation estimates.

A

Idiosyncratic factors that affect the estimate, which can arise from sampling variation or measurement errors.

90
Q

What does ‘bias’ refer to when discussing correlation and causation?

A

Confounders or reverse causation that makes the estimate different from the estimand on average.

91
Q

What is a ‘spurious correlation’?

A

A correlation that occurs between two variables without a logical or causal connection.

92
Q

What correlation coefficient (r) indicates a very strong correlation?

93
Q

What example illustrates a correlation between suicides by hanging and government spending on science?

A

The correlation over time showing higher hanging suicides coinciding with higher science spending.

94
Q

What might explain the correlation between hanging suicides and science spending?

A

Potential confounders like population growth affecting both variables.

95
Q

What does the Average Treatment Effect (ATE) measure?

A

The difference in average outcome comparing treated and untreated populations.

96
Q

Fill in the blank: A confounder is a feature of the world that _______.

A

[1] has an effect on treatment status and [2] has an effect on the potential outcome.

97
Q

What is the Average Treatment Effect on the Treated (ATT)?

A

The difference in average outcome comparing the treated subgroup to the counterfactual scenario where they are untreated.

98
Q

What is the Average Treatment Effect on the Untreated (ATU)?

A

The difference in average outcome comparing the untreated subgroup to the counterfactual scenario where they are treated.

99
Q

What is an example of a correlation attributed to noise rather than causation?

A

The correlation between Nicolas Cage movies and swimming pool drownings.

100
Q

Identify two potential confounders that might affect the correlation between marriage and happiness.

A
  1. Socioeconomic status 2. Pre-existing mental health conditions

Both confounders could influence both likelihood of marriage and levels of happiness.

101
Q

What does it mean if an estimate is an over-estimate?

A

The bias is positive, making the estimate larger than the true effect in expectation.

102
Q

What is the difference in means?

A

The difference in average outcome comparing treated and untreated subgroups.

103
Q

How can data analytics improve university fundraising according to the example provided?

A

By identifying correlations that suggest strategies to encourage alumni giving.

104
Q

What conclusion can be drawn from the correlation between sociology doctorates and space launches?

A

It is likely due to noise rather than a causal relationship.

105
Q

What is the effect of population growth on the correlation of hanging suicides and science spending?

A

It could plausibly increase both suicides and science spending.

106
Q

What does the term ‘baseline differences’ refer to?

A

Differences in average potential outcomes between two groups with the same treatment status.

107
Q

What is the implication of a correlation not holding outside the sample data?

A

It suggests the correlation may be a result of noise.

108
Q

What happens to the life satisfaction of people who become divorced according to the study mentioned?

A

It decreases significantly after divorce.

109
Q

What challenge does the correlation between Nicolas Cage movies and swimming pool drownings pose?

A

Conceptualizing the correlation as noise despite having complete data.

110
Q

What does the concept of a ‘mechanism’ or ‘mediator’ refer to?

A

A feature of the world affected by treatment that then affects the outcome.

111
Q

What change in happiness is studied in relation to the year spouses passed away?

A

The change in happiness before, during, and after the year in which their spouses passed away.

112
Q

Does the study of happiness before and after the death of a spouse make you more or less confident in Gilbert’s causal interpretation?

A

This question prompts personal reflection on confidence in Gilbert’s interpretation.

113
Q

What dataset is suggested for downloading to analyze House Elections Spending in 2018?

A

HouseElectionsSpending2018.csv and README.txt.

114
Q

What relationship is analyzed using linear regression in the House Elections Spending data?

A

The relationship between incumbent vote share and incumbent spending.

115
Q

Is the correlation between incumbent spending and vote share positive or negative?

A

This requires analysis of the data to determine.

116
Q

According to the data, do incumbents who spend more do better or worse?

A

This requires analysis of the data to determine.

117
Q

What should be interpreted regarding the magnitude and direction of the correlation between incumbent spending and vote share?

A

This requires analysis of the data to determine.

118
Q

What should be done similarly for challengers in the House Elections Spending analysis?

A

Run a linear regression to find the relationship between challenger vote share and spending.

119
Q

What are three confounders you should identify when assessing evidence of campaign spending effects?

A

This requires personal reflection to identify potential confounders.

120
Q

Do you have any variables in the dataset that measure the confounders identified?

A

Identify a variable that might plausibly measure a confounder in the dataset.

121
Q

Using linear regression, what should be assessed regarding spending and potential confounders in the dataset?

A

Assess whether incumbent spending and challenger spending are correlated with a potential confounder.

122
Q

What is an example of a misinterpretation of correlation as causal relationship?

A

Provide a specific example from a researcher, journalist, or analyst.

123
Q

What should be explained regarding the evidence presented in the misinterpretation example?

A

Explain why the correlation is not persuasive evidence of the purported causal relationship.

124
Q

What should be discussed in terms of the likely direction of bias in the misinterpretation example?

A

Discuss the likely direction of the bias related to the misinterpretation.

125
Q

What should be considered as a better way to estimate the causal relationship of interest in the example?

A

This requires personal reflection on better estimation methods.

126
Q

Who authored the study of the Preuss School at UCSD?

A

Larry McClure, Betsy Strick, Rachel Jacob-Almeida, and Christopher Reichher.

127
Q

What is the main focus of the study ‘Who Benefits from KIPP?’

A

Analyzing the effects of the Knowledge is Power Program.

128
Q

What is the focus of the study by Betts et al. regarding charter schools?

A

The effects of school choice on student integration and achievement.

129
Q

What is the significance of the study by Ericsson et al. in relation to expert performance?

A

It discusses the role of deliberate practice in the acquisition of expert performance.

130
Q

What does the study by Heffron et al. investigate?

A

The use of hormonal contraceptives and risk of HIV-1 transmission.

131
Q

What does Jacobson’s study from 1978 analyze?

A

The effects of campaign spending in congressional elections.

132
Q

What book discusses the concept of spurious correlations?

A

Spurious Correlations: Correlation Does Not Equal Causation by Tyler Vigen.

133
Q

What is the main finding of the study by Zimmermann and Easterlin on happiness?

A

Examines happiness in relation to cohabitation, marriage, divorce, and happiness.

134
Q

What do Gardner and Oswald’s findings suggest about divorcing couples?

A

Investigates whether divorcing couples become happier by breaking up.

135
Q

What is a weighted average?

A

An average where different weights are assigned to different items.