Chapter 1: Introduction to Casual Inference Flashcards

Question 1

Q

What is the difference between correlation and causation?

Answer

A

Correlation occurs when two quantities or random variables move together, where as causation is when a change in one variable changes the other.

Question 2

Q

What is causal inference?

Answer

A

Causal inference is the science of inferring causation from association and understanding when and why they differ

Question 3

Q

Why do we do causal inference?

Answer

A

We do causal inference because we want to know cause-and-effect relationships so that we can intervene on the cause to bring upon a desired effect. E.g. if we know certain marketing spend increases sales profits, managers can leverage that to increase profits.

Question 4

Q

What is the fundamental problem of causal inference?

Answer

A

The fundamental problem of causal inference is that you can never observe the same unit with and without treatment.

Question 5

Q

What is potential outcome?

Answer

A

A potential outcome with reference to an outcome i, is what i’s outcome with be if treated with treatment t.

Question 6

Q

What are assumptions in casual inference?

Answer

A

Assumptions are statements you make when expressing a belief about how the data was generated. The catch is that they usually can’t be verified with the data; that’s why you need to assume them.

Question 7

Q

What is the consistency assumption in casual inference?

Answer

A

The consistency assumption in casual inference is that the potential outcome is consistent with the treatment, i.e. there are no hidden multiple versions then the ones you specific.

For example, in marketing, if you are modelling the effect an advert has on the likelihood of a customer buying a product and you model ‘shown advert’ as a binary treatment i.e. shown advert yes, shown advert no, the consisitency assumption can be violated if there are multiple versions of the advert which might have an effect on the customers likelihood of buying.

Question 8

Q

What is the stable unit of treatment value assumption?

Answer

A

The stable unit of treatment value assumption is that there the treatment on one unit has no effect on another unit, the treatment has no spillover or network effect.

For example, when measuring the effectivness of vaccines, a vaccinated person can reduce the risk of those unvaccinated around them, therefore the SUTVA is violated. Violating the SUTVA can make treatment effects appear smaller than they are as the difference between treated and untreated is smaller due to the spillover of treatment.

Question 9

Q

How can violations of the constiency assumption and the single unit of treatment value assumption be mitigated?

Answer

A

Constiency: Include all versions of the treatment in ones analysis.
SUTVA: Expanding the definition of the treatment effect to include the effect that comes with other other units and by using more flexible models.

Question 10

Q

What is the average treatment effect (ATE)?

Answer

A

The average treatment effect represents the impact the treatment T would have on average. Some units will be more impacted by it, some less, and you can never know the individual impact on a unit.

Question 11

Q

What is average treatment effect on the treated?

Answer

A

Average treatment effect on the treated (ATT) is the treatment effect observed on the population that recieved the treatment.

For example, if you did an offline marketing campaign in a city and you want to know how many extra customers this campaign brought you in that city, this would be the ATT: the effect of marketing on the city where the campaign was implemented.

Question 12

Q

What the conditional treatment effect?

Answer

A

Conditional treatment effect is the effect observed in a specific group.

For example, you might want to know the effect of an email on customers that are older than 45 years and on those that are younger than that. Conditional average treatment effect is invaluable for personalization, since it allows you to know which type of unit responds better to an intervention.

Question 13

Q

Causal quanity example:
You can never know the effect of cutting prices have on sales on individual businesses as you cann’t observe both potential outcomes. Given this limit, what is possible to estimate that is useful in this instance?

Answer

A

ATE: The average impact of price cuts on amount sold.
ATT, how the business engaged in price cuts increased sales .
CATE which is the impact of having sales during the week Christmas.

Question 14

Q

What does bias refer to?

Answer

A

TECHNICAL DEFINITION
You can say that an estimator is biased if it differs from the parameter it is trying to estimate.
Bias = E[β^-β], where
β^ is the estimate.
β the thing it is trying to estimate—the estimand.

For example, an estimator for the average treatment effect is biased if it’s systematically under- or overestimating the true ATE.

Question 15

Q

What does identification refer to in causal inference?

Answer

A

Figuring out how to recover causal quantities from observed data.

Question 16

Q

Why is causal identification needed?

Answer

Study These Flashcards

A

Causal quantities cannot be computed directly from data as you can only observe one potential outcome for each unit of data.

Question 17

Q

What is the Independence Assumption in causal inference?

Answer

Study These Flashcards

A

The Independence Assumption is the assumption that the treatment and control group are indistinguishable and comparable, regardless of if they received the treatment.

Question 18

Q

What is the difference between randomized and observational data in causal inference?

Answer

Study These Flashcards

A

Randomized refers to the treatment being assigned randomly or when the assignment method is fully known and non-deterministic.

Observational data refers to data where the treatment is observed but we don’t know how the treatment was assigned.

Question 19

Q

What are the two steps causal inference is broken down to?

Answer

Study These Flashcards

A

Causal identification: Expressing the causal quantity of interest in terms of observed data.
Estimation: Using observed data to estimate the causal quantity of interest from the data.

Chapter 1: Introduction to Casual Inference Flashcards

(19 cards)