Chapter 1: Introduction to Casual Inference Flashcards

1
Q

What is the difference between correlation and causation?

A

Correlation occurs when two quantities or random variables move together, where as causation is when a change in one variable changes the other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is causal inference?

A

Causal inference is the science of inferring causation from association and understanding when and why they differ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why do we do causal inference?

A

We do causal inference because we want to know cause-and-effect relationships so that we can intervene on the cause to bring upon a desired effect. E.g. if we know certain marketing spend increases sales profits, managers can leverage that to increase profits.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the fundamental problem of causal inference?

A

The fundamental problem of causal inference is that you can never observe the same unit with and without treatment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is potential outcome?

A

A potential outcome with reference to an outcome i, is what i’s outcome with be if treated with treatment t.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are assumptions in casual inference?

A

Assumptions are statements you make when expressing a belief about how the data was generated. The catch is that they usually can’t be verified with the data; that’s why you need to assume them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the consistency assumption in casual inference?

A

The consistency assumption in casual inference is that the potential outcome is consistent with the treatment, i.e. there are no hidden multiple versions then the ones you specific.

For example, in marketing, if you are modelling the effect an advert has on the likelihood of a customer buying a product and you model ‘shown advert’ as a binary treatment i.e. shown advert yes, shown advert no, the consisitency assumption can be violated if there are multiple versions of the advert which might have an effect on the customers likelihood of buying.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the stable unit of treatment value assumption?

A

The stable unit of treatment value assumption is that there the treatment on one unit has no effect on another unit, the treatment has no spillover or network effect.

For example, when measuring the effectivness of vaccines, a vaccinated person can reduce the risk of those unvaccinated around them, therefore the SUTVA is violated. Violating the SUTVA can make treatment effects appear smaller than they are as the difference between treated and untreated is smaller due to the spillover of treatment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How can violations of the constiency assumption and the single unit of treatment value assumption be mitigated?

A

Constiency: Include all versions of the treatment in ones analysis.
SUTVA: Expanding the definition of the treatment effect to include the effect that comes with other other units and by using more flexible models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the average treatment effect (ATE)?

A

The average treatment effect represents the impact the treatment T would have on average. Some units will be more impacted by it, some less, and you can never know the individual impact on a unit.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is average treatment effect on the treated?

A

Average treatment effect on the treated (ATT) is the treatment effect observed on the population that recieved the treatment.

For example, if you did an offline marketing campaign in a city and you want to know how many extra customers this campaign brought you in that city, this would be the ATT: the effect of marketing on the city where the campaign was implemented.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What the conditional treatment effect?

A

Conditional treatment effect is the effect observed in a specific group.

For example, you might want to know the effect of an email on customers that are older than 45 years and on those that are younger than that. Conditional average treatment effect is invaluable for personalization, since it allows you to know which type of unit responds better to an intervention.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Causal quanity example:
You can never know the effect of cutting prices have on sales on individual businesses as you cann’t observe both potential outcomes. Given this limit, what is possible to estimate that is useful in this instance?

A

ATE: The average impact of price cuts on amount sold.
ATT, how the business engaged in price cuts increased sales .
CATE which is the impact of having sales during the week Christmas.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does bias refer to?

A

TECHNICAL DEFINITION
You can say that an estimator is biased if it differs from the parameter it is trying to estimate.
Bias = E[β^-β], where
β^ is the estimate.
β the thing it is trying to estimate—the estimand.

For example, an estimator for the average treatment effect is biased if it’s systematically under- or overestimating the true ATE.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does identification refer to in causal inference?

A

Figuring out how to recover causal quantities from observed data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Why is causal identification needed?

A

Causal quantities cannot be computed directly from data as you can only observe one potential outcome for each unit of data.

17
Q

What is the Independence Assumption in causal inference?

A

The Independence Assumption is the assumption that the treatment and control group are indistinguishable and comparable, regardless of if they received the treatment.

18
Q

What is the difference between randomized and observational data in causal inference?

A

Randomized refers to the treatment being assigned randomly or when the assignment method is fully known and non-deterministic.

Observational data refers to data where the treatment is observed but we don’t know how the treatment was assigned.

19
Q

What are the two steps causal inference is broken down to?

A
  1. Causal identification: Expressing the causal quantity of interest in terms of observed data.
  2. Estimation: Using observed data to estimate the causal quantity of interest from the data.