Stats II Flashcards

Pre Mid Term

1
Q

Pearl - Intro

Language of queries

A

The description of an intervention or treatment, written symbolically.

eg. Effect of a drug (D) on lifespan (L)

P(L | do(D))

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Pearl - Intro

What is a counterfactual?

A

“What would have happened if we had acted differently (not taken the treatment)?”

Allows us to emulate in retrospect, reflect on past actions and envision alternative scenarios.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Session 2

Individual Treatment Effect

A

The difference of outcomes between a group under treatment and the control group.

ITE = Y1i − Y0i

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Session 2

Fundamental Problem of Causal Inference

A

“It is impossible to observe two states of the world for the same person at the same time (e.g. vaccine effectiveness).”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Session 2

Average Treatment Effect

A

An average of the individual level effect.

ATE = Avg[Y1i − Y0i]

Also known as the Estimand.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Session 2

True ATE

A

AVG[ITE1 + ITE2 + … ITE X]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Session 2

Difference in Means

A

**difference in means = ATE + selection bias
**

naive difference in means = Y1i-Y0i

We can interpret the ATE as a difference in means when there is no selection bias.

Experiments ensure that the selection bias term is 0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Session 2

Randomization

A

Random assignment guarantees that:

**E[Y0i|Di = 1] = E[Y0i|Di = 0]
**
Which means the selection bias term equals zero.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Session 2

Law of Large Numbers - LLN

A

Sample average can be brought as close as we want to the AVG in population by increasing the sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Session 2

Constant Treatment Effects

A

When the treatment affects groups in the same way.

In a hypothetical example, an outcome of Treatment ITE = 6 and and Control ITE 4 means different effects. If both had 4 and 4, there would be a constant treatment effect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Session 2

Randomization considerations

A
  • Feasibility: Not everything that counts should be randomized, and not everything that can be randomized counts.
  • Ethics: Concerning the denial of services we know to be good, the effect of bad treatment, the fair allocation of incentives, the repercussions of using human subjects.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Session 3

DAG

A

Directed Acyclic Graphs (DAG) represent the beliefs, relationships and assumptions of a causal model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Session 3

Confounder

A

A variable that has a mixing effect associated with our causal path. Occurs when treatment and outcome have a shared common cause not controllef for.

May be observable or unobservable.

Can lead to OVB and selection bias.

https://jamanetwork.com/journals/jama/fullarticle/2790247

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Session 3

Backdoor Paths

A

Backdoor paths are non-causal, open paths that create associations even in the absence of a causal effect.

D <- X -> Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Session 3

Mediator

A

A mechanism that mediates the causal relationship of a set of variables. Must be added one wants the total effect of the treatment.

D -> X -> Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Session 3

Collider

A

A third variable that has been influenced by both the treatment and the outcome.

It is a closed path unless controlled for, which induces statistical association between the variables (collider bias).

D -> O <- A -> Y with O being the collider.

https://jamanetwork.com/journals/jama/fullarticle/2790247

17
Q

Session 3

Collider Bias

A

Threatens the internal validity of a study and the accurate estimation of causal relationships.

18
Q

Session 3

Conditioning

A

Means holding the confounder (X) fixed at some value.

eg. “adjusting or controlling for”.

19
Q

Session 3

Backdoor Criterion

A
  • Conditioning for a confounder closes an open backdoor path (eliminates selection bias)
  • Conditioning for a collider opens a closed path (and we get collider bias).
  • Conditioning for a mediator is fine, depending on what the effect to measure (to get the total effect, we would need the to add the direct + mediated treatment effects).
20
Q

Session 3

Overcontrolling

A

“Sometimes you end up controlling for the thing you are trying to measure” (Pearl on Ezra Klein’s example).

21
Q

Session 4

Regression

A

Predicts the value of an outcome variable based on one or more input explanatory variables.

What is our best guess of y given an observed x?

Bivariate regression: yi = α+βxi+ei
*Modeling variable y as a function of one variable x

22
Q

Session 4

Loss function

A

Sum of the squared residuals

22
Q

Session 4

Conditional expectation function

A

E[Yi|Xi = X] = α+βxi

give years of schooling = expected income

α+βxi + vi + s1 * 1500 + s2 * log(100,000)
- v is associated with the dummies, eg. SAT score
- we can change Pi to compare treatment / control

**On average and holding all else constant, a one-unit change in P (eg. from 0 to 1) is associated with a B-unit change in log(y).
**

23
Q

Session 4

Bias-variance tradeoff

A

The difference between being systematically off, but consistent (low bias, high variance), or accepting bias but having low variance.

24
Q

Session 4

Regression of conditional means

A

When we know there is selection bias, the naive difference does not equal ATE. We have to:
- look at naive difference between groups themselves
- get mean or weighted AVG

25
Q

Session 4

Overall average effect

A

log() can help us get a normal distribution for eg. income, education

Yi = α+βxi+ vAi + ei

26
Q

Pearl - Intro

Why did the causal revolution arise?

A

It has been possible, unlike other historical periods, thanks to the vocabulary that allowed us to capture:

1- causal diagrams (what we know)
2- symbolic language (what we want to know).

27
Q

Session 4

Regression for causal effects

A

We can include Estimated B by following our DAG.

If we control for a confounder, given a treatment, we include all controls in the X and keep them constant.

E[Yi | Di = 1, X] - E[Yi | Di = 0, X]

28
Q
A