2 - Impact Evaluation Flashcards

1
Q

4 different forms of evaluation:

A
  1. Ex-ante appraisal (potential?)
  2. Programmatic evaluation
  3. Comprehensive expenditure review
  4. Impact analysis (some are just glad to help out but investigate if the project actually worked out well and had good impacts are increasingly popular).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why evaluate? Objectives:

A
  • Lesson learning (has it done what it was supposed to?beneficiaries, program, organization, world).
  • Accountability
  • Result-based management (use results to improve, ex test on small scale and then scale up)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The logical framework/model of evaluation:

A
  1. Needs - ex too low literacy in rural India.
  2. Inputs - ex monitor teacher attendance and activity.
  3. Output - parents vist schools daily and report.
  4. Outcome - teachers attend school more regularly and better quality.
  5. Impact - hopefully higher rate of literacy.
  6. Long-term goal - improved educational outcomes and career opportunities.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Different levels of program evaluation:

A
  • Needs Assessment (who is the pop and what do they need?)
  • Program Assessment (how address the needs and what are the prerequisites and shortcomings?)
  • Process evaluation (are the things delivered? built? don’t assess impact, just process).
  • Impact evaluation (all this lecture is about –> lead to the Q: why and when do it work? Can we scale up?)
  • Cost-benefit analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Theory of change:

A

ToC analyses how inputs lead to intended outcomes/impacts. Identify causal steps and which underlying assumptions need to hold, what data we need etc..

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Different types of correlation:

A
  • Causation: X–>Y.
  • Reverse causality: Y–>X
  • Simultaneity: Y–>X and X–>Y
  • Spurious correlation/OV bias: Z–>X and Z–>Y a third variable affecting both.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Counterfactual:

A

Need a group of people telling us what would have been the case if we did NOT implement the program. This cannot be done 100% since we don’t have two identical worlds… But we do our best to find a good enough counterfactual so that w can measure the impact (difference between T and C). This helps us measure causality.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the basic formula for measuring impacts?

A

To take the difference between outcome for participants vs non-participants:
Yi(1) - Yi(0)
But, as we cannot observe same unit, we must take the average impact:
E(Yi(1)) - E(Yi(0)).
So this is the expected value for the T minus the expected value for the C group.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the bias of the impact measurement?

A

The bias is:
E(Y(0)|T) - E(Y(0)|C).
So it’s the difference between being in the treatment group but not receiving the treatment and being in the control group where you obviously not receive the treatment.
If we have a perfect counterfactual, this bias=0.
This B happens because we use an estimate of ATE.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

3 techniques for impact evaluation:

A
  • Experimental design with randomisation (RCT)
  • Matching methods (PSM)
  • Difference in difference
  • Other in the book… see notes.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Random sampling and assignment:

A

When we randomly select a sample from a population and den randomly assign some of them in the sample to the T and the rest to the C.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

RCT

A

Random control trial. When using random sample and assignment, we create a relevant comparison group.
There shouldn’t be any systematical differences between the groups, no bias. –> T and C have same outcome Y in absence of the program.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Is it ethical to randomise?

A

Not always. If the program involves large benefits for the treated ones, then why should my neighbour get those benefits but not me? Just by luck? If we had the chance to prove who needed it the most, maybe it would have been me. But self selection destroys the properties of a relevant counterfactual…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

ATE=

A

Average treatment effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Issues with RCT:

A
  • External validity (specific context)
  • Hawthorne effects - changed behavior for the observed ones.
  • John Henry effect - changed behavior for the controlled, work harder)
  • Contamination/spillover
  • Dropout or attrition
  • Partial eq - measuring short term effects.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

PSM

A

Propensity score matching. Find a group that are similar in the observable characteristics and assume that the unobservables also are similar across treated and untreated.

17
Q

When use PSM?

A

When RCT is not possible, ex in ex-post situations where program is already implemented or when RCT is too expensive.

18
Q

PSM method’s 3 steps:

A
  1. Use surveys to select several characteristics (X)(age, income etc) that help predict participation. Estimate the probability of participation, p(X). This is someone’s propensity score.
  2. Match treated to untreated using p(X), as close as possible.
  3. Impact = average difference in outcomes between the groups.
19
Q

PSM issues:

A
  • Requires a loooot of data to find relevant characteristics

- Strict assumption that unobservables also are similar.

20
Q

Difference-in-difference

A

No random sample available, maybe because the program aimed at help out a certain group.

  • Look at before and after the program
  • Or alternatively look at the change over time of non-beneficiaries as counterfactual (subtracting these differences creates a diff-in-diff).
21
Q

Diff-in-diff ATE=

A

[E(Yt1|T) - E(Yt0|T)] - E((Yc1|C) - E(Yc0|C)).

SO simply the difference of the changes over time for the two groups.

22
Q

Key assumption of DiD

A

Parallel trends - that the groups have the same pace of change before the program starts, bc then we can assume that if the treated group did had the program they would have been equally well off as the untreated.

23
Q

Issues of DiD

A
  • Are the parallel trends true? Maybe affected by spurious correlation?
  • In practice, ex-ante time-varying unobserved heterogeneity could have been taken care of in the program design to ensure that T and C areas share similar pre-program characteristics. But not possible now…
    (if not similar before, the measured impact is not true since Y will be affected).