Practical Tips Flashcards
When conducting our power calculation, we did not consider covariates (control variables) in the formula, even though we plan to use baseline outcomes and covariates when conducting the final analysis. As such, compared to the estimates from our power calculation, in our final analysis…
The inclusion of baseline covariates reduces residual variance; in doing so, it increases the power of the experiment and allows for the detection of a smaller effect size (i.e., the minimum detectable effect size becomes smaller).
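As a rough sketch of why this works (Python, with an illustrative assumption that controls explain 40% of the outcome variance), the standard two-arm MDE formula scales with the residual standard deviation, so soaking up variance shrinks the MDE:

```python
from statistics import NormalDist

def mde(sd, n_per_arm, alpha=0.05, power=0.80):
    """Minimum detectable effect for a two-arm, individual-level RCT
    with equal allocation: MDE = (z_{1-alpha/2} + z_{power}) * sd * sqrt(2/n)."""
    z = NormalDist().inv_cdf
    return (z(1 - alpha / 2) + z(power)) * sd * (2 / n_per_arm) ** 0.5

# Illustrative: suppose baseline covariates explain 40% of outcome variance.
sd_raw, n = 1.0, 500
sd_resid = (1 - 0.40) ** 0.5 * sd_raw
print(mde(sd_raw, n), mde(sd_resid, n))  # residual-variance MDE is smaller
```

The 40% figure and the sample size are placeholders; the point is only that the MDE shrinks in proportion to the residual standard deviation.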
One major lesson from observing ICCs?
ICC is context-dependent and hard to predict
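Because the ICC is hard to predict, it is usually estimated from pilot or existing data. A minimal sketch (Python, simulated data with an illustrative true ICC of 0.2) using the standard one-way ANOVA estimator for balanced clusters:

```python
import random
from statistics import mean

def icc_anova(clusters):
    """One-way ANOVA estimate of the ICC for balanced clusters.
    Each inner list holds one cluster's outcomes."""
    k, m = len(clusters), len(clusters[0])
    grand = mean(v for c in clusters for v in c)
    msb = m * sum((mean(c) - grand) ** 2 for c in clusters) / (k - 1)
    msw = sum((v - mean(c)) ** 2 for c in clusters for v in c) / (k * (m - 1))
    return (msb - msw) / (msb + (m - 1) * msw)

# Simulated pilot data: cluster-effect sd 0.5, individual sd 1.0,
# so the true ICC is 0.25 / (0.25 + 1) = 0.2 (illustrative numbers).
random.seed(0)
data = [[random.gauss(u, 1.0) for _ in range(20)]
        for u in (random.gauss(0, 0.5) for _ in range(100))]
print(round(icc_anova(data), 2))
```

With real data one would substitute the pilot outcomes for the simulated lists; the estimator itself is unchanged.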
We would sometimes want to pool different treatment arms (i.e. Groups A and B vs Control rather than either Group A vs. Control or Group B vs. Control) when conducting our analysis because (select all that apply):
It gives us a larger sample size than if we were to only compare either A or B to control & Whether “any treatment” (A or B) is effective is an interesting question to answer
In order to identify interaction effects, one would need to provide both treatment A and treatment B to one group; merely pooling the two separate groups would not allow for the identification of interaction effects. Pooling the groups also has no bearing on the allocation ratio between A and B, since individuals still maintain their original treatment assignment. What a pooled treatment arm does is increase the sample size, and hence the power, for a comparison of “any treatment” versus a comparison group. This is useful when “any treatment” versus control is a question one is interested in answering.
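A small numeric sketch of the sample-size benefit (Python, simulated data with illustrative arm sizes and effects): pooling A and B roughly doubles the treated sample, which shrinks the standard error of the “any treatment” comparison relative to either single-arm comparison.

```python
import random
from statistics import mean, stdev

# Hypothetical records: 200 control, 100 in arm A, 100 in arm B.
random.seed(1)
records = ([("control", random.gauss(0.0, 1)) for _ in range(200)]
           + [("A", random.gauss(0.3, 1)) for _ in range(100)]
           + [("B", random.gauss(0.3, 1)) for _ in range(100)])

def diff_and_se(treat, ctrl):
    """Difference in means and its (unequal-variance) standard error."""
    se = (stdev(treat) ** 2 / len(treat) + stdev(ctrl) ** 2 / len(ctrl)) ** 0.5
    return mean(treat) - mean(ctrl), se

ctrl = [y for arm, y in records if arm == "control"]
a_only = [y for arm, y in records if arm == "A"]
pooled = [y for arm, y in records if arm != "control"]  # "any treatment"
d_a, se_a = diff_and_se(a_only, ctrl)
d_pool, se_pool = diff_and_se(pooled, ctrl)
print(se_a, se_pool)  # the pooled comparison has the smaller standard error
```

The effect sizes and arm sizes are made up; only the relative precision of the two comparisons is the point.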
What effect size should you use when designing your experiment?
Smallest effect size that is still cost effective
Which statement is NOT correct?
- MDE has a big influence on power, for a given sample size
- When calculating MDE we need to think about likely take up rates
- A smaller MDE means the effect will be estimated more precisely (all else equal)
- In general, cheaper programs can be tested with smaller sample sizes
In general, cheaper programs can be tested with smaller sample sizes
Cheaper programs can have a smaller impact and still be cost-effective. Therefore we would likely want to choose a smaller MDE when evaluating these programs, which would require a LARGER sample size. Take-up rates impact the detectable effect of one’s experiment, while a smaller MDE means that the effect will be more precisely estimated, all other things being equal. For a given sample size, one will have less power to measure smaller MDEs than larger ones.
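The explanation above can be sketched numerically (Python, illustrative parameters). The standard per-arm sample-size formula shows both claims: halving the MDE quadruples the required sample, and imperfect take-up, which dilutes the intent-to-treat effect, has the same quadratic cost.

```python
from statistics import NormalDist

def n_per_arm(mde, sd=1.0, alpha=0.05, power=0.80, take_up=1.0):
    """Approximate per-arm sample size for a two-arm trial.
    Imperfect take-up shrinks the detectable intent-to-treat
    effect to take_up * mde, inflating the required n."""
    z = NormalDist().inv_cdf
    effective = take_up * mde
    return 2 * ((z(1 - alpha / 2) + z(power)) * sd / effective) ** 2

print(round(n_per_arm(0.20)))               # baseline MDE
print(round(n_per_arm(0.10)))               # halving the MDE: 4x the n
print(round(n_per_arm(0.20, take_up=0.5)))  # 50% take-up: also 4x the n
```

The standard deviation and MDE values are placeholders; only the scaling relationships carry over to a real design.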
(For individual-level randomization) Increasing the sample size by a factor of 4 results in
Making the standard error half the size
Recall that the standard error is a function of the population standard deviation and the sample size. Specifically, for a (non-cluster) randomized trial the standard error is given by the standard deviation divided by the square root of the sample size. Increasing the sample size from N to 4N would thus multiply the standard error by 1 divided by the square root of 4, i.e., 1/2. Quadrupling the sample size would thus halve the standard error.
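A two-line check of this arithmetic (Python, using the difference-in-means standard error for equal arms; the sd and n values are arbitrary):

```python
def se(sd, n_per_arm):
    """SE of a difference in means with equal arms: sd * sqrt(2/n)."""
    return sd * (2 / n_per_arm) ** 0.5

print(se(1.0, 100))  # n per arm
print(se(1.0, 400))  # 4n per arm: exactly half the SE
```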
Taking a random sample from the entire target population…
means that one’s sample is more representative of the broader target population. This means that any inferences about impact that one draws from the sample are likely to apply to the target population at large as well (external validity).
Drawing a random sample from within each cluster…
allows us to draw inferences about the cluster without surveying everyone in it; it is akin to thinking of the cluster as the target population.
In the maternal literacy project, we did not want to select clusters that were too big because we were afraid that
If Pratham only held one class in that cluster, take-up of mothers in our sample might be too small
In the maternal literacy project, we did not want to select clusters that were too small because:
Our sample size calculations were based on 20 households per cluster
If clusters that were too small were selected (with fewer households in the cluster than were assumed while doing the power calculations), this would have effectively reduced the sample size and lowered the power of the study to detect an impact.
We conducted a Rural Resource Assessment primarily because we needed to identify
Appropriately-sized clusters for our unit of randomization
As Marc explains in the Mother Literacy example, the Rural Resource Assessment came after the study location (villages in a district of Bihar) was determined. Habitations (clusters of approximately 20 households) were selected as the unit of randomization. The Rural Resource Assessment served to identify which habitations/clusters would be of a sufficient size for the study.
We conducted a census for the maternal literacy project primarily because we needed to identify
Households to randomly select as our unit of analysis
Once the Rural Resource Assessment had determined which clusters were of a sufficient size for the study, a census helped identify which households had eligible mothers; these households were then randomly selected as the unit of analysis, and data were collected on them.
Within the maternal literacy program, within one stratum of 10 villages, if the combined intervention (ML + CHAMP) was administered in 2 villages, which other arm(s) would also be administered in exactly 2 villages?
Control
As Marc illustrates in the previous video, within a stratum of 20 villages there were 5 villages each that received ML, CHAMP, ML + CHAMP, and neither ML nor CHAMP. However, in a stratum of 10 villages, 3 villages each received either ML or CHAMP, with 2 villages receiving both ML and CHAMP and 2 villages receiving neither, serving as the pure control.
Steps to calculate power
- Set desired power (80%, 90%) and significance level (typically 5%, i.e. 95% confidence)
- Calculate residual variance (& ICC) using pilot data, national data sources, or data from other studies
- Decide number of treatments
- Set MDE size for T vs C and between treatments
- Decide allocation ratio
- Calculate sample size
- Estimate resulting budget
- Adjust parameters above (e.g. cut number of arms)
- Repeat
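The iterate-and-adjust loop in the steps above can be sketched as follows (Python; the ICC, cluster size, MDEs, and cost per person are all illustrative assumptions). The clustered sample size is the individual-level n inflated by the design effect 1 + (m − 1) × ICC:

```python
from statistics import NormalDist

def cluster_sample_size(mde, sd, icc, m, alpha=0.05, power=0.80):
    """Individuals per arm for a clustered two-arm design:
    individual-level n times the design effect 1 + (m - 1) * icc,
    where m is the number of individuals sampled per cluster."""
    z = NormalDist().inv_cdf
    n_ind = 2 * ((z(1 - alpha / 2) + z(power)) * sd / mde) ** 2
    return n_ind * (1 + (m - 1) * icc)

# Walk the checklist: pick a design, compute n, cost it out, adjust, repeat.
cost_per_person = 15  # hypothetical survey cost
for n_arms, mde in [(4, 0.15), (3, 0.15), (2, 0.20)]:  # e.g. cutting arms
    n = cluster_sample_size(mde, sd=1.0, icc=0.05, m=20)
    budget = n_arms * n * cost_per_person
    print(n_arms, "arms:", round(n), "per arm, budget approx", round(budget))
```

In practice the residual sd and ICC in step 2 would come from pilot or secondary data rather than the placeholder values used here.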
Residual variance & power relationship
Higher outcome variance reduces power: by chance, randomization may assign more “successful” people to treatment, and the noisier the outcome, the harder it is to distinguish the true treatment effect from such chance imbalances
- Some variation can be explained by observables
- Using controls in the analysis soaks up variance, so the impact is estimated more precisely and power increases
- Calculate residual variance by regressing the outcome on controls in existing data
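A minimal sketch of that last step (Python, simulated stand-in for "existing data"; with a single control the regression residual variance reduces to Var(y) × (1 − r²), and with several controls one would take the variance of the multivariate regression residuals instead):

```python
import random
from statistics import mean, variance

# Simulated existing data: outcome partly explained by a baseline covariate
# (the 0.7 coefficient and noise sd are illustrative assumptions).
random.seed(2)
x = [random.gauss(0, 1) for _ in range(2000)]
y = [0.7 * xi + random.gauss(0, 1) for xi in x]

# Residual variance after controlling for x: Var(y) * (1 - r^2).
mx, my = mean(x), mean(y)
cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / (len(x) - 1)
r2 = cov ** 2 / (variance(x) * variance(y))
resid_var = variance(y) * (1 - r2)
print(round(variance(y), 2), round(resid_var, 2))  # controls soak up variance
```

The residual variance (not the raw outcome variance) is what belongs in the power formula, which is why controls buy a smaller MDE for the same sample.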