Brazy's Exam Hints? Flashcards

Question

What is a functional relationship?

Answer 1

How endogenous variables are produced; studying the cause (D) and effect (Y)

Answer 2

Parametric models contain more assumptions about the nature of the relationship between cause and effect. Non-Parametric models simply state that there is a relationship but we don't know why or how. DAGs conceptualise this.

Answer 3

Represent causal relationships between variables. Graphical representation of models informed by theory.

Answer 4

Backdoor paths are non causal relationships between our treatment and outcome. (eg, bias) Two ways to close open backdoors: 1.Introduce a confounder variable (X) and condition them by 'adding' a control 2. When the confounder is a collider, the backdoor path is closed. NEVER control for colliders. If all backdoor paths have been closed, then we have met the backdoor criterion and you can credibly argue for causal inference.

Answer 5

Exogeneity: there is no confounder between Z and Y Excludability (No other direct effect): Z only affects A through D Monotonicity: The effect of the instrument on treatment is 0 or positive for all units. Assumptions should be based on the theory/model.

Answer 6

TYPE I ERROR: false positive, we rejected the null hypothesis when it is actually true. (Eg, lump declared cancer (H1). You go through chemo, it wasn't cancer, you die) TYPE II ERROR: false negative, we accept the null hypothesis but it's actually false. (Eg, lump wasn't declared cancer (H0), you don't go through chemo, you die)

Answer 7

Causal research: identify cause and effect relationship between variables Descriptive research: identify relationships that are not necessarily causal.

Answer 8

Experimental: Conditions have been manipulated by us to understand the relationship between two variables Observational: We observe natural variations about the relationship between two variables

Answer 9

The inquiry is descriptive, but researcher assigns the treatment/manipulates conditions. So, how is it not causal? Well, we aren't looking to identify causal relationships, we are trying to find a latent variable that might explain the relationship between two variables (like discriminatory views, ideological support, etc) Examples: Audit, List, Conjoint experiments and Behavioural games

Answer 10

Experimental causal research design tries to understand causal relationships by studying what actually happened to what would have happened if the conditions were different (comparing counterfactual states). Random assignment is used. to manipulate conditions. Examples: Two Arm Randomised experiments Two Arm design with pre treatment covariates Block Randomised experiement Cluster randomised experiments Subgroup designs Factorial experiments Encouragement design, Placebo-controlled experiment Stepped wedge experiments Randomised saturation experiments

Answer 11

The inquiry is descriptive and there is no treatment effect because we want an answer to an observation. Examples: Surveys, official statistics, etc. What matters is our sampling method as it will dictate how accurate our observations are.

Answer 12

Our sampling method will dictate how accurate our observations are. Examples: Cluster sampling Intra -Cluster Correlation (ICC) Multilevel regression and Post stratification Partial pooling Index Creation for latent variables

Answer 13

Our inquiry is causal but we use data that already exists. Researchers do not assign treatment, the 'natural world' does, be it random or not. Example: Process Tracing Selection on Observables (multivariate regression) Difference in Difference (DiD) Instrumental Variables 'two stage process'

Answer 14

We want to understand the difference between potential outcomes of our counterfactuals, but we can only see one outcome (treated vs, untreated)

Answer 15

Potential outcomes are the outcomes of our counterfactuals. 'What if' scenario outcomes.

Answer 16

Figure 8.2: Nine kinds of random sampling. In the first row individuals are the sampling units, in the second row clusters are sampled, in the third clusters are sampled and then individuals within these clusters are sampled. In the first column units are sampled independently, in the second units are sampled to hit a target, in the third units are sampled to hit targets within strata.

Answer 17

Stepped-Wedge Random Assignment Procedure: Imagine you're doing an experiment where you want to give some people a treatment, like information on how to vote, over several time periods. Instead of giving everyone the treatment at once, you divide them into groups. In the first period, only some get the treatment. Then in each following period, more people get the treatment until everyone has it. The figure shows how the treatment is assigned over these three periods. For example, in the first period, one third of the units receive the treatment. In the second period, another third receive the treatment, and in the final period, the remaining units are treated.

Answer 18

Every unit has an equal chance of being selected. Design based inference. Example: Simple random - 'every citizen has a 10% chance of being picked' Stratified Random - Every unit within a group has the same chance of being sampled 'every citizen under 18 has 10% chance of being picked' Cluster - randomly pick subgroups within your population, then sample all individuals within the group 'every school in the country has a chance at being picked, 5 were picked and every student in each school was was sampled' Multistage - Take a cluster sample of your population, then take a cluster sample of the first sample, then cluster sample from the second sample. This is your representative sample.

Answer 19

Selecting individuals for sample. Used for targeting specific groups for study. Model based inference. Examples Convenience sampling: Selecting a sample based on what's available at the moment. Bad external validity, potential bias. Can only analyse Sample Average Treatment Effect (SATE) Purposive Sampling: hand picking the participants of your study based on specific criteria. Respondent driven 'Snowball' sampling: ask respondents to recommend others.

Answer 20

Who gets the treatment and how is that being chosen. Similar to sampling assignment. Only used in experimental designs as descriptive designs, the treatment has already been given.

Answer 21

There are only two conditions: treated and untreated/control. Examples Simple random assignment: Everyone has the same chance of getting treatment Complete random assignment: Everyone has the same chance of getting treatment but we need a certain number for our sample Block Random assignment: divide population into 'blocks' or groups based on specific characteristics (same gender) important to the study; then randomly assign within blocks. Good internal validity Cluster Random assignment: group people into 'clusters' (same neighbourhood), randomly sample clusters, assign treatment to all individuals in the cluster. Block and Cluster Assignment: Units are assigned as clusters and clusters are nested within blocks. Saturation Random Assignment: Different clusters are assigned a saturation level of treatment, you then randomly assign individuals within the clusters to receive the prestated saturation level of treatment.

Answer 22

There are more than two conditions: treatment 1, treatment 2, treatment 3, untreated/control You randomly assign a fixed number of participants to each condition groups.

Answer 23

A factorial design is when you study the effects of multiple treatments or factors simultaneously. It allows you to see how each treatment affects the outcome on its own and how they interact with each other. It means studying a group that doesn't watch a propaganda video but talks with trump, studying a group that watches the video but doesn't talk with trump, studying a group that does both, and studying a group that does neither.

Answer 24

Over time designs give treatment to units over multiple time periods. Instead of comparing treatment, you look at how outcomes change within the same units over time (before and after effects). Examples Stepped Wedge design: Assign treatment to different units in different time periods, gradually increasing until all units have received the treatment. Crossover design: Units are initially assigned one treatment, then in a later period, they 'cross over' to the opposite treatment

Answer 25

Researchers assign treatment on chosen units. Alternating assignment: Even numbered participants receive treatment, Odd numbered participants don't receive treatment. Regression discontinuity/cutoff: A cut off 'score' is assigned to the sample. Those above the cutoff receive treatment, those below do not. Researchers compare the outcomes of the two groups. Bayesian: The Bayesian approach in non randomised assignment uses prior information to make educated guesses about the treatment effectiveness for different individuals. It then 'optimally' assigns treatments based on these predictions to maximise the effects of the treatment for each person.

Answer 26

How far estimates are from the expected value for a given sample

Answer 27

1. Point Estimation: estimate a single value that represents a parameter of interest using descriptive statistics (mean) or regression coefficient Then calculate standard error. 2. Hypothesis testing: They are quanitative or qualitative tests. We state there is NO relationship and to get our answer we hope to reject the null hypothesis. We use p-values to give you the probability that your estimate could have occurred if the true population was the sample. 3. Bayesian Formulations: Prior beliefs are incorporated. Bayesian answers are the probabilities of answers being right (ie, the estimand) 4. Interval Estimation: Rather than a point or probabability, we estimate a range of answers where we think the estimand lies with some degree of confidence.

Answer 28

In Frequentist designs, we use a 95% confidence interval (95% of the time our estimand will lie in this range) In Bayesian designs, there is a 95% chance the estimand is in this range. Because Bayesian designs are more informed, the confidence interval may be more narrow. In Extreme Bounds, it is the best and worst case scenario in our sample - on each end.

Answer 29

The main difference is that bayesian formulation used informed priors while frequentists don't.

Answer 30

y = βo + β1x + u y = outcome variable (dependent) x = explanatory variable (independent) u = error term (difference between observed values and values predicted by regression, represents all other factors that influence the outcome variable but not included in the model βo = intercept parameter (value of the outcome variable when the explanatory variable is zero) β1 = slope parameter (the change in the outcome variable for a one unit change in the explanatory variable. Indicates slope strength and direction) Why do we use it? 1. We use the linear regression model minimise the amount of error in our fitted equation. 2. It can also help us control for confounding variables. 3. Measure significance of relationships. 4. Make predictions about future outcomes from what we know about the relationship and provide a basis for future models.

Answer 31

Plug in principle: Estimate a parameter and then 'plug in' observable data into those parameter Analyse as you randomise: Adapting the analysis method (answer strategy) to account for imperfections in our random sampling (data strategy)

Answer 32

Systematic error in estimation; average estimate consistently differs from true value. Because of other factors affecting outcome that have not been addressed

Answer 33

The probability of a statistically significant result. We want more 'power' to reject the null hypothesis which means we need a higher chance to get a statistically significant result.

Answer 34

Root Mean Squared Error is a diagnosand quantity that tells us how many outliers/variance there is in our estimate.

Answer 35

1. declare_model() Includes, number of units, unit characteristics, potential outcomes, treatment effect sizes, effect heterogeneity 2. declare_inquiry() Specifies the research question and the answer you're seeking (the estimand). This could be a causal or descriptive inquiry. Can declare estimands for groups (CATEs, difference in CATEs) 3. declare_sampling() Outlines the procedures used to select your sample units from the population. 4. declare_assignment() Defines how treatments or conditions are assigned to the units in your study. 5. declare_measurement() Details the procedures used to measure the variables in your research (eg, index creation) 6. declare_estimator() Specifies the method for estimating the answer (e.g., linear regression, point estimate, interval estimate) based on your data and inquiry. 7. declare_test() All these functions put together. Defines the statistical tests used to assess the significance of your findings.

Answer 36

Inter cluster correlation represents the difference of variation between clusters and within clusters. If ICC is high (close to 1) then it means that most variation in outcomes is due to differences between clusters If ICC is low (close to 0) then it means that most of the variation in outcomes is due to differences between individuals within clusters. Example: Imagine we're studying political preferences in neighborhoods (clusters) within a city. We randomly select several neighborhoods and survey people in each one about their political views. We find that in neighborhoods with high ICC, like tight-knit communities, most people have similar political preferences. So, if ICC = 0.8, it means 80% of the variation in political preferences is due to differences between neighborhoods, and only 20% is due to differences between individuals within the same neighborhood. On the other hand, in neighborhoods with low ICC, like diverse or transient areas, people have very different political preferences. So, if ICC = 0.2, it means only 20% of the variation in political preferences is due to differences between neighborhoods, and 80% is due to differences between individuals within the same neighborhood

Answer 37

Balance between full and non pooling. Both shared and individual-level estimates within a hierarchical model. It borrows strength from both group-level and individual-level data. Is used in MRP. Statistical technique used to make better estimates by borrowing information from similar groups or clusters when we have limited data from each individual group. If you're trying to measure public opinion in different states, but not enough from each state. You combine all the information from all states and using a 2 step process (MRP) you get a better estimate for each individual state. You use a plug in model into a regression analysis and then adjust these estimates based on what we know about the population of each state.

Answer 38

MRP is used to estimate population level quantities for smaller subgroups For example, we want to analyse public opinion oh high school graduates and non graduates across the US. First, multilevel regression is used to estimate average opinions of these two groups across each state. Second, poststratification is used to 'reweight' these estimates to match the proportions of each groups within each states. So we might have a lot more HS graduates in California then in Wyoming so we can borrow californian data to help wyoming.

Answer 39

In observational descriptive, no treatments are allocated by the researcher. In observational causal, the natural world is responsible for the observed treatment conditions. (Causal inferences imagines the outcomes had they not been treated)

Answer 40

Since the natural world provides the treated conditions and researchers simply observe the treated conditions and try to make causal inferences on counterfactuals, it is difficult to infer causality because it relies on specific circumstances to occur in the natural world. Researchers don't have the ability to control the treatment assignment at all. Process tracing necessitates finding the right clues to understand causality. Selection-on-observables requires measuring variables that eliminate alternative explanations for the observed outcomes. Difference-in-difference designs need stable patterns over time to make valid comparisons. Instrumental variables designs rely on nature randomly assigning a variable that can be measured. Regression discontinuity designs hinge on a clear cutoff point that determines treatment.

Answer 41

Process tracing is a qualitative research method where we try to observe data (usually in a case study) to see whether there is a causal relationship or not. (eg, trade preference = increase in exports?) A 'hoop clue' is a piece of evidence that strongly suggests a causal link. (eg, customs declaration forms that used trade preference) A 'smoking gun' is the exact piece of evidence that directly confirms the causal relationship. (eg, exporters using trade preference leads to higher exports)

Answer 42

DiD evaluates the causal effect of a treatment. It compares the before and after ATT (average treatment effect of the treated) to a control group. The DiD measures the difference between pre and post treatment outcomes and then subtracts it from the control group. An important assumption of the DiD design is the 'parallel trends assumption'. This assumption states that the changes in outcomes over time would have been the same for the treated and untreated groups if the treatment had not been implemented. DiD is often used in settings with two time periods (before and after) and two groups (treated and untreated). For more complex studies with multiple periods/groups, you can use panel data analysis but parallel trends assumption becomes weaker.

Answer 43

The IV approach aims to study the effect of a treatment on an outcome. But confounding variables can influence the outcome too. The IV approach controls for other factors that might affect both treatment and outcomes. For example, you want to see how studying affects good grades. But you cannot tell 50% of the class to study harder (assign treatment). So you might look (observe) at naturally talented (instrument) students who tend to study more (treatment) than others. Ideally, being talented shouldn't affect your grades (outcome) other than through studying more. It's a two step approach. 1. Identify the effects of instrument (Z) on treatment (D) See how talent affects studying: talented students generally study more than non talented students - talent 'nudges' student to study more 2. Isolate the effect of treatment (D) by subtracting the treatment effects that are caused by Z. Assuming talent only affects grades through studying, we can use the difference in studying habits between talented and non talented students to estimate the true effect of studying on grades. The effect of the treatment on outcome is only measurable for compliers = Local Average Treatment Effect (LATE) LATE: In an IV approach there are four groups: Compliers (Instrument nudges treatment in the right direction) Never takers (Population that doesn't take treatment despite instrument) Always takers (Population that always takes treatment despite instrument) Defiers (Instrument nudges treatment in wrong direction).

Answer 44

RDD designs assign treatment based on a cutoff point of a certain characteristic (running variable). If you are above the cut off point you get treatment (D) and if not, you don't receive it. The study focuses on those below the threshold (control) that didn't get the treatment, and those slightly above the threshold (received treatment, but not that much different from those that didn't). Therefore it only studied the LATE. It assumes that the potential outcomes are smooth throughout the threshold (ie, if there was no treatment, there would be no sudden effect on Y if you reached the threshold)

Answer 45

Two arm randomised experiment designs are when subjects are assigned to the treatment or the control. Researcher then measures the difference. Different strategies of random assignment (simple, block, cluster, etc) We assume the Stable Unit Treatment Value Assumption (SUTVA): outcome doesn't change just because a different group got the treatment. It can only make inferences to the sample, therefore it can only measure the Sample Average Treatment Effect (SATE). Internal validity is good, external validity is dependent on how representative the sample is on broader pop.

Answer 46

Subgroup designs are used to study how the effect of treatment might differ between groups of people (CATEs) They are interested in measuring the Conditional Average Treatment Effect (CATE) between subgroups. CATE is basically the average effect of the treatment for a specific group of people. Non random sampling + treatment assignment: needs to be representative, subgroups are naturally occurring Subgroup designs provide a descriptive difference but can't prove causal link between being in a subgroup and effect of treatment Similar to factorial designs but factorial design are randomly assigned treatment.

Answer 47

Use cluster random sampling to understand causal link between treatment and effect. It measures the CATE (conditional average treatment effect) as it only measures treatment effect at the people belonging to the cluster. Remember high ICC (higher variance within the cluster), low ICC (higher variance from cluster to cluster)

Answer 48

MODEL: units, assumptions about behaviours INQUIRY: descriptive, summary of unit characteristics in the population, and not as a function of potential outcomes. Just wants to see proportion of sample that responded to treatment (not trying to infer a causal link, not ATE) DATA STRATEGY: Random assignment ANSWER STRATEGY: Difference in means to estimate the inquiry No counterfactuals. We define the inquiry as a summary of unit characteristics in the population, and not as a function of potential outcomes.

Answer 49

MODEL: Units, assumptions about behaviours INQUIRY: descriptive, prevalance rate (not ATE), summary of unit characteristics in the population, and not as a function of potential outcomes. DATA STRATEGY: Random assignment ANSWER STRATEGY: Difference in means to estimate the inquiry No counterfactuals. We define the inquiry as a summary of unit characteristics in the population, and not as a function of potential outcomes.

Answer 50

MODEL: units (survey respondents), behavioural assumptions INQUIRY: Descriptive. The specific inquiry (AMCE) depends on the researcher's choices about the attribute levels and randomisation scheme used in the experiment design. DATA STRATEGY: Design choices, randomisation of attribute levels ANSWER STRATEGY: descriptive statistics of AMCE No counterfactuals. We define the inquiry as a summary of unit characteristics in the population, and not as a function of potential outcomes.

Answer 51

Conjoint experiments aim to describe preferences in a hypothetical scenario (no counterfactual). Researchers create profiles with different feature, show comparison and then ask people to rate/choose them. It studies the Average Marginal Component Effect (ACME, the average difference in one unit attribute averaging over all the levels of the other unit attributes. Satisficing means that respondents choose the first option that meets their minimum requirements which can lead to bias results. Masking happens when an important attribute is left out of the experiment, but satisficing will make it worse.

Answer 52

Covariates are measurable characteristics of the participants that are related to the outcome but not caused by the treatment (eg, Age (covariate) affecting voter turnout (outcome), but we want to measure affect of phone campaigns (treatment) on voter turnout) It can help account for pre-existing differences between treatment and control that might influence the outcome. Covariate adjustment in two arm randomised experiments is used to increase the precision of the estimated treatment effect, not to reduce bias. (eg, covariate adjustment can help account for natural variation in voter turnout due to age, so that you can het a more precise estimate of the true effect of the phone banking campaign) Two types of covariate adjustment methods: 1. Stratified random assignment 2. Statistical adjustment (regression analysis) The Lin estimator is a specific way to adjust for covariates that is guaranteed to be at least as precise as the unadjusted estimate.

Answer 53

Complier (takes treatment when told to take treatment) Defier (doesn't take treatment when told to take treatment) Always taker (takes treatment regardless of assigned or not) Never taker (never takes treatment regardless of assigned or not)

Answer 54

ATE = Average Treatment Effect Average difference in outcomes between treatment and control. Considers all units in the study Used in - experimental causal (block random, stepped wedge, factorial) - experimental descriptive (audit, conjoint) - observational causal (instrumental variable) CATE = Conditional Average Treatment Effect Refers to the average treatment effect for a specific subgroup which is defined by some condition Used in - Experimental causal (subgroup, factorial, randomised saturation) ----- LATE = Local Average Treatment Effect Refers to the average treatment effect of compliers only Used in - Observational causal (Instrumental variables) - Experimental causal (Encouragement design) ------ ATT = Average Treatment Effect of the Treated Refers to the average treatment effect of units who received the treatment Used in - Observational causal (DiD) ------- SATE = Sample Average treatment effect Refers to the average difference in outcomes limited to the sample only (internal validity) ATU = Average Treatment Effect on the Untreated Refers to the average treatment effect on units who did not recieve the treatment ITT = refers to the average difference in outcomes between units assigned to treatment and assigned to control, regardless of whether they ACTUALLY received treatment or not. Reflects only the effect of being assigned to a treatment group, not the treatment itself. Used in - Observational causal (instrumental variables) - Experimental causal (encouragement design)

Answer 55

Treatment is assigned gradually in multiple stages. Units are divided into groups, treatment is randomly assigned to one group at a time. In each period, a portion recieves the treatment for the first time. Everyone eventually gets the treatment, researchers measure the outcomes for each group. Unlike two arm designs, stepped wedge designs gradually rolls out treatment Unlike DiD, stepped wedge uses randomisation to mitigate bias.