Oral Exam Questions Flashcards
OLS. What does this abbreviation stand for? What does it mean?
Ordinary Least Squares. Method used in linear regression to find line of best fit for data points. Minimizes sum of squared differences between actual and predicted values on the line.
How do we test whether a coefficient is different from zero? [t-statistics]
We use the t-test: t = Beta^hat / SE(Beta^hat), to check whether Beta is significantly different from 0 or whether the estimate is just random chance. If the t-stat is large (roughly above 2 in absolute value at the 5% level), Beta is likely different from zero. If the t-stat is small, you can't confidently say Beta is different from zero.
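A minimal sketch of the t-statistic for the slope in a simple regression, using made-up illustrative data (pure Python, no libraries):

```python
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 8.1, 9.8]
n = len(xs)
mx = sum(xs) / n
my = sum(ys) / n

# OLS slope and intercept for y = alpha + beta*x
sxx = sum((x - mx) ** 2 for x in xs)
beta = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
alpha = my - beta * mx

# Residual variance uses n - 2 degrees of freedom (two estimated parameters)
ssr = sum((y - (alpha + beta * x)) ** 2 for x, y in zip(xs, ys))
s2 = ssr / (n - 2)
se_beta = (s2 / sxx) ** 0.5

# Compare |t| with the critical value (about 2 at the 5% level)
t_stat = beta / se_beta
```

Here the slope is precisely estimated (small SE relative to beta), so the t-stat is far above 2 and we reject Beta = 0.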
How do we interpret the coefficient of interest? [e.g., test_score = α + β teacher_student_ratio + u]
The coefficient Beta (B) represents the impact of a one-unit change in the teacher-student ratio on the test score. B>0: positive impact; B<0: negative impact; B=0: no impact.
Economic significance. How do we measure it? Why do we need it?
Economic significance asks if an effect is big enough to matter in real life.
* Measured by the size of the coefficient in context.
* Needed because statistical significance alone doesn’t tell us if an effect is practically important.
How do we measure goodness of fit? [R-squared, intuition behind its construction, limitations]
We measure goodness of fit by the R-squared. It compares how well the model’s predictions match the actual data versus a baseline (mean of the data). Values range from 0 to 1 (higher is better).
* Doesn’t measure whether the model is correctly specified or whether variables are relevant
* Never decreases when more variables are added, even irrelevant ones
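A minimal sketch of the construction, R-squared = 1 - SSR/SST, with made-up actual vs. predicted values:

```python
actual = [3.0, 5.0, 7.0, 9.0]
predicted = [2.8, 5.3, 6.9, 9.1]

mean_y = sum(actual) / len(actual)
# Total variation around the mean-only baseline
sst = sum((y - mean_y) ** 2 for y in actual)
# Variation left unexplained by the model
ssr = sum((y - p) ** 2 for y, p in zip(actual, predicted))

r_squared = 1 - ssr / sst  # close to 1 here: predictions track the data well
```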
Why do we include control variables in regressions?
We include control variables in regressions to account for factors that might affect the dependent variable. This helps isolate the effect of the main variable of interest by holding other factors constant, improving the accuracy of the coefficient estimates and reducing omitted variable bias.
How do we interpret the coefficient of interest in a multiple regression? [e.g., test_score = α +
β teacher_student_ratio + γ avg_f amily_income + u]
The coefficient Beta represents the change in test score for a one-unit increase in the teacher-student ratio, holding average family income constant.
What is adjusted R-squared? Why do we need it? [intuition behind its construction, limitations]
Adjusted R-squared accounts for the number of predictors in a model, penalizing excessive use of irrelevant variables. Provides a more accurate measure of goodness of fit but can be misleading if the model is incorrectly specified.
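A minimal sketch of the penalty, using the standard formula adj. R² = 1 - (1 - R²)(n - 1)/(n - k - 1) with illustrative values:

```python
def adjusted_r2(r2, n, k):
    """n = observations, k = number of predictors."""
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)

# Same raw R-squared, but more predictors -> lower adjusted R-squared
a_few = adjusted_r2(0.80, n=50, k=2)
a_many = adjusted_r2(0.80, n=50, k=10)
```

Because the penalty grows with k, adding an irrelevant variable can lower adjusted R-squared even though raw R-squared never falls.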
What is F-test? Why is t-test not enough?
The F-test is used to determine whether a group of variables collectively has a significant effect on the dependent variable. It complements the t-test by evaluating the overall fit of the model, allowing us to see whether at least one predictor significantly contributes to explaining variation in the dependent variable.
Which hypotheses can we test with F-test?
We can test:
* Overall model significance: whether at least one of the regression coefficients is significantly different from zero.
* Nested model comparison: whether additional predictors provide a significantly better fit than a simpler, restricted model.
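A minimal sketch of the F-statistic comparing a restricted model (q coefficients set to zero) against the unrestricted model; the SSR values below are illustrative:

```python
def f_stat(ssr_restricted, ssr_unrestricted, q, n, k):
    """q = number of restrictions tested, k = predictors in the unrestricted model."""
    return ((ssr_restricted - ssr_unrestricted) / q) / (ssr_unrestricted / (n - k - 1))

# Restrictions cost 20 units of fit across q=2 coefficients -> large F, reject H0
f = f_stat(ssr_restricted=120.0, ssr_unrestricted=100.0, q=2, n=103, k=2)
```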
OLS assumption 1: Zero conditional mean. What is it? What happens if it doesn’t hold?
Zero conditional mean assumption states that: The expected value of the error term is zero given the independent variables. If it doesn’t hold, it leads to biased and inconsistent coefficient estimates, making it difficult to determine the true effects of the independent variables.
OLS assumption 2: Random sampling. What is it? What happens if it doesn’t hold?
Random Sampling Assumption: data points are collected randomly from the population, ensuring that every individual has an equal chance of being selected. If it doesn’t hold, it can result in sampling bias, leading to non-representative data and biased estimates.
OLS assumption 3: Rare outliers. What is it? What happens if it doesn’t hold?
Rare Outliers Assumption: extreme values in the data are rare and do not unduly influence the regression results. If it doesn’t hold, outliers can skew the estimates and lead to misleading conclusions, inflated coefficients, and reduced model accuracy.
OLS assumption 4: No multicollinearity. What is it? What happens if it doesn’t hold?
No Multicollinearity: the independent variables in a regression model should not be highly correlated. If it doesn’t hold, it can lead to inflated standard errors, making it difficult to determine the individual effect of each predictor.
VIF. What is it? Why do we need it?
VIF (Variance Inflation Factor) measures how much the variance of an estimated regression coefficient increases because the independent variables are correlated.
We need VIF to detect multicollinearity. A high VIF (above 10) indicates problematic correlation among predictors, suggesting that the coefficient estimates may be unreliable.
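A minimal sketch of the formula VIF_j = 1/(1 - R²_j), where R²_j comes from regressing predictor x_j on the other predictors (the R² values below are illustrative):

```python
def vif(r2_j):
    """r2_j: R-squared from regressing predictor j on all other predictors."""
    return 1.0 / (1.0 - r2_j)

low = vif(0.20)   # little overlap with other predictors
high = vif(0.95)  # heavy multicollinearity: VIF = 20, well above the common cutoff of 10
```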
Which methods are used to deal with outliers?
Remove them if they are deemed irrelevant.
Trim the dataset, dropping observations beyond a chosen percentile.
Winsorize the data, replacing extreme values with the nearest values within a specified range.
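A minimal sketch of winsorizing at fixed cutoffs; unlike trimming, extreme values are clipped to the bounds rather than dropped (the bounds below are illustrative):

```python
def winsorize(values, lower, upper):
    # Clip each value into [lower, upper] instead of removing it
    return [min(max(v, lower), upper) for v in values]

data = [1, 2, 3, 4, 100]                     # 100 is an outlier
clipped = winsorize(data, lower=1, upper=10)  # the outlier becomes 10
```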
OLS property 1: Unbiased. What is it? Why is it important?
On average, the estimated coefficients equal the true population parameters. It is important because it ensures that the conclusions drawn from the regression are valid.
OLS property 2: Consistent. What is it? Why is it important?
As the sample size increases, the estimated coefficients converge to the true population parameters. It is important because it assures us that with enough data, our estimates will become more accurate and reliable.
OLS property 3: Normally distributed. What is it? Why is it important?
When the sample size is large, the sampling distribution of the coefficients approaches a normal distribution due to the central limit theorem.
Important because it enables valid hypothesis testing and construction of confidence intervals.
Heteroscedasticity of errors. What is it? Why should we care about it?
Occurs when the variance of the error terms in a regression is not constant across all levels of the independent variable. We care about it because it makes OLS estimates inefficient and the usual standard errors biased, invalidating t- and F-tests; heteroscedasticity-robust standard errors are a common fix.
Biases: Sample selection bias. What is it? How can we correct it?
Occurs when the sample used is not representative of the population due to a non-random selection process, which can lead to biased estimates and conclusions. Can be corrected with a Heckman two-step selection model or by collecting a more representative sample.
Biases: Omitted variable bias. What is it? How can we correct it?
Occurs when a relevant variable that affects the dependent variable is left out of the regression model.
Can lead to biased and inconsistent coefficient estimates. To remedy this, we can try to include the omitted variable.
Biases: Simultaneity bias. What is it? How can we correct it?
Simultaneity bias happens when the explanatory variable is correlated with the regression error term, ε, because causality runs both ways: X causes Y, but Y also causes X (reverse causality).
Correct with Instrumental Variable regression, 2SLS.
Biases: Attenuation bias. What is it? How can we correct it?
Occurs when an independent variable is measured with error, biasing the estimated effect toward zero (usually smaller coefficient estimates).
A larger sample does not remove the bias, since it persists asymptotically; correct with instrumental variables or better-measured data.
Polynomials. How do we interpret coefficients? [e.g., wage = α + β1 age + β2 age2 + u]
The quadratic term indicates how the effect of age on wage changes as age increases: the marginal effect of age is B1 + 2*B2*age. A positive B2 means the impact of age on wage grows as age rises; a negative B2 means it shrinks (an inverted-U wage profile).
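A minimal sketch of the age-dependent marginal effect in wage = a + B1*age + B2*age^2, with illustrative coefficients:

```python
# Illustrative (made-up) coefficients: positive linear term, negative quadratic term
b1, b2 = 5.0, -0.05

def marginal_effect(age):
    # d(wage)/d(age) = b1 + 2*b2*age: the slope depends on where you evaluate it
    return b1 + 2 * b2 * age

young = marginal_effect(25)  # positive: wage still rising with age
old = marginal_effect(60)    # negative: past the turning point at age -b1/(2*b2) = 50
```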