WWWW Flashcards
F-Distribution + F-Stat
Distribution:
- The ratio of two chi-squared distributed variables, each divided by its degrees of freedom
- Used to test whether group means are significantly different from one another (e.g., ANOVA)
Example: groups A, B and C are put on 10 mg, 5 mg and a placebo.
- Mean Square Between (MSB) = the variance between the group means
- Mean Square Error (MSE) = the average variance within the groups
F-stat = MSB/MSE. A large F-stat indicates that the population means may not all be equal.
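A minimal sketch of the computation on made-up measurements for the three groups above (scipy's f_oneway computes the same MSB/MSE ratio):

# Hypothetical one-way ANOVA: F-statistic for three dose groups (made-up data)
from scipy import stats

group_a = [5.1, 4.8, 5.5, 5.0]   # 10 mg
group_b = [4.2, 4.5, 4.0, 4.3]   # 5 mg
group_c = [3.9, 4.1, 3.8, 4.0]   # placebo

f_stat, p_value = stats.f_oneway(group_a, group_b, group_c)
print(f_stat, p_value)           # a large F suggests the group means are not all equal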
goodness of fit
Explained Sum of Squares (ESS)
- Sum of squared differences between the predicted values and the mean of the dependent variable
Sum of Squared Residuals (SSR)
- Sum of squared differences between the observed and predicted values
Total Sum of Squares (TSS)
- Sum of squared differences between the observed dependent variable and its mean
- TSS = SSR + ESS (and R² = ESS/TSS)
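A quick numerical check of the decomposition and of R² = ESS/TSS on made-up data:

# Verify TSS = ESS + SSR for a simple OLS fit (made-up data)
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

b1, b0 = np.polyfit(x, y, 1)            # OLS slope and intercept
y_hat = b0 + b1 * x

ess = np.sum((y_hat - y.mean()) ** 2)   # explained sum of squares
ssr = np.sum((y - y_hat) ** 2)          # sum of squared residuals
tss = np.sum((y - y.mean()) ** 2)       # total sum of squares

print(tss, ess + ssr)                   # equal up to rounding
print("R^2 =", ess / tss)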
unbiased estimators conditions
linear in parameters,
random sampling,
sample variation in explanatory variable,
zero conditional mean
significance level
The significance level is the probability of rejecting the null hypothesis when it is in fact true. A 5% significance level means that we have a 5% probability of rejecting the null when it is true.
significance probability
The probability of drawing a statistic at least as adverse to the null hypothesis as the one you computed in your sample, assuming that the null hypothesis is true.
What is meant by the size of a test?
In hypothesis testing, the size of a test is the probability of committing a Type I error, that is, of incorrectly rejecting the null hypothesis when it is true.
p-value
- The p-value is the probability, assuming the null hypothesis is true, of obtaining a test statistic at least as extreme as the one observed. We reject the null when the p-value is below the significance level.
- We usually use a 5% significance level, meaning that if a medicine with no real effect is tested, the result will (wrongly) tell us it has an effect about 1 time in 20.
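A hedged sketch of how a two-sided p-value is obtained from a test statistic, using the standard normal approximation and made-up numbers:

# Two-sided p-value for a z statistic under H0: coefficient = 0 (made-up numbers)
from scipy import stats

estimate, se = 1.8, 0.7
z = estimate / se
p_value = 2 * (1 - stats.norm.cdf(abs(z)))
print(z, p_value)     # reject H0 at the 5% level if p_value < 0.05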
critical value
The value of the test statistic that marks the boundary of the rejection region at a given significance level; if the computed statistic exceeds the critical value, the null hypothesis is rejected.
What are degrees of freedom?
- independent values that are free to vary in a data set
Intuitively:
A data set of four numbers where three of the values are 4, 4 and 4, and the average of the data is 4.
This means the last number must also be 4; it is not free to vary.
What happens to a confidence interval when the sample gets bigger and bigger?
It becomes narrower: as the sample size increases, the estimate becomes more precise.
How do we interpret Binary dependent variable regression
Interpreted as a conditional probability function, P(Y = 1 | X); how depends on which model:
- LPM: the coefficient is directly the change in the probability that Y = 1
- Probit: coefficients enter the standard normal cumulative probability function (cpf)
- Logit: coefficients enter the logistic cumulative probability function (lcpf)
- Probit/Logit coefficients are not marginal effects themselves; the sign gives the direction, and marginal effects must be computed separately (see the sketch below)
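A sketch comparing the three models on simulated data with statsmodels (the data and variable names are made up; get_margeff() turns Probit/Logit coefficients into marginal effects):

# LPM vs. Probit vs. Logit on a simulated binary outcome
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
x = rng.normal(size=500)
y = (x + rng.normal(size=500) > 0).astype(int)   # binary dependent variable
X = sm.add_constant(x)

lpm    = sm.OLS(y, X).fit()      # coefficient = change in P(Y = 1) per unit of x
probit = sm.Probit(y, X).fit()   # coefficient enters the normal CDF
logit  = sm.Logit(y, X).fit()    # coefficient enters the logistic CDF

print(lpm.params)
print(probit.get_margeff().summary())   # marginal effects, comparable to the LPM slope
print(logit.get_margeff().summary())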
Standard errors in LPM are always
Heteroscedastic
Why is it called LPM
Because the probability that Y = 1 is a linear function of the regressors
What is cumulative distribution function
It is the probability that the variable takes a value less than or equal to a given value x
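For instance, for a standard normal variable (a small illustration with scipy):

# P(Z <= 1.96) for a standard normal random variable
from scipy import stats
print(stats.norm.cdf(1.96))   # about 0.975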
What does Probit and Logit regression allow for that LPM doesnt
Probit and Logit models allow for a nonlinear relationship between the regressors and the dependent variable, and keep the predicted probabilities between 0 and 1.
What is the z value
- the estimated coefficient divided by its standard error
- i.e., the number of standard deviations the estimate is away from 0 on a standard normal curve (Wald test)
- Rule of thumb: |z| should be over about 2 (1.96) and p under 0.05 for H0 to be rejected
Maximum likelihood: intuition for the mean
- Imagine that you have a line of observed values.
- Then imagine that you test every point on that line to see where you get the highest likelihood of observing the data.
- When all points are checked, you pick the one that maximizes the likelihood (see the sketch below).
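A toy sketch of that grid-search intuition for the mean of a normal sample, holding the standard deviation fixed at 1 (the data are made up):

# Grid-search the mean that maximizes the log-likelihood of the data
import numpy as np
from scipy import stats

data = np.array([4.9, 5.3, 4.7, 5.1, 5.6, 4.8])
candidates = np.linspace(3, 7, 401)        # points on the line to try

log_lik = [stats.norm.logpdf(data, loc=m, scale=1.0).sum() for m in candidates]
best = candidates[np.argmax(log_lik)]
print(best, data.mean())                   # the maximizer is (approximately) the sample mean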
logistic distribution is what
a continuous distribution
likelihood in statistics means
the probability of the observed data viewed as a function of the parameters; maximizing it means finding the parameter values (e.g., the mean or standard deviation of a distribution) that make the data most probable
How do we find the best regression line
By maximum likelihood (for Logit/Probit models; for the linear model, OLS minimizes the sum of squared residuals)
if p-value is < 0.05 (Probit/Logit)
there is a statistically significant association between the regressor and the dependent (response) variable
Unbiasedness
The expected value of the estimator equals the true population parameter, so on average the estimate hits the real value
What are the properties of the estimated parameters (Logit/Probit, maximum likelihood)?
- Consistent
- Unbiased and normally distributed in large samples
When to use Logit model
Use logit models whenever your dependent variable is binary (also called dummy), i.e., takes the values 0 or 1. Logit regression is a nonlinear regression model that forces the predicted probabilities to lie between 0 and 1.
what is the main advantage of panel data
- allows for (unobserved) heterogeneity across entities
- and controls for omitted variable bias from factors that do not change over time
In panel data, is omitted variables a problem?
No; assuming the omitted variable does not change over time, it drops out when we look at changes within an entity, so the change in Y must be caused by the observed factors
what is fixed effects
A regression performed on panel data that controls for being entity i by giving each entity its own intercept. The model can be entity demeaned, time demeaned, or both. All entities share the same slope, but have different intercepts.
standard errors panel
- The default standard errors are derived under the assumption that there is no autocorrelation and no heteroskedasticity
- so the default standard errors cannot be used
- that is why we use Clustered Standard Errors: they allow for heteroskedasticity and autocorrelation WITHIN an entity, but not across entities (see the sketch below)
- autocorrelation and heteroskedasticity do not affect the coefficient values, only the standard errors
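A hedged sketch of an entity fixed-effects regression with entity-clustered standard errors in statsmodels (the data and the entity/x/y names are made up for illustration):

# Entity fixed effects (dummy-variable form) with clustered standard errors
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(0)
df = pd.DataFrame({
    "entity": np.repeat(np.arange(10), 5),   # 10 entities, 5 periods each
    "x": rng.normal(size=50),
})
df["y"] = 2 * df["x"] + 0.5 * df["entity"] + rng.normal(size=50)

fe = smf.ols("y ~ x + C(entity)", data=df).fit(
    cov_type="cluster", cov_kwds={"groups": df["entity"]}
)
print(fe.params["x"], fe.bse["x"])   # slope estimate with entity-clustered standard error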
panel assumptions
1. The error term must have conditional mean zero given ALL observations of the variable X for that entity: past, present and future
2. Variables are i.i.d. ACROSS entities:
- observations within an entity may be autocorrelated, but not correlated across entities
3. Large outliers are unlikely
4. No perfect multicollinearity
What are the conditions for a valid instrument?
RELEVANT and EXOGENOUS: the two conditions for a valid instrument:
- Instrument relevance: variation in the instrument is related to variation in X (corr(Z, X) ≠ 0).
- Instrument exogeneity: Z is correlated with Y solely through its correlation with X (corr(Z, u) = 0).
Relevant: the instrument actually affects X
Exogenous: the instrument affects Y only through X
How do we use IV?
- Z is correlated with X, but not with the error term. It has to satisfy the conditions for relevance and exogeneity.
Use Two Stage Least Squares:
- Regress X on the instrumental variable (X as dependent)
- Use the fitted values of X from this regression (X-hat) in the original regression
Let's call the instrument Z. If it satisfies the two conditions of relevance and exogeneity, we can estimate B1 using an IV estimator called two stage least squares (TSLS). TSLS is calculated in two stages. The first stage splits X into two parts: one part that is problematic and might be correlated with the error term, and one part that is problem-free. The second stage uses the problem-free part to estimate B1.
In the first stage you regress X on its instrument, which gives X-hat.
You then put X-hat into the original regression in place of X.
From our example, the intuition is that we now regress future salary only on the part of veteran status predicted by being drafted, which eliminates the bias that veterans usually earn less. (See the sketch below.)
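A rough sketch of the two stages on simulated data; the stages are done by hand here for intuition only (the standard errors from this manual second stage are not the correct TSLS standard errors, so dedicated IV routines are used in practice):

# Two stage least squares done manually on simulated data (intuition only)
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
n = 1000
z = rng.normal(size=n)                       # instrument: relevant and exogenous
u = rng.normal(size=n)                       # error term
x = 0.8 * z + 0.5 * u + rng.normal(size=n)   # X is endogenous (correlated with u)
y = 2.0 * x + u                              # true causal effect of X on Y is 2

# Stage 1: regress X on Z, keep the fitted values X-hat (the "problem-free" part)
stage1 = sm.OLS(x, sm.add_constant(z)).fit()
x_hat = stage1.fittedvalues
print("first-stage F:", stage1.fvalue)       # rule of thumb: F > 10 => not weak

# Stage 2: regress Y on X-hat; the slope is the TSLS estimate of B1
stage2 = sm.OLS(y, sm.add_constant(x_hat)).fit()
print("TSLS estimate:", stage2.params[1])    # close to 2, unlike plain OLS of y on x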
which model to calculate IV
TSLS
Two stage least squares regression
How do we test the instrument variables?
- Relevance
F-test
H0: the coefficients on the instruments in the first-stage regression are all 0,
if rejected, the instruments are relevant (rule of thumb: F > 10)
- Exogeneity
J-test
H0: all IVs are exogenous (test of overidentifying restrictions)
What are the two assumptions for including an instrumental variable?
It should be relevant (there should be a (strong) correlation to the explanatory variable) and exogenous (there should be no correlation to the error term)
What are some of the drawbacks of using IVs?
- hard to find a good IV that captures enough of the variation in the endogenous regressor while not being correlated with the error term
- IVs are often not well correlated with the endogenous variable (weak instruments)
IV variable need to satisfy which assumptions?
- Relevance
- Exogeneity
Consequences of weak instruments
If the instruments are weak, the TSLS estimator will be
- biased, and
- statistical inferences (standard errors, hypothesis tests, confidence intervals) can be misleading
Test for weak IVs with a single endogenous X
First-stage F-test on the instruments
H0: the instrument coefficients are all 0 (instruments irrelevant)
- rule of thumb: a first-stage F-statistic above 10 means the instruments are not weak
Relevancy means that
The variation in the instrument is relevant to the variation in X
What is the Least Square Assumptions?
Assumption 1: The Error Term has Conditional Mean of Zero
- Error term must not show any systematic pattern
- There must be no omitted variable bias
Assumption 2: (Xi, Yi), i = 1, ..., n, are Independently and Identically Distributed
- Independently: the observations are independent of each other and carry no information about each other. If you roll two dice, the value you get on the first die does not affect the value you get on the second.
- Identically distributed: each observation has the same probability distribution. With a deck of cards, the probability of drawing the king of diamonds is 1 in 52, and every participant has the same 1 in 52 chance.
Main: if you flip a coin 100 times, the probability of heads/tails is 50/50 on every throw (the coin has no memory), so the throws are “independent”. The probability stays the same for every throw, so they are “identically distributed”.
Assumption 3: Large Outliers Are Unlikely
X and Y have nonzero finite fourth moments (finite kurtosis); a few large outliers can otherwise give badly distorted estimates
Normality
Used to determine whether a data set is well modeled by a normal distribution and to compute how likely it is that a random variable underlying the data set is normally distributed.
What are the Gauss-Markov assumptions?
- Parameters are linear
- IID sampling
- No perfect multicollinearity
- Error term has zero conditional mean
- Homoskedasticity
- No autocorrelation
Why is regression analysis useful?
- quantify economic models
- provides the causal effect of, or relationship between, variables
- economic theory rarely gives precise numerical values, so it is better to turn to econometrics and regression analysis
- the causal effect between variables can be quantified and evaluated
- a key toolkit for scientists
what is Heteroscedasticity
- the error term does not have a constant variance.
Simultaneity bias
- one or more of the independent variables are jointly determined with the dependent variable
- X causes Y, but Y also causes X
- the two variables influence each other
- we do not get the real causal effect
- violates the zero conditional mean assumption
Supply/demand is a good example: quantity and price; investment and productivity; sales and advertising. This leads to a violation of LS.1 (zero conditional mean), hence our coefficient is biased.
Sample Selection bias
A type of bias that arises from choosing non-random data for statistical analysis, for example when people volunteer for a study: those who volunteer might share the same characteristics.
For example, you want to study veganism among undergraduate students. You send out a survey to the students in an art and culture class. Because this is not a random sample, it is not representative of the target population; these students might, for instance, be more liberal.
Measurement error in independent variable
There are often errors in the data
For example:
- Reporting error
- Coding error
- Estimation error
what is stationarity
- no trends or seasonality
- its statistical properties do not change over time
- constant mean and variance
What does heteroscedasticity lead to?
The coefficients don't change (they remain unbiased)
- But it leads to biased standard errors
- Biased standard errors make hypothesis testing, t-tests, p-values etc. unreliable
- OLS is no longer BLUE and the Gauss-Markov conditions fail (see the sketch below)
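A sketch of the usual fix, heteroskedasticity-robust standard errors in statsmodels (HC1 is one common variant; the data are simulated):

# Robust (HC1) standard errors: same coefficients, different standard errors
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
x = rng.uniform(0, 10, 500)
y = 1 + 2 * x + rng.normal(scale=x, size=500)   # error variance grows with x
X = sm.add_constant(x)

default = sm.OLS(y, X).fit()                 # default SEs assume homoskedasticity
robust  = sm.OLS(y, X).fit(cov_type="HC1")   # heteroskedasticity-robust SEs
print(default.params, robust.params)         # identical coefficients
print(default.bse, robust.bse)               # standard errors differ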
Why do we want time-series to be stationary?
- a nonstationary series (e.g., a random walk) has no constant mean and its variance grows without bound, which gives badly biased and misleading results (see the simulation below)
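A small simulation of the problem (made up): a random walk's variance keeps growing with time instead of staying constant:

# A random walk is nonstationary: its variance grows with t
import numpy as np

rng = np.random.default_rng(0)
walks = rng.normal(size=(10000, 200)).cumsum(axis=1)   # 10,000 simulated random walks

print(walks[:, 9].var(), walks[:, 99].var(), walks[:, 199].var())
# variance at t = 10, 100, 200 is roughly 10, 100, 200 - not constant over time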
Threats to Internal Validity of Experiments
1) Failure to randomize
- treatment not randomly assigned, but based partly on characteristics or preferences
- e.g., assignment related to ethnic differences in last names
- or when vouchers are used
* Can test for randomization: check whether the coefficients on the control variables W are 0. If assignment is random, X will be uncorrelated with W.
2) Failure to follow treatment protocol / partial compliance
3) Attrition
4) Experimental effects / Hawthorne
5) Small Sample Sizes
- a small sample does not necessarily bias the estimator of the causal effect
- but it raises a threat to the validity of confidence intervals and hypothesis tests
What are the threats to External Validity for Idealized Experiments?
- Nonrepresentative sample
- the population studied and the population of interest might differ
- the sample may only include people with one type of characteristics
- Nonrepresentative program or policy
- the policy or program of interest must be similar to the program studied for the results to generalize
- the experiment might be small-scale and differ from the real-world program
- General equilibrium effects
- turning a small, temporary experiment into a widespread, permanent program can change its effect
- some programs only work with small groups
- e.g., a training program in Zimbabwe raised wages 40% in 10 villages; rolled out nationwide, more workers become skilled and the wage gains shrink
What is Quasi-Experiments / Natural Experiments?
Two types of Quasi-Experiments:
1. Whether an individual (entity) receives treatment is “as if” randomly assigned, possibly conditional on certain characteristics
“Treatment (d) is “as if” randomly assigned”
• For example, a new policy measure that is implemented in one area but not in another, whereby the implementation is “as if” randomly assigned.
- Does immigration reduce wages? Economic theory suggests that if the supply of labor increases, wages will fall. However, immigrants tend to go to cities with high labor demand, so the OLS estimator of the effect of immigration on wages will be biased. A quasi-experiment studied Cubans who moved to Miami: the causal effect on wages of an increase in immigration was estimated by comparing the change in wages of low-skilled workers in Miami to the change in wages of similar workers in comparable U.S. cities. No effect was found.
2. Whether an individual receives treatment is partially determined by another variable that is “as if” randomly assigned
“A variable (Z) that influences treatment (d) is “as if” randomly assigned: use IV regression”
• The variable that is “as if” randomly assigned can then be used as an instrumental variable in a 2SLS regression analysis.
What is the parallel trend assumption?
In the absence of treatment, the treated and control groups would have followed the same trend in the outcome. We cannot test this directly, but if the treatment and control firms follow similar trends before the treatment, the assumption is more plausible.
What is internal validity? what is external validity?
Internal validity refers to the validity of the findings within the research study. It is primarily concerned with controlling the extraneous variables and outside influences that may impact the outcome. External validity refers to the extent to which the results of a study can be generalized or applied to other members of the larger population being studied.
How do we use difference-in-differences?
Compare the change in the outcome (delta Y) for the treated group before vs. after treatment with the corresponding change for the control group; the difference between these two differences is the estimated treatment effect. Equivalently, regress the outcome on a treated dummy, a post-treatment dummy, and their interaction (see the sketch below).
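A sketch of that regression on made-up data (the column names are hypothetical); the coefficient on the interaction term treated:post is the difference-in-differences estimate:

# Difference-in-differences via the treated x post interaction (made-up data)
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(0)
df = pd.DataFrame({
    "treated": np.repeat([0, 1], 200),   # control vs. treated group
    "post": np.tile([0, 1], 200),        # before vs. after treatment
})
df["y"] = (1 + 2 * df["treated"] + 0.5 * df["post"]
           + 3.0 * df["treated"] * df["post"]        # true treatment effect = 3
           + rng.normal(size=400))

did = smf.ols("y ~ treated * post", data=df).fit()
print(did.params["treated:post"])                    # close to 3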
MLE
maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data.
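A minimal sketch of MLE in practice: estimating the mean and standard deviation of a normal distribution by minimizing the negative log-likelihood (the data are made up):

# MLE of a normal distribution's mean and std via numerical optimization
import numpy as np
from scipy import stats, optimize

data = np.array([4.9, 5.3, 4.7, 5.1, 5.6, 4.8, 5.2, 5.0])

def neg_log_lik(params):
    mu, sigma = params
    if sigma <= 0:
        return np.inf                    # keep the optimizer away from invalid values
    return -stats.norm.logpdf(data, loc=mu, scale=sigma).sum()

result = optimize.minimize(neg_log_lik, x0=[1.0, 1.0], method="Nelder-Mead")
print(result.x)                          # close to data.mean() and data.std(), the ML estimates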