WWWW Flashcards

1
Q

F-Distribution + F-Stat

A

Distribution:
- Chi Squared

  • Want to test if one event is significntly different from another

Group A, B and C put on 10 mg, 5 mg and placebo.
- Mean Square Between (MSB) = Mean square between these groups

  • Mean Square Error (MSE) = Mean Variance of all these groups added together
    F-stat = MSB/MSE
  • A large F-stat might indicate that the population means are not equal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

goodness of fit

A

Explained Sum of Squares (ESS)
- Difference between predicted value and the mean of the dependent variable
Sum of Squared
Residuals (SSR)
- Difference between predicted and observed value at squared level.

Total Sum of Squares

  • Difference between observed dependent variable and its mean
  • TSS = SSR + ESS
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

unbiased estimators conditions

A

linear in parameters,
random sampling,
sample variation in explanatory variable,
zero conditional mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

significance level

A

The significance level is the probability of rejecting the null hypothesis when it is in fact true. a 5% significance level says that we have a prob of 5% of rejecting null when its true.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

significance probability

A

The probability of drawing a statistic at least as adverse to the null hypothesis as the one you computed in your sample, assuming that the null hypothesis is true.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

• What is meant by the size of a test?

A

In hypothesis testing, the size of a test is the probability of committing a Type I error, that is, of incorrectly rejecting the null hypothesis when it is true.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

p-value

A
  • P-value is the probability of rejecting the null hypothesis, when it should not be rejected (Type 1 error).
  • We usually use 5% significance level, meaning that if a medicine is tested and it doesn’t have a real effect, the result will tell us it has an effect 1/20 times.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

critical value

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are degrees of freedom?

A
  • independent values that are free to vary in a data set

Intuitively:
A data set of four numbers. Three of the values are 4, 4, 4. and average of data is 4.
This must mean that the last number also has to be 4. It must be 4, it is not allowed to vary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What happen to a confidence interval when then sample is bigger and bigger?

A

It becomes smaller and smaller., sample size increase, more precise

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do we interpret Binary dependent variable regression

A

interpreted as a conditional probability function

depends on which model

  • LPM easy
  • Probit, cpf
  • Logit, lcpf
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

interpret coeff probit logit

A

pdf

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Standard errors in LPM are always

A

Heteroscedastic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Why is it called LPM

A

Because the probability that Y = 1 is a linear function of the regressors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is cumulative distribution function

A

It is the probability that the variable takes a value less than or equal to X

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does Probit and Logit regression allow for that LPM doesnt

A

Probit and Logit models allows for non-linear relationship between regressors and dependent variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the z value

A

Rule of thumb: Z should be over 2 and p under 0,05 for H0 to be rejected

estimated intercept divided by the standard error

  • the number of standard deviations the estimated intercept is away from 0 on a standard normal curve (Wald test)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Maximum Likelihood Intuitive of the mean

A
  1. Imagine that you have a line of observed values.
  2. Then imagine that you test every point on that line for where you get the highest likelihood of observing the data
  3. when all areas are checked you pick the one that maximizes the likelihood
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

logistic distribution is what

A

continious distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

likelihood in statistics means

A

trying to find the optimal value for the mean or std for a distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How do wee find the best regression line

A

maximum likelihood

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

if p-value is < 0,05

probit logit

A

there is a statistically significant association between the response variable and dependent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Unbiasedness

A

Same Distribution as population and thus close to real result

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What are the assumptions on the parameters

Logit Probit

A

Consistency, Unbiased

25
Q

When to use Logit model

A

Use logit models whenever your dependent variable is binary (also called dummy) which takes values 0 or 1. Logit regression is a nonlinear regression model that forces the output (predicted values) to be either 0 or 1.

26
Q

what is the main advantage of panel data

A
  • allow for heterogenity

- and control for omitted biases

27
Q

In panel data, is omitted variables a problem?

A

No, assuming the ommited variable does not change over time, the change in Y must be caused by the observed factors

28
Q

what is fixed effects

A

A regression performed on panel data to test the effect of being in state i. The model can be either entity demeaned, time demeaned or both. All regressions will have the same slope, but different intersections.

29
Q

standard errors panel

A
  • Standard errors are found under the assumption that there is no autocorrelation or heterosced
  • so the given standard error cannot be used
  • that is why we use Clustered Standard Errors. They allow for heterosked and multicorr WITHIN an entity, but not across.
  • multicorr and heterosked does not affect coefficient value, only standard error
  • clustered: allows for heterosked and autocorr
30
Q

panel assumptions

A
  1. Error term must have mean of zero FOR ALL OBSERVATIONS OF THE VARIABLE X. past, present and future
  2. I.I.D ACCROS Entitites:
    - observations within an entity can autocorr, but not corr across entities

(3. large outliers unlikely
4. no perfect multicorr)

31
Q

What are the conditions for a valid instrument?

A

RELVEVANT and EXOGENOUS: The two conditions for a valid instrument:

  1. Instrument relevance: if an instrument is relevant, then variation in the instrument is relevant to the variation in X.
  2. Instrument Exogeneity: Z is correlated with Y solely through its correlation with X.

Relevant: It is relevant in a way that the IV actually affects X

Exogenous: Our IV only affect Y through X

32
Q

How do we use IV?

A
  • Z is correlated with X, but not with error term. It has to satidy conditions for Relevance and exogeneity

Use Two Stage Least Square:

  1. Regress X on the Instrument Variable (X as dependent)
  2. Use the calculated Y variable of this regression in the original

Lets call the instrument for Z. if it satisfy the two conditions of relevance and exogeneity, we can estimate B1 by using an IV estimator called two least squares (TSLS). TSLS is calculated in two stages. First stage splits X into two parts; one part that is problematic and might be correlated to the error term, and one other part that is problem-free. The second stage uses the problem free part to estimate B1.

In the first stage you regress x on its instrument that gives X(hatt).

You then put X(hatt) in the regression.

From our example, the intuition is now that we only regress future salary on those who got draftet, thus eliminates the bias that veterans usually earn less.

33
Q

which model to calculate IV

A

TSLS

Two least squares regression

34
Q

How do we test the instrument variables?

A
  1. Relevance

F-test
H0: coeff = 0,
if rejected, they are relevant

  1. Exogeneity

J-test
H0: All IV are exogeneous

35
Q

What are the two assumptions of including an instrumental varable?

A

It should be relevant (there should be a (strong) correlation to the explanatory variable) and exogenous (there should be no correlation to the error term)

36
Q

What are some of the drawbacks of using IVs?

A
  1. hard to find good IV that capures all of variance of exog + not corr with error
  2. IV often not well corr with endo
37
Q

IV variable need to satisfy which assumptions?

A
  • Relevance

- Exogeneity

38
Q

Consequences of weak instruments

A

If so, the TSLS estimator will be

  • biased, and
  • statistical inferences (standard errors, hypothesis tests, confidence intervals) can be misleading
39
Q

Test Weak IV’s with single X

A

F-test

H0: IV are engodenous

  • so p-value over 0,05 = all IV are exog
40
Q

Relevancy means that

A

The variation in the instrument is relevant to the variation in X

41
Q

What is the Least Square Assumptions?

A

Assumption 1: The Error Term has Conditional Mean of Zero

  • Error term must not show any systematic pattern
  • Cannot have omitted variable biases

Assumption 2: For all n are Independently and Identically Distributed

  • Independently: The variables are independent from each other. The variables does not carry information of each other. If you roll two dices, the value you get on the first dice does not affect the value you get on the second.
  • Identically Distributed: Each variable in the observation is has the same probability distribution. If you have a deck of cards, the probability of drawing a diamond king is 1 in 52. All of the participants has 1 in 52 chance of drawing a king.

Main: If you flip a coin 100 times, the probability of getting heads/tails will be 50/50 for every throw (coin has no memory), so it is “Indepentent”. The probability for every throw stays the same, so it is “Identical”

Assumption 3: Large Outliers Are Unlikely
X and Y have finite kurtosis, as several outliers can give wrong estimations

42
Q

Normality

A

sed to determine if a data set is well-modeled by a normal distribution and to compute how likely it is for a random variable underlying the data set to be normally distributed.

43
Q

What are the Gauss-Markov assumptions?

A
  1. Parameters are linear
  2. IID
  3. No perfect multicorr
  4. Error term zero mean
  5. Homosked:
  6. No autocorr
44
Q

Why is regression analysis useful?

A
  • quantify models.
  • provides causal effect or relationship between ….
  • eco theory rarely gives precise values, so better to turn to econometrics and reg analysis
  • the causal effect between variables can be quantified and evaluated
  • key toolkit for scientists
45
Q

what is Heteroscedasticy

A
  • error term doesn’t have a constant variance.
46
Q

Simultanely bias

A
  • one or more of the independent variables are jointly determined with the dependent variable.
  • X causes Y but Y causes also X
  • two variables on either side influence each other
  • dont give the real causal effect
  • Violate mean of zero assumption
Supply/demand a good example.
Quantity and price
Investments and Productivity
Sales and advertisement
This leads to violation of LS.1, hence our coefficient is biased.
47
Q

Sample Selection bias

A

A type of bias that arises by choosing non-random data for statistical analysis. For example when people volunteer for a study. Those who volunteer might share the same characteristics.

For example, you want to study the context between veganism and undergraduate students. You send out a survey to the students in class of art and culture. Because this is not a random draw sample, it is not representative for the target population. These students might be more liberal etc.

48
Q

Measurement error in independent variable

A

There are often error in the data

Feks:

  • Reporting error
  • Coding error
  • Estimation error
49
Q

what is stationarity

A
  • no trends or seasonality
  • its statistical properties does not change over time
  • constant mean and varianc
50
Q

What does heteroscedasticy lead to?

A

Coefficient doesn’t change

  • But it leads to biased Standard Errors (SER)
  • Biased SER makes hypothesis testing, t-test, p-vaules etc impossible
  • Not BLUE or Gauss-Markov
51
Q

Why do we want time-series to be stationary?

A
  • nonstationary have undefined mean and infinite variance. Makes very biased answers
52
Q

Threats to Internal Validity of Experiments

A

1) FAILURE TO RANDOMIZE
- treatment not randomly assigned, based on part of characteristics or preferences
- ethnic difference last name
- if you use vouchers
* Can test for if control variables coefficients W are 0 or not. If Random, X will be uncorrelated with W.

2) Failure to follow treatment protocol / partial complience
3) Attrition
4) Experimental effects / Hawthorne

5) Small Sample Sizes
- small sample does not necessary bias estimator of causal effect
- raises threat to validity of conf intervalls and hypothesis test

53
Q

What are the threats to External Validity for Idealized Experiments?

A
  1. Nonrepresentative Sample
    - The population studied and the population of interest might differ
    - sample only includes people with one type of characteristics
  2. Nonrepresentative program or policy
    - policy or program must be similar to program studied to give generalizing results
    - exp might be small-scale, might differ from real world
  3. General Equilibrium effects
    - Turning a small, temporary exp intro a widespread, permanent program
    - sometimes only works with small groups
    - Ac training Zimbabwe, 10 villages 40% increase wages. Nationwide: Different effect, become skilled, decrease in wage gains
54
Q

What is Quasi-Experiments / Natural Experiments?

A

Two types of Quasi-Experiments:

  1. Whether an individual (entity) receives treatment is “as if” randomly assigned, possible conditional on certain characteristics

“Treatment (d) “as if” randomly assigned

• For example a new policy measure that is implemented in one but not in another are, whereby the implementation is “as if” randomly assigned.

  • Does immigration reduce wages? Eco theory suggest that if the supply of labor increases, wages will fall. However immigrants tend to go to cities with high labor demand, so the OLS estimator of the effect on wages of immigration will be biased. Was done a Quasi on Cubans that moved to Miami. Estimated the causal effect on wages of an increase in immigration by comparing the change in wages of low-skilled workers in Miami to the change in wages of similar workers in comparable U.S cities. He found no effect.
    2. Whether an individual receives treatment partially determined by another variable that is “as if” randomly assigned

“A variable (z) that influences treatment (d) is “as if” randomly assigned: use IV regressions

• The variable that is “as if” randomly assigned can then be used as an instrument variable in a 2SLS regression analysis.

55
Q

What is the parallel trend assumption?

A

We cannot test for this, but if the treatment and control firms seem similar before the treatment, this is more likely to be the case.

56
Q

What is internal validity? what is external validity?

A

Internal validity refers to the validity of the findings within the research study. It is primarily concerned with controlling the extraneous variables and outside influences that may impact the outcome. External validity refers to the extent to which the results of study can be generalized or applied to other members of the larger population being studied.

57
Q

How do we use difference-in-differences?

A

Testing the effect of treated versus control group regressing delta Y against the mean of the treated and control group before and after treatment

58
Q

MLE

A

maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data.