L2 Flashcards

1
Q

What assumptions are used to show that beta(hat1) is approximately distributed by a normal distribution?

A

3 LSAs and the assumption that n is large!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Diagramatically, what is the p-value?

A

The probability in tails outside of the +-t(actual) of a normal distribution. If p-value is less than 5%, then can reject at the 5% significance level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

n is large, n=(maybe)…?

A

50+

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A 95% confidence level is…? (2, just need one of them!)

A

1) the set of points that cannot be rejected at the 5% SL.

2) a set-valued function of the data that contains the TRUE PARAMETER VALUE 95% of the time in repeated samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What regression error does Stata report in parentheses?

A

Room MSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

In Stata, what is: a) _cons, b) p>|t|, and c) ‘t’ column?

A

a) constant (ie. beta0)
b) p-value
c) t-statistic for β(i)=0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How do we interpret the coefficient of a binary variable? Equation showing this?

A

If β=1, it is the effect on Y that the variable has (ie. the effect of ‘being a man’ on ‘wage rate’)
Can be described by the equation:

β(1) = E(Yi|Xi=1) - E(Yi|Xi=0)
tf is the population difference in group means

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are homoskedastic errors?

A

The errors when the error term is uncorrelated with the X variable(s)

ie. var(u|X=x) = σ^2 (ie. constant) (if not then HTSK)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Check

A

Check that I can tell from data is errors are homo or hetero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Advantage and disadvantage of the homo-only SE formula?

A

Pro: simple
Con: only works for homo errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are Robust SEs?

A

Usual standard errors, valid whether homo or heteroskedastic errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why is better to always use robust SEs?

A

Since the two different formulas coincide when n is large anyway!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the extended LSAs?

A

Original 3 plus two more:

4) u is homoskedastic
5) u is distributed N(0,σ^2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Why do we need to be careful with LSAs 4 and 5?

A

They are more restrictive than LSAs 1-3, tf apply to less cases (but give opportunity to prove strong results)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the Gauss-Markov Theorem?

A

Under LSAs 1-4 (ie. originals plus homoskedasticity), β(hat1) has the smallest variance among all linear estimators (ie. efficient) (don’t think I need proof of this, page 36 if I do need it)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Efficiency of OLS under all 5 LSAs? Explain.

A

β(hat1) has the smallest variance of all consistent estimators (ie. linear or non-linear combinations of Yi) as n->infinity

This means that OLS is the best you can do among consistent estimators, and since non-consistent ones aren’t really an option, OLS is really the best you can do!

17
Q

3 problems with OLS estimation? (2 efficiency assumption points + one other)

A

1) GM Theorem isn’t that compelling (result is only for linear estimators, homo often doesn’t hold)
2) 5LSA efficiency requires homo and normal errors, which is often not plausible!
3) OLS is more sensitive to outliers than some other estimators

18
Q

Possible estimator to use if there are large outliers in OLS?

A

Least Absolute Deviations (LAD) estimator (see notes)

19
Q

What can we conclude if all 5 assumptions hold?

A

1) Beta(hat0) and beta(hat1) are normally distributed

2) The t-statistic has a student t-distribution with n-2 degrees of freedom

20
Q

When can you use t(n-2) CVs instead of N(0,1) for hypothesis tests and CIs?

A

When u is definitely homoskedastic and normally distributed, and n is LESS THAN 50