Tentamen Flashcards

1
Q

What is the LS estimator of beta?

A

β hat = (X’X)-1X’y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the LS estimator of σ2?

A

s2 = e’e/(n-k)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How to show that beta hat (LS) is unbiased?

A

E(β hat) = E[(X’X)-1X’y] = (X’X)-1X’ E[y] = (X’X)-1X’X β = β

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How to show that s2 is unbiased, or how to derive it?

A

Let B, a positive semidefinite matrix. Then we can rewrite s2 = y’By = (Xβ + u)’B(Xβ +u) = u’Bu + 2 β’X’Bu + β’ X’ BX β.

If we take the expectation of this:
E(s2) = E(u’Bu) + β’ X’ BX’ β = σ2 tr(B) + β’ X’ BX β.

Since we know that BX = 0 is satisfied if B = M (residual maker), and tr(M) = n - k, s2 = y’My/(n-k) = e’e/(n-k) is unbiased.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How to derive the variance of beta hat?

A

Var(β hat) = Var[(X’X)-1X’y] = (X’X)-1X’ Var(y) ((X’X)-1X)’ = (X’X)-1X’ σ2 In X’(X’X)-1 = σ2 (X’X)-1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Show that no other linear unbiased estimator of β has a lower variance.

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When to use the F-statistic for testing, and what is it?

A

The F-statistic should be used when testing multiple linear restrictions, where R is a matrix with the formula of the restriction, and r is the value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the statistic for testing a single restriction?

A

w is a vector of the restriction, r is the value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the statistic for testing if beta equals zero?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How to derive a ML-estimator?

A
  1. First create the Likelihood function (the product of n draws of the CDF)
  2. Take the ln of this function
  3. Derive it
  4. Equate to zero
  5. Possibly take the second derivative to show that it is in fact a minimum
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Show the Cramer-Rao inequality, what does this mean?

A

It means that if an unbiased estimator achieves the lower bound it’s efficient.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the adjusted R2, and what is R2 (formulas)?

A

Note: SST = y’Ay

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the difference between a model and a data generating process?

A

A model tries to approximate the DGP, but is does not equal the DGP (generally).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How can be shown that the normal distribution is a second order approximation around te mode?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How is σ hat, and β hat derived (using the ML)?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the formula for the covariance and the variance?

A

Cov(a, b) = E[(a - E(a))(b - E(b))’]

Var(a) = E[(a - E(a))(a - E(a))’]

17
Q

What is the formula of the SST? And A?

A

SST = y’Ay

A = In - ɩɩ’/n

18
Q

How to get the restricted ML values?

A
19
Q

On what is the Wald, the Lagrangian Multiplier and the Likelyhood ratio test based?

A

Wald: Based on the idea that Rβ hat should be close to r if the null hypothesis is true.

LM: Based on the fact that the gradient q(β hat) vanishes if the null hypothesis is true, thus q(β hat) should be close to zero (for the null hypothesis).

LR: Based on the ratio of the maximized likelihood with the restriction to the maximized likelihood without the restiction, which should be close to one if the null hypothesis is true.

20
Q

What is the Wald, LR, and LM test (when σ is given)?

A

(Rβ hat - r)’(R(X’X)-1R’)-1(Rβ hat - r)/σ2, it is chi-squared (m) distributed, where m is the number of restrictions.

21
Q

How to show the order of the Wald, LM and LR tests?

A
22
Q

What is the matrix M and what does it do?

A

M is the residual maker matrix, s.t. e = y - Xβ hat = My = Mu,

var(e) = var(Mu) = σ2M.

M = In - X(X’X)-1X’

23
Q

How to do confidence interval?

A

j hat - t sqrt(s2vjj), βj hat + t sqrt(s2vjj)), where t is the 95% confidence statistic (for two sided test).

24
Q

How to do prediction interval?

A

(y*hat - A, y*hat + A), with A = t s sqrt(x*‘(X’X)-1x* + 1), t is the 95% test statistic

25
Q

What is the LS derivation?

A
26
Q

What are the advantages and disadvantages of the Least Absolute Estimation?

A
27
Q

What are the ideal 5 conditions/assumptions?

A
  1. The model is linear
  2. The n x k matrix X is nonrandom and has rank k (thus all columns are linearly independent)
  3. The n x 1 vector u has mean zero and variance σ2In
  4. The N x 1 vector u follows a multivariate normal distribution
  5. If we let Qn = X’X/n then Qn –> Q as n –> inf, where Q is finite and positive definite k x k matrix. (Asymptotic, thus not very important for us)
28
Q

What is meant by significance and importance?

A

Often we can ignore small unimportant variables, although our estimation could then be slightly more biased, it often also gains precision.

Significance: how precisely we manage to estimate a parameter

Importance: the impact of the parameter on the total outcome.

29
Q

What does the Crame-Rao inequality mean?

A

It says that the varaince of an unbiased estimator achieves the Cramer-Rao lower bound it is efficient.