PE_L2_(without Appendix) Flashcards

1
Q

What is the Simple Linear Regression (SLR) population model?

A
  • Model form: y = β0 + β1x + u
  • y: dependent (outcome) variable, consequence
  • x: independent (explanatory) variable, cause
  • u: error (disturbance) term

“population model”
We want to find causal relations (“! causes “”), not (mere) correlations!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which assumptions define the SLR framework?

A
  • SLR.1: Model is linear in parameters (beta)
  • SLR.2: Random sampling
  • SLR.3: Sample variation in x
  • SLR.4: Zero conditional mean (E(u|x)= E(u)=0)
    E ist der Erwartungswert
  • SLR.5: Homoskedasticity (Var(u|x)=σ2)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does ‘zero conditional mean’ imply?

A
  • E(u|x) = 0
  • The error term u does not systematically depend on x
  • Guarantees no omitted-variable bias if included variables capture relevant effects
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why is sample variation in x important?

A
  • Prevents the denominator in slope formulas from being zero
  • Ensures x has enough variability to estimate β1
  • Without it, the slope cannot be computed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do we interpret the OLS estimates?

A
  • β1 (slope): Estimated change in y for a one-unit change in x
  • β0 (intercept (value of : when & is zero)): Estimated value of y when x=0
  • Both are unbiased if assumptions hold
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the idea of ‘least squares’?

A
  • Minimise the sum of squared residuals: ∑(yi − ŷi)2
  • Ensures the fitted line is as close as possible to the data points (in a vertical sense)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is R2?

A
  • Coefficient of determination
  • R2 = SSE / SST = 1 − SSR / SST
  • Measures the proportion of the total variation in y explained by the model
  • Goodness of fit
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does homoskedasticity mean?

A
  • Var(u|x) = σ2 (constant)
  • The spread of the error term does not depend on x
  • A key assumption for deriving simple variance formulas
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is the error variance estimated?

A
  • Using the residuals: ûi = yi − ŷi
  • σ̂2 = SSR / (n − 2)
  • (n − 2) because two parameters (β0, β1) are estimated from the data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How can we handle non-linear relationships?

A
  • ‘Linear in parameters’ does not require a strictly straight-line in x
  • Can use log forms: ln(y), ln(x)
  • Interpretation changes: e.g., log-log model implies elasticity interpretation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the unobserved error term?

A
  1. Strictly unpredictable random behaviour that may be unique to that observation
  2. Unspecified (unobserved) factors / explanatory variables, which are not in the model
  3. An approximation error if the relationship between : and & is not exactly linear
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

OLS

A

Ordinary Least Squares

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Estimate β0 + β1

A

Step 1:
E(u) = 0
E(u|x) = 0
??
Step 2:
??

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Implications of Simple Linear Regression (SLR) Model

A

Implication 1: OLS is unbiased (because of SLR1-4)
Implication 2: Properties of variance of the OLS Estimators

How well did you know this?
1
Not at all
2
3
4
5
Perfectly