Week 4: Properties of Estimators, Gauss-Markov, Assumptions Part I Flashcards

1
Q

Gauss-Markov theorem

Name the definition as well as the 5 conditions that have to be met

A

The Gaus-Markov theorem says that, under certain conditions, the ordinary least squares (OLS) estimator of the coefficients of a linear regression model is the best linear unbiased estimator (BLUE), that is, the smallest variance among those that are unbiased and linear in the observed output variables.

1: linear function of X’s plus disturbances
2: (Conditional) mean independence (disturbances have mean zero)
3: Homoskedasticity
4: Uncorrelated disturbances
5: Disturbances are normally distributed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Regression

Definition

A

Regression allows researchers to predict or explain the variation in one variable based on another variable (Dependent - Independent variables).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Regression

Why regression 1

A

To predict the dependent variable - easier:
- We don’t care about the warning correlation ≠ causation
- We just want to choose X’s to minimize our errors of predictions.
- In other words - we want to maximize R-squared
- No assumptions needed

Examples: Predicting stocks, financial markets, economic indicators, election outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Regression

Why regression 2

A

Regression to make causal inferences: We want to know if X causes Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Violations of mean independence

A
  • Measurement error in the independent variables
  • Reverse causation
  • Specification error-omission of relevant variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

measurement error

A

The difference between the observed value (the result of measurement) and the actual value of what we are measuring

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Systematic measurement error

A
  • Measuring something in addition to, instead of, or as an incomplete part of the true concept of interest
  • Because relative to true concept of interest, depends on first having a good definition
  • Always results in bias, inefficiency, and nonsense
  • Can only be dealt with during research design

Example: GDP as a measure of national wealth:
- GDP measures only the monetary value of goods and services produced in a country
- Values destruction of ecosystems that generate short-term revenues, undervalues unpaid ‘household’ and other work

Example: survey design, measuring feminism, and surveyor
- induced measurement error-“Should men and women get equal pay for equal work?”
- Measures the extent to which social pressure induces individuals to answer questions in a certain way

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Random measurement error

A
  • For a particular observation, the observed value differs from the true value
  • This difference is callede “error”
  • These errors are random
  • Faulty measuring tool, carelessness, rounding

To fix, you need better data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Reverse causation

A
  • Instead of X causing Y, Y causes X (The effect of something could actually be its cause)
  • Reverse causation leads to biasedness
  • Solution is very difficult: Theory, advanced methods are needed. It’s potentially unsolvable

For example, some studies have observed that diet soda drinkers are more likely to be obese than people who don’t drink diet soda.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Specification error

A

Wrong variables:
- Including an irrelevvant variable - inefficiency
- Excluding a relevant variable - biasedness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Ubiasedness

A

Wheter the expected value of the sampling distribution of an estimator is equal to the unknown true value of the population parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Biasedness

A

Any instance that creates a difference between an expected value and the true value of a parameter being estimated.

In other words, it occurs when a statistic is unrepresentative of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Consequence of including an irrelevant variable

A
  1. Partial slope coefficients remain unbiased
  2. Estimates are inefficient
    * The greater the correlation between the included variables, the more inefficient
    * If they are uncorrelated, estimates are efficient
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Excluding a relevant variable

A
  • Does the variable have a causal effect on the dependent variable?
  • Is the variable correlated with those variables whose effects are the focus of the study?

If answer “yes” to both, then the excluding the variable leads to bias.

Solution: add the variable as a “control”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How to detect and deal with wrong variable errors

A
  • Exclusion of relevant variables is a problem of theory. Always ask: is there a third variable Z that is linked to BOTH X and Y?
  • Inclusion of irrelevant variables can be diagnosed by the t-statistics (e.g. hypothesis testing)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly