assessing studies based on multiple regression Flashcards

1
Q

what is internal validity?

A

the statistical inferences about causal effects are valid for the population being studied

2
Q

what is external validity?

A

the statistical inferences can be generalized from the population and setting studied to other populations and settings, where the “setting” refers to the legal, policy, and physical environment and related salient features.

3
Q

what does assessing threats to external validity require?

A

it requires detailed substantive knowledge and judgement on a case-by-case basis

4
Q

what are the 5 threats to internal validity of regression studies?

A

omitted variable bias
wrong functional form
errors in variables bias
sample selection bias
simultaneous causality bias

5
Q

what do all the 5 threats to internal validity imply?

A

they imply that the expected value of the error term given the regressors, E(u | X1, ..., Xk), is not equal to 0 (or that conditional mean independence fails), in which case OLS is biased and inconsistent

6
Q

what are the 5 solutions to omitted variable bias?

A

1) if the omitted causal variable can be measured, include it as a regressor
2) if you have data on one or more controls and they are adequate (ie conditional mean independence plausibly holds), then include the control variables
3) use panel data in which each entity is observed more than once
4) if the omitted variable cannot be measured, use instrumental variables regression
5) run a randomised controlled experiment
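
illustration (not part of the original card): a minimal simulation sketch of solution 1. the data-generating process, coefficient values, and helper names (slope, residuals) are assumptions made up for this example. Y depends on X and on W, and X is correlated with W, so leaving W out biases the coefficient on X; including W (done here by partialling W out of both X and Y) should recover a value near the true coefficient of 1.

```python
import random

def slope(x, y):
    """OLS slope from regressing y on x (with an intercept)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

def residuals(x, y):
    """Residuals from regressing y on x (with an intercept)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    b = slope(x, y)
    return [(yi - my) - b * (xi - mx) for xi, yi in zip(x, y)]

rng = random.Random(0)   # fixed seed so the run is repeatable
n = 20000

# Assumed data-generating process: W is the omitted causal variable.
w = [rng.gauss(0, 1) for _ in range(n)]
x = [0.5 * wi + rng.gauss(0, 1) for wi in w]          # X is correlated with W
y = [1.0 * xi + 2.0 * wi + rng.gauss(0, 1) for xi, wi in zip(x, w)]

# Short regression: W omitted. Population value is 1 + 2 * 0.5 / 1.25 = 1.8.
short_slope = slope(x, y)

# Long regression: W included, computed by partialling W out of X and Y.
long_slope = slope(residuals(w, x), residuals(w, y))

print("W omitted: ", round(short_slope, 2))
print("W included:", round(long_slope, 2))
```

the gap between the two printed numbers is the omitted variable bias in this made-up setting.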

7
Q

what is a bad control?

A

variables that are themselves outcome variables in the notional experiment at hand. eg if the question is what would happen to your wages if you got a college degree, controlling for a variable that is itself affected by getting the degree means the coefficient on college won't have a causal interpretation, even if college degrees were randomly assigned

8
Q

what is wrong functional form?

A

arises if the functional form of the estimated regression function differs from that of the population regression function

9
Q

what are the solutions to functional form misspecification?

A

1) continuous dependent variable: use the appropriate nonlinear specifications in X (eg logarithms, polynomials, interactions)
2) discrete dependent variable: need an extension of multiple regression methods (eg probit or logit for a binary dependent variable)
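
illustration (not part of the original card): a minimal sketch of case 1. the quadratic data-generating process and all numbers are assumptions chosen for the example. when the true relationship is Y = 1 + X^2 + u with X symmetric around 0, a linear specification finds a slope near 0 even though X strongly affects Y, while regressing on X^2 recovers a coefficient near 1.

```python
import random

def slope(x, y):
    """OLS slope from regressing y on x (with an intercept)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

rng = random.Random(1)   # fixed seed so the run is repeatable
n = 20000

# Assumed true model: Y = 1 + X^2 + u, with X uniform on [-2, 2].
x = [rng.uniform(-2, 2) for _ in range(n)]
y = [1.0 + xi ** 2 + rng.gauss(0, 0.5) for xi in x]

# Misspecified: Y regressed on X (linear in X).
linear_slope = slope(x, y)

# Correct nonlinear specification: Y regressed on X^2.
x_squared = [xi ** 2 for xi in x]
quadratic_slope = slope(x_squared, y)

print("slope on X:  ", round(linear_slope, 2))
print("slope on X^2:", round(quadratic_slope, 2))
```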

10
Q

what is errors-in-variables bias?

A

economic data often contain measurement error
eg data entry errors in administrative data, recollection errors in surveys, ambiguous questions
when X is measured with error, this leads to correlation between the error term and the measured variable, so OLS is biased and inconsistent
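
illustration (not part of the original card): a minimal sketch assuming classical measurement error, ie the observed regressor equals the true X plus independent noise. that assumption, the coefficient of 2, and the variable names are made up for this example. under it the OLS slope is pulled toward 0 by the factor var(X) / (var(X) + var(noise)), which is 1/2 here.

```python
import random

def slope(x, y):
    """OLS slope from regressing y on x (with an intercept)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

rng = random.Random(2)   # fixed seed so the run is repeatable
n = 20000
beta = 2.0               # assumed true causal effect of X on Y

x_true = [rng.gauss(0, 1) for _ in range(n)]
y = [beta * xi + rng.gauss(0, 1) for xi in x_true]

# Observed regressor = true X + independent noise with variance 1.
x_observed = [xi + rng.gauss(0, 1) for xi in x_true]

slope_true_x = slope(x_true, y)          # should be near beta = 2
slope_observed_x = slope(x_observed, y)  # should be near beta * 1/(1+1) = 1

print("using true X:    ", round(slope_true_x, 2))
print("using measured X:", round(slope_observed_x, 2))
```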

11
Q

what are the solutions to errors-in-variables bias?

A

obtain better data
develop a specific model of the measurement error process. this is only possible if a lot is known about the nature of the measurement error
instrumental variables regression

12
Q

what is missing data and sample selection bias?

A

data are often missing; sometimes the missing data introduce bias, depending on why they are missing

13
Q

what are the three cases of missing data?

A

data are missing at random
data are missing based on the value of one or more X’s
data are missing based in part on the value of Y or u

14
Q

what cases of missing data don't introduce bias and why?

A

when data are missing at random or when data are missing based on the value of one or more X’s. in these cases the missingness is unrelated to the error term, so OLS remains unbiased; the only cost is a smaller sample, so the standard errors are larger than they would be if the data weren't missing

15
Q

what case of missing data does cause bias?

A

when data are missing based in part on the value of Y or u; this bias is called sample selection bias
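
illustration (not part of the original card): a minimal sketch comparing the three cases of missing data. the model Y = 1 + 2X + u and the three selection rules are assumptions made up for this example. dropping observations at random or based on X should leave the slope near 2, while keeping only observations with large Y should pull the slope well below 2.

```python
import random

def slope(pairs):
    """OLS slope from regressing y on x (with an intercept); pairs = [(x, y), ...]."""
    n = len(pairs)
    mx = sum(p[0] for p in pairs) / n
    my = sum(p[1] for p in pairs) / n
    sxy = sum((p[0] - mx) * (p[1] - my) for p in pairs)
    sxx = sum((p[0] - mx) ** 2 for p in pairs)
    return sxy / sxx

rng = random.Random(3)   # fixed seed so the run is repeatable
n = 20000

# Assumed true model: Y = 1 + 2X + u.
data = []
for _ in range(n):
    xi = rng.gauss(0, 1)
    yi = 1.0 + 2.0 * xi + rng.gauss(0, 1)
    data.append((xi, yi))

# Case 1: data missing at random (each observation kept with probability 1/2).
missing_at_random = [p for p in data if rng.random() < 0.5]
# Case 2: data missing based on X (only X > 0 observed).
missing_on_x = [p for p in data if p[0] > 0]
# Case 3: data missing based on Y (only Y > 1 observed).
missing_on_y = [p for p in data if p[1] > 1.0]

slope_random = slope(missing_at_random)
slope_on_x = slope(missing_on_x)
slope_on_y = slope(missing_on_y)

print("missing at random:", round(slope_random, 2))
print("missing based on X:", round(slope_on_x, 2))
print("missing based on Y:", round(slope_on_y, 2))
```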

16
Q

when does sample selection bias arise during a selection process?

A

when the selection process influences the availability of data and is related to the dependent variable

17
Q

what is survivorship bias?

A

survivorship bias is a special form of sample selection bias: only those that have survived some event are sampled. eg when managed funds are measured against hold-the-market funds, only the managed funds that outperformed in the past have survived to be in the sample
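
illustration (not part of the original card): a minimal sketch of the fund example. the survival rule (a fund survives only if its past return beat the market) and all numbers are hypothetical. every fund has a true expected excess return of 0, yet the survivors' average past excess return looks clearly positive.

```python
import random

rng = random.Random(4)   # fixed seed so the run is repeatable
n_funds = 20000

# Assumed: each fund's past excess return over the market is pure luck (mean 0).
past_excess_return = [rng.gauss(0, 1) for _ in range(n_funds)]

# Hypothetical survival rule: funds that underperformed the market closed down.
survivors = [r for r in past_excess_return if r > 0]

mean_all_funds = sum(past_excess_return) / len(past_excess_return)
mean_survivors = sum(survivors) / len(survivors)

print("all funds:     ", round(mean_all_funds, 2))
print("survivors only:", round(mean_survivors, 2))
```

sampling only the funds that exist at the end of the period is what produces the apparent outperformance in this sketch.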

18
Q

what are the solutions to the sample selection bias?

A

collect the sample in a way that avoids sample selection:
- obtain a true random sample, or define the sample at the beginning of the period rather than at the end

19
Q

what is simultaneous causality bias?

A

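if X causes Y and Y causes X too, then a large ui will mean a larger Y, which in turn means a larger X (if the effect of Y on X is positive). so X and the error term are correlated and OLS is biased and inconsistent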
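
illustration (not part of the original card): a minimal sketch of a two-equation system, Y = beta*X + u and X = gamma*Y + v. the system, the values beta = gamma = 0.5, and the variable names are assumptions made up for this example. solving the two equations for X shows that X depends on u, and the OLS slope of Y on X settles near 0.8 rather than the true 0.5.

```python
import random

def slope(x, y):
    """OLS slope from regressing y on x (with an intercept)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

rng = random.Random(5)   # fixed seed so the run is repeatable
n = 20000
beta, gamma = 0.5, 0.5   # assumed effects: X on Y (beta) and Y on X (gamma)

xs, ys = [], []
for _ in range(n):
    u = rng.gauss(0, 1)
    v = rng.gauss(0, 1)
    # Solving Y = beta*X + u and X = gamma*Y + v for X:
    x = (gamma * u + v) / (1 - beta * gamma)   # X depends on u
    y = beta * x + u
    xs.append(x)
    ys.append(y)

# Population value: (gamma + beta) / (gamma**2 + 1) = 0.8 with unit variances.
ols_slope = slope(xs, ys)
print("true beta:", beta, " OLS slope:", round(ols_slope, 2))
```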

20
Q

what are the solutions to simultaneous causality bias?

A

run a randomised controlled experiment: if X is chosen at random then there is no feedback from the outcome variable Y to X
develop and estimate a complete model of both directions of causality: extremely difficult in practice
use instrumental variables regression to estimate the causal effect of interest
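
illustration (not part of the original card): a minimal sketch of the instrumental variables idea with one regressor and one instrument. the system Y = beta*X + u, X = gamma*Y + Z + v, the instrument Z (which shifts X but is unrelated to u), and all values are assumptions made up for this example. the IV estimate cov(Z, Y) / cov(Z, X) should land near the true beta of 0.5, while OLS stays biased upward.

```python
import random

def cov(a, b):
    """Sample covariance of two equal-length lists."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    return sum((ai - ma) * (bi - mb) for ai, bi in zip(a, b)) / (n - 1)

rng = random.Random(6)   # fixed seed so the run is repeatable
n = 20000
beta, gamma = 0.5, 0.5   # assumed effects: X on Y (beta) and Y on X (gamma)

zs, xs, ys = [], [], []
for _ in range(n):
    z = rng.gauss(0, 1)  # instrument: affects X, independent of u
    u = rng.gauss(0, 1)
    v = rng.gauss(0, 1)
    # Solving Y = beta*X + u and X = gamma*Y + z + v for X:
    x = (gamma * u + z + v) / (1 - beta * gamma)
    y = beta * x + u
    zs.append(z)
    xs.append(x)
    ys.append(y)

ols_slope = cov(xs, ys) / cov(xs, xs)   # biased by the feedback from Y to X
iv_slope = cov(zs, ys) / cov(zs, xs)    # uses only the variation in X driven by Z

print("true beta:", beta)
print("OLS:", round(ols_slope, 2), " IV:", round(iv_slope, 2))
```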

21
Q

what is the external validity requirement for a prediction model?

A

the data used to estimate the prediction model must be from the same distribution as the out-of-sample observation for which the prediction is made