Econ B: L11 Flashcards
Give some examples of types of functional forms that may imply a non-linear relationship between y and x but not a non-linear model?
squared, interaction terms, logs (see slides 7->10)
eg (cont. use)) Given wage(hat)=5+0.5educ:
What does this imply?
What are the 2 options around this?
Implies each additional year of education will increase wage by £0.5 but this i unrealistic (inc. returns to educ.)
2 options:
1) specify a non linear model (bad)
2) specify a linear model using logarithms (good!)
eg) suppose % increase in wage is same given each additional year of education:
1) What is the regression equation for this?
2) How will β1 be interpreted?
1) log(wage) = β0 + β1educ +u
2) %∆wage ~=(100β1)∆educ
On average, holding all other factors fixed:
A one year change in educ leads to a (100β1)% change in wages.
Steps to making a logarithmic model?
1) Define y=log(wage)
2) Mechanics of OLS are same tf just do same as usual
What is a semi-elasticity model?
A log-level model; y is in log form and x is in level
What is an elasticity model?
Log-log model: both sides are in logarithmic form
Given an elasticity model, what will the interpretation of β1 be? (2)
Elasticity of y wrt x1
tf:
%∆y~=β1%∆x1
What is a level-log model? What is the interpretation of it?
y is in normal form, x is in log form (ie. log(x)):
∆y~=(β1/100)%∆x
3 advantages of taking logs?
1) when y > 0, models using log(y) as the dependent variable often satisfy the MLR assumptions more closely than models using the level of y.
2) Strictly positive variables often have conditional distributions that are heteroskedastic or skewed; taking the log can mitigate, if not eliminate, both problems.
3) Taking logs usually narrows the range of the variable, in some cases by a considerable amount. This makes estimates less sensitive to outlying (or extreme) observations on the dependent or independent variables.
2 standard examples where you may take logs?
1) If positive monetary amount (can’t have -ve pounds)
2) If large integer values (eg. pop)
What form do variables in years normally appear in?
Their original form
A variable that is a proportion or a percent can appear in either original or logarithmic form, although there is a tendency to use them in level form. Why?
any regression coefficients involving the original variable whether it is the dependent or independent variable will have a percentage point change interpretation
Difference between percentage point change and percentage change?
eg) if U goes from 8% to 9% this is an increase of one percentage point
But a 12.5% increase from the initial U level (see slides)
using logs means we are looking at % change in U rate
2 limitations of logs?
1) can’t be used if a variable takes on zero/negative values
2) If log is on dependent variable can be more effort to predict/transform it back to the original variable
Note:
Not legitimate to compare R^2s from models where y is DV in one and log(y) is DV in other; measures explain variations in different variables!