Chapter 4 - Logs, Polynomials, & Interaction Terms Flashcards
1) ln((A)(B)) =
2) ln(A/B) =
3) lne =
4) ln1 =
1) lnA + lnB
2) lnA - lnB
3) 1
4) 0
What to log and what not to log?
1) Currency values are often logged
2) Variables with larger integer values, population are often logged
3) Years are not logged
4) Percentages typically not logged
5) If the variable can take on a zero or a negative, not logged
When interpreting the coefficient of a estimated equation from a regression you conducted, what do you do?
Always remember to take the partial derivative
You run a regression and get the following equation: log(price) = 9.23 - 0.718log(pollution) + 0.306rooms
Interpret each of the coefficients, and use the correction formula on the log-lin interpretation.
For β1: On average, a 1% increase in the level of pollution leads to a 0.718% decrease in the price of the house, ceteris paribus.
For β2: On average, for every 1 additional room in a house, results in a 30.6% (100*β2%) increase in the price of the house, ceteris paribus.
Correction Formula: 100[(e^0.306) -1] = 35.8%
When interpreting log-lin models there is an approximation error that occurs as changes in log(y) become bigger and bigger. There is a correction formula for this, what is that correction formula?
%Δy = 100[(e^β) – 1]
Wage = e^(β0 + β1Ed + β2Exp)
Interpret the coefficients for Ed and Exp, where Ed = Years of education, and Exp = Years of work experience
1) ln both sides to get rid of the e.
2) ln(Wage) = β0 + β1Ed + β2Exp
3) Interpretation: On average, for every 1 additional year in education, wage increases by 100*β1%, ceteris paribus.
Same for experience.
You run a regression and get the following: prate = 97.32 + 5.02mrate + .314age - 2.66log(totemp) prate = participation rate in 401k mrate = 401k plan match rate age = age of 401k plan totemp = total number of firm employees Interpret the coefficients
For β1: On average, for every additional 1% matched rate in the 401k plan results in a 5.02% participation rate increase in the 401k plan, ceteris paribus.
For β2: On average, for every 1 additional year in the age of the 401k plan, results in a .314 participation rate increase in the 401k plan, ceteris paribus.
For β3: On average, a 1% increase in total firm employees results in a 0.0266 (β2/100) participation rate decrease in the 401k plan, ceteris paribus.
When dealing with polynomials in a regression what is the first step taken in order to understand the interpretation?
1) Take the derivative with respect to the variable that is being raised, x^2.
2) For interpretation how the x variable correlates with the y variable we must do it at a particular point, or a particular value of x. Typically the value of x is mean, but it can be any value of interest.
What is the general interpretation for the equation below, (interpret for x2)?
y = β0 + β1x1 + β2x2 + β3x2^2 + u
1) Take derivative with respect to x2: β2 +2β3x2
2) On average, a 1 unit change in x2 results in a β2 + 2β3x2 units change in y, ceteris paribus.
3) Never interpret β2 or β3 by themselves because they are both related to x2
Take the first and second derivative of the equation with respect to x2:
y = β0 + β1x1 + β2x2 + β3x2^2 + u.
What are the rules for knowing if the relationship is concave, convex, or linear?
1) First derivative: β2 + 2β3x2
2) Second derivative: 2β3
3) If β30, relationship between x2 and y is convex.
If β3=0 relationship between x2 and y is linear.
y = β0 + β1x1 + β2x2 + β3x2^2 + u
If the relationship between x and y is linear do we need to have a squared term? Give the hypothesis test to determine this.
1) There is no need to have the squared term, because the relationship is just that, linear.
2) H0: β3=0 and H1: β3≠0
If we reject the null hypothesis we should keep the squared term, and if we fail to reject the null hypothesis then we can drop it.
Conduct the t-stat test in the normal way.
y = β0 + β1x1 + β2x2 + β3x2^2 + u
What is the global significance test? (give the definition and the hypothesis test)
1) Test used to see if there is some relationship between x2 and y at any point int he data, or if x2 is ever correlated with y.
2) H0: β2=β3=0 and H1: At least one is not equal to zero
Must calculate the partial F-test with this test, and use a full and reduced model.
If we reject the null hypothesis then there is correlation between x2 and y
What is the local significance test? (give the definition and the hypothesis test)
First derivative: β2 + 2β3x2
1) Test used to see if there is some relationship between x2 and y at a specific point in the data.
2) H0: β2 + 2β3x2 = 0 and H1: β2 + 2β3x2 ≠ 0
If we reject the null hypothesis there is correlation between x2 and y at a specific value of x2
wage = 3.73 + 0.298expr - 0.0061expr^2 wage = wage rate expr = years of experience Interpret the coefficients. If experience is 5 years what is the interpretation?
1) First take the derivative: (∂wage)/(∂expr) = 0.298 - 2(0.0061)expr
2) On average, a 1 year increase in experience results in a 0.298-2(.0061)expr change in the wage rate, ceteris paribus.
3) On average, a 1 year increase in experience results in a $0.237 increase in the wage rate, ceteris paribus.
y = β0+β1x1+β2x2+β3x1x2+u
Take the first derivative with respect to x1, and then x2. Interpret the interaction term.
(∂y)/(∂x1) = β1 + β3x2
(∂y)/(∂x2) = β2 + β3x1
On average, a 1 unit change in x1 results in a β1 + β3x2 units change in y, ceteris paribus.
On average, a 1 unit change in x2 results in a β2 + β3x1 units change in y, ceteris paribus.