Topic 11- Extensions To Regression Analysis Flashcards
This topic looks at alternate ways to specify a regression model. E.g non-linear
4 types of new variables in regression equations
Natural logarithms of variables
Quadratic variables
Interaction variables
Dummy variables
Natural logarithm 2 uses
Condenses the dataset, make a non-linear relationship into a linear.
Can also make a skewed distribution become more normally distributed.
X values for natural logarithm (on the graph)
X=1 on the graph is a 0 log value
X<1 is a negative log value
X>1 is a positive log value
(Line looks roughly like the beginning of a laffer curve)
How to use logarithms in regression. (In words)
Assume a non-linear relationship between independent variable (x) and dependent variables (y), but a linear relationship between lnX and lnY
Logarithm regression equation
LnY=a+blnX+e
Where
B=r(SlnY/SlnX)
A=lnY bar - blnX bar
(Basically just add ln to each X and Y from the basic regression formula)
2: Quadratic variables
Analyse relationships with a quadratic shape e.g tax (laffer curve)
Example: estimating happiness with age
What would the regression equation look like, and also in general
Happiness=a+b₁AGE+b₂AGE²+e
General
Y=a+b₁x₁+b₂x²₁+b₃x₂+…+e
How to find the minimum/maximum
Differentiate to find turning point
Draw out the 4 quadratic variable grpahs
Interaction variables, and example
2 independent variables have an additional impact on y (THE DEPENDENT) if they BOTH OCCUR
E.g stirring coffee, and adding sugar.
Sweetness will not be impacted by just sugar or just stirring
It will be affected by sugar and stirring.
Example: if testing if it is young or unemployed people, or young AND unemployed people that cause high crime
What would the regression equation look like
Crime=a+b₁YOUNG+b₂UNEMPLOYED+b₃YOUNG x UNEMPLOYED+e
Final new type of variable: Dummy variables purpose
Allows us to include nominal scale data or qualitative variables into the regression model e.g gender, unemployed or employed.
Take value of 0-1
E.g 1=female 0=men
0 usually means attribute not present
Example: we want to find out effect of gender on wages
What would the regression equation look like?
WAGE=a+b₁EDUCATION+b₂GENDER+b₃CHILDREN+b₄MARRIED+e
Here we can see we added gender, and included control variables to isolate the effect of gender on wages.
Using the example equation, how do we interpret the b₂ coefficient?
Let gender
1=female
0=male
If b₂>0, women earn more
If b<0 women earn less