Quant lvl 2 - Reading 10 Flashcards
Define multiple linear regression
tool that allows examination of the relationship (if any) between two types of variables
Define partial slope coefficient (or partial regression coefficient)
slope coefficient in a multiple regression. “measures the expected change in dependent variable for 1unit increase in an independent variable, holding the other ind. variables constant.
Define a dummy variable
qualitative independent variable in a regression. usually value of 1 to indicate true and value of 0 for false
What are the three regression violations?
- heteroskedasticity 2. serial correlation 3. multicollinearity
Define heteroskedasticity
we assume that the variance of error is constant, but variance of error differs across observations
What is the Breusch-Pagan test used for?
to test the heteroskedasticity. (Heteroskedasticity is “when the standard deviations of a variable, monitored over a specific amount of time, are non-constant” (investopedia))
What is dummy variable regression
Tests whether the stock returns provide different average returns when the returns are related to other factors. e.g. takes on value of 1 if condition is true and 0 if it is false
how many variables do you need for a dummy variable regression
n-1 since you want to distinguish among n categories.
what is R-squared
coefficient of determination. measures “goodness of fit of estimated regression to the data”
what is formula for R-squared (coefficient of determination)
total var-unexplained / total variation or the SSregression / SStotal
define unconditional heteroskedasticity
“occurs when heteroskedasticity of error variance is not correlated with independ. variables in the multiple regression…creates no major problems for statistical inference”
define conditional heteroskedasticity
heteroskedasticity in error variance that is correlated with the values of indep. variables in the regression.
define multicollinearity
“high correlation in linear regression when the independent variables are highly correlated (2 or more)
how do you correct multicollinearity?
omit one or more of teh correlated ind. variables
3 way regression models are misspecified
- functional form (important variables missing, data improperly pooled)
- explanatory var. correlated with error term in time series model
- other misspecifications resulting in nonstationarity