R5 - Quant - Multiple Regression Flashcards
Multiple Regression
Linear regression involving two or more independent variables.
Y = b0 + b1X1 + b2X2 + E
Where:
Yi = The ith Observation of the dependent variable Y
Xji = The ith observation of the independent variable Xj, j=1,2,…,k
b0 = The intercept of the equation
b1,..,bk = The slope coefficients for each independent var
Adjusted R2
A measure of goodness-of-fit of a regression that is adjusted for degrees of freedom and hence does not automatically increase when another independent variable is added to a regression.
Analysis of variance (ANOVA)
The analysis of the total variability of a dataset (such as observations on the dependent variable in a regression) into components representing different sources of variation; with reference to regression, ANOVA provides the inputs for an F-test of the significance of the regression as a whole.
Breusch−Pagan test
A test for conditional heteroskedasticity in the error term of a regression.
Categorical dependent variables
An alternative term for qualitative dependent variables.
Common size statements
Financial statements in which all elements (accounts) are stated as a percentage of a key figure such as revenue for an income statement or total assets for a balance sheet.
Conditional heteroskedasticity
Heteroskedasticity in the error variance that is correlated with the values of the independent variable(s) in the regression.
Data mining
The practice of determining a model by extensive searching through a dataset for statistically significant patterns.
Discriminant analysis
A multivariate classification technique used to discriminate between groups, such as companies that either will or will not become bankrupt during some time frame.
Dummy variable
A type of qualitative variable that takes on a value of 1 if a particular condition is true and 0 if that condition is false.
First-order serial correlation
Correlation between adjacent observations in a time series.
Generalized least squares
A regression estimation technique that addresses heteroskedasticity of the error term.
Heteroskedastic
With reference to the error term of regression, having a variance that differs across observations.
Heteroskedasticity-consistent standard errors
Standard errors of the estimated parameters of a regression that correct for the presence of heteroskedasticity in the regression’s error term.
Log-log regression model
A regression that expresses the dependent and independent variables as natural logarithms.