Reading 12: Multiple Regression and Issues in Regression Analysis Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Formulate a multiple regression equation to describe the relation between a dependent varible and several independent variables, and determine the statistical significance of each independent variable.

A

Answer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Interpret estimated regression coefficients and their p-values.

A

p-value: what is the smallest significance level at which we can reject Ho.

If p-value < alpha, then reject Ho

If p-value > alpha, then fail to reject Ho

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Formulate a null and an alternative hypothesis about the population value of a regression coefficient, calculate the value of the test statistic, and determine whether to reject the null hypothesis at a given significance level.

A

Answer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Interpret the results of hypothesis tests of regession coefficients.

A

Answer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Calculate and Interpret the following:

  1. a confidence interval for the population value of a regression coefficeint
  2. a predicted value for the dependent variable, given an estimated regression model and assumed values for the independent variables
A
  1. Coefficient +- (t-stat x standard error)
  2. Plug in the values BUT do not leave out any statistically insignicant coefficients
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Explain the assumptions of a multiple regression model:

A
  1. Linear relationship between Y and X
  2. No exact linear relationship among X’s
  3. Expected value of error term = 0
  4. Variance of error term is constant
  5. Errors not serially correlated
  6. Error term normally distributed

IMPORTANT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Calculate and interpret the F-statisitc, and describe how it is used in regression analysis:

A
  • F-Statistic: tests whether any independent variables explain variation in dependent variable
  • Hypothesis Test
    • Ho: All slope coefficients = 0
    • Ha: At least one slope coefficient does not equal 0
  • REJECT if Ho exceeds critical value
  • Shape is determined by numerator and denominator Df
  • OVERALL: the F-test tests the significance of the model
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Distinguish between and interpret the R-squared and adjusted R-squared in multiple regression.

A
  • R-squared: % of SST explained by RSS
  • Adj. R-squared: compensates for the problem that as new variables are added to the model the R2 increases
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Evaluate how well a regression model explains the dependent variable by analyzing the output of the regression equation and an ANOVA table.

A

Answer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Formulate a multiple regression equation by using dummy variables to represent qualitative factors, and interpret the coefficients and regression results.

A

Dummy Variable Trap: Always use onne less dummy variable than states of the world. (i.e. four quarters, use three dummies)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Explain the types of heteroskedasticity and how heteroskedasticity and serial correlation affect statistical inference.

A

Types of Heterskedascticity:

  • Type 1: Unconditional (not related to independent variables) CAUSES NO MAJOR PROBLEMS
  • Type 2: Conditional (related to independent variables) IS A PROBLEM and makes T-stats usuallly articifically high by making the standard error too small.

Serial Correlation (or Autocorrelation)

  • Positive: each error term trends in the same direction Impact is the same as Conditional Heteroskedasticity. small SE, high T-stat
  • Negative: opposite progression of figures (not likely in finance)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe multicollinearity, and explain its causes and effects in regression analysis.

A

Multicollinearity: two or more “X” variables are correlated with each other

Causes: correlation between two or more X variables

Effects:

  1. Inflates SEs; reduces t-stats
  2. Variables falsly look unimportant
  3. i.e. FALSE INSIGNIFICANCE
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe how model misspecification affects the results of a regression analysis, and describe how to avoid the common forms of misspecification.

A

Misspecification: selection of explanatory variables, and/or transformation of variables that affects the reliability of the inference/hypothesis tests’

How to avoid common forms:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe models with qualitative dependent variables:

A

Cannot use oridnary least squares regression (OLS) analysis

LOGIT MODELS: calculate a probability based on logistic distribution (CAN HELP)

PROBIT MODEL: calculateds probabilitys based on normal distributions

DISCRIMINANT MODEL: Produce a score/ranking

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Evaluate and interpret a multiple regression model and its results.

A

Answer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly