Basics of Multiple Regression Flashcards

1
Q

When should you use Logistic regression models?

A

If the dependent Y variable is discrete

If out independent X variables is qualitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When should you use Multiple regression models?

A

When the dependent variable is continuous (not discrete) and there is more than one explanatory variable (more than one dependent variable).

When multiple independent variables determine the outcome of a single dependent variable.

Dependent Y Variable is continuous
We have more than 1 Dependent Y variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Assumption of Regression models

A

L.I.I.N.H.

Linearity: Relationship between dependent Y variable and Independent X variable is linear.

Independent of Errors: Regression residuals are uncorrelated across observation.

Independent: Independent X variable is not random, there is no exact linear relationship between 2 or more independent variables.

Normality: Regression residuals are normally distributed.

Homoscedasticity: Constant variance of regression residuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How to determine if a variable is significant?

A

|T-Stat| > 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Degrees of freedom for SSR

A

N-k

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Degrees of freedom for SST

A

N-1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Degrees of freedom for SSE

A

N-K+1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What will happen to adjusted R-Square if we have insignificant varibles

A

Adjusted R-Square decreases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

R-Square formula

A

SSR/SST = Explained Variation / Unexplained variation

1-(unexplained variation/total variation)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What kind of test is this?

H0: bi = Bi
Ha: bi /= Bi

A

Two tail test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What kind of test is this?

H0: bi <= Bi
Ha: bi > Bi

A

Right tail test

<= - is heading right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What kind of test is this?

H0: bi => Bi
Ha: bi < Bi

A

Left tail test

=> is heading left

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Formula and purpose of AIC

A

AIC = n * ln(SSE/n)+ 2(K+1)

AIC is better for forecasting purposes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Formula and purpose of BIC

A

BIC = n * ln(SSE/n) + Ln(n)(k+1)

Better for evaluating goodness-of-fit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do we test joint coefficients?

A

F-Stat
[(SSE restricted - SSE unrestricted) / q] / (SSE unrestricted / N-k-1)

alternative formula…
(SSE restricted - SSE unrestricted) x ( N-K-1) / (SSE unrestricted x Q)

-SSE restricted: Model 1 does not include the two variables we want to test, so it is the restricted model.
SSE unrestricted: Model 2 that includes the two variables we want to test, so it is the unrestricted model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Adjusted R square formula

A

1 - [(n-1)/(n-k-1)] x (1-R Squared)

17
Q

Holding all other variables constant, the adjusted R-Square will decrease when all of the following variables increase expect…

A

The number of observation

18
Q

Formula For F-Test

A

MSR / MSE

[ RSS / K ] / [SSE / n-(k+1) ]

[ Regression / K ] / [ Residual / n-(k+1) ]

19
Q

Hypothesis test for F test

A

F test > F stat : Reject null. b1 = b2 = bn = 0

F test < F stat : Fail to reject null. b1 =/ b2 =/ bn =/ 0

20
Q

Are the coefficents correlated?

y=2+3x1

y=1.5+2x1+ 3x2

A

Yes, because when we added an aditional coefficient, their values changed

21
Q

The null hypothesis for F-test

A

All regression coefficients are equal to zero. In other words, none of the independent variables have a significant effect on the dependent variable.

H0 = β1=β2 =β3 =⋯=βk=0

This implies that the model has no explanatory power, and the variation in the dependent variable 𝑦.
y is not explained by the independent variables.

22
Q

The Alternative hypothesis for F-test

A

At least one regression coefficient is different from zero. In other words, at least one independent variable has a significant effect on the dependent variable.

H1:Atleastoneβi /=0 (forsomei=1,2,…,k)

23
Q

How Calculate the joint F-statistic

A

[(SSE of restricted model−SSE of unrestricted model)/𝑞] / SSE of unrestricted model/(𝑛−𝑘−1)

restricted model: The model that does not include the coefficient. Value is obtained from SS in Residual row.