7.2 Goodness of Fit and Hypothesis Test Flashcards

Question 1

Q

What is the coefficient of determination?

Answer

A

The percentage of the total variation in the dependent variable explained by the independent variable

Question 2

Q

For simple linear regression with one independent variable, how do you compute the coefficient determination?

Answer

A

It is the square of the correlation coefficient

Question 3

Q

What is the calculation for the coefficient of determination that can be used on linear regressions with either a single or multiple independent variables?

Answer

A

R² = RSS / SST

Question 4

Q

What is SST?

Answer

A

SST = Total Sum of Squares

It measures the total variation in the dependent variable

It is equal to the sum of the squared differences between the actual and the mean value of Y

Question 5

Q

What is RSS?

Answer

A

RSS = Regression sum of squares

It measures the variation in the dependent variable that is explained by the independent variable

It is the sum of squared distances between the predicted Y-values and the mean of Y

Question 6

Q

What is SSE?

Answer

A

SSE = Sum of Squared Errors

SSE measures the unexplained variation in the dependent variable

SSE is the sum of the squared vertical distances between the actual Y-values and the predicted Y-values

Question 7

Q

What equals SST?

Answer

A

RSS + SSE = SST

Question 8

Q

What is MSR? (Mean Square of Regression)

Answer

A

MSR = The explained variation divided by K

RSS / k

Where explained variation = RSS and K = degrees of freedom

At Level 1, K will always = 1… so MSR = RSS/1 = RSS

Question 9

Q

What is MSE? (Mean Square of Error)

Answer

A

MSE = SSE / (n-k-1)

It is a variance computation… we are looking at the variance of the error terms

The variance of the actual values against their forecasted values

Question 10

Q

What is the SEE? (Standard Error of Estimate)

Answer

A

The SEE = Square root of MSE

It is the standard deviation of the errors

The lower the SEE, the better the model fit

Question 11

Q

What does a high and a low R² value indicate?

Answer

A

High R² = low SEE (good fit)

Low R² = high SEE (poor fit)

Question 12

Q

What does the F-stat assess?

What is it known as?

How many tails?

Answer

A

How well a set of independent variables (as a group) explain the variation in the dependent variable

A test of overall model significance

It is ALWAYS a one-tailed test (right hand tail)

Question 13

Q

What is the F-test null and alternative hypothesis?

Answer

A

Null: Slope coefficient = to 0

Alternative: Slope coefficient not equal to zero

Question 14

Q

What are the F-test degrees of freedom?

Answer

A

There are two degrees of freedom

Numerator = K

Denominator = N-K-1

Question 15

Q

How do you use a t-test to test explanatory power?

Answer

A

In multi-linear regression, the F-test is used to determine whether any of the independent variables have explanatory power. If the answer is yes, a t-test is used to determine which individual independent value(s) has/have explanatory power