7.2 Goodness of Fit and Hypothesis Test Flashcards

1
Q

What is the coefficient of determination?

A

The percentage of the total variation in the dependent variable explained by the independent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

For simple linear regression with one independent variable, how do you compute the coefficient determination?

A

It is the square of the correlation coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the calculation for the coefficient of determination that can be used on linear regressions with either a single or multiple independent variables?

A

R² = RSS / SST

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is SST?

A

SST = Total Sum of Squares

It measures the total variation in the dependent variable

It is equal to the sum of the squared differences between the actual and the mean value of Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is RSS?

A

RSS = Regression sum of squares

It measures the variation in the dependent variable that is explained by the independent variable

It is the sum of squared distances between the predicted Y-values and the mean of Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is SSE?

A

SSE = Sum of Squared Errors

SSE measures the unexplained variation in the dependent variable

SSE is the sum of the squared vertical distances between the actual Y-values and the predicted Y-values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What equals SST?

A

RSS + SSE = SST

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is MSR? (Mean Square of Regression)

A

MSR = The explained variation divided by K

RSS / k

Where explained variation = RSS and K = degrees of freedom

At Level 1, K will always = 1… so MSR = RSS/1 = RSS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is MSE? (Mean Square of Error)

A

MSE = SSE / (n-k-1)

It is a variance computation… we are looking at the variance of the error terms

The variance of the actual values against their forecasted values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the SEE? (Standard Error of Estimate)

A

The SEE = Square root of MSE

It is the standard deviation of the errors

The lower the SEE, the better the model fit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does a high and a low R² value indicate?

A

High R² = low SEE (good fit)

Low R² = high SEE (poor fit)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does the F-stat assess?

What is it known as?

How many tails?

A

How well a set of independent variables (as a group) explain the variation in the dependent variable

A test of overall model significance

It is ALWAYS a one-tailed test (right hand tail)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the F-test null and alternative hypothesis?

A

Null: Slope coefficient = to 0

Alternative: Slope coefficient not equal to zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the F-test degrees of freedom?

A

There are two degrees of freedom

Numerator = K

Denominator = N-K-1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do you use a t-test to test explanatory power?

A

In multi-linear regression, the F-test is used to determine whether any of the independent variables have explanatory power. If the answer is yes, a t-test is used to determine which individual independent value(s) has/have explanatory power

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do you use the t-test for a hypothesis test for the slope coefficient?

A

It is the difference between the point estimate and the hypothesised value

Divided by standard error

With n-2 degrees of freedom

17
Q

How do you calculate the F-stat?

A
18
Q

How many degrees of freedom RSS?

A

k = 1

19
Q

How many degrees of freedom SSE?

A

n-k-1

= n-2

20
Q

For a test of statistical significance, what with the hypothesised value of b1 always be?

A

b1 = 0