Chapter 17 Multiple Regression Flashcards

1
Q

Response surface

A

Graphical depiction of a regression model equation

Can only be drawn if number of independent variables are 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Required conditions for error variable

A

1) probability distribution of e is normal
2) the mean of the distribution is 0
3) the standard deviation of e is sigma e, which is a constant
4) the errors are independent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Multiple regression equation

A

Calculated value of y = b0 + b1x1 +b2x2 +…+ bkxk

Where k is the number of independent variables

Some independent variables may be functions of others

Must determine if model fits to determine if analysis is worth continuing vs the model needs importing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Multiple regression step 1

A

Select the independent variables that you believe may be related to the dependant variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Reasons not to include every available integral variable in a multiple regression model

A
  • objective to determine if hypothesized model is valid (determine if there is a relationship between variables) so want to include only independent variables that may affect the dependant variable
  • increasing number of independent variables increases the probability for type 1 errors (rejection of a true null hypothesis)
  • Due to multicollinearity - possible to conclude that none of the independent variables are literally related to the dependant variable when 1 or more actually are related
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Multiple regression step 2: excel calcuations

A

Input data into sheet so that independent variables are in adjacent columns

  • data, data analysis , regression
    • specify input y range (dependent) and x range (independent)

Coefficients listed give the b value for that x value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Multiple regression in excel

A

Input data into sheet so that independent variables are in adjacent columns

  • data, data analysis , regression
    • specify input y range (dependent) and x range (independent)

Coefficients listed give the b value for that x value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Multiple regression step 3: assess the model

A

Three assessments

  • standard error of estimate
  • coefficient of determination
  • f-test of the analysis of variance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Standard error of estimate

A
  • given by excel regression statistics data

Se= square root of (SSE/n-k-1)

n= sample size
k = number of independent variables
SSE =sum of squares for error = (n-1)*(sample variance of the dependant variable - (sample covariance ^2 / sample variance of the independent variable)

SSE also reported as standard error in excel regression analysis

Judged against values for the dependant variable (especially mean) for relative largeness or smallness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Coefficient of determination

A

R^2 = 1 - (SSE/ (sum of all values of y - mean of y)^2))

Or in excel regression analysis: R square value

Gives percentage of total variation in dependant variable explained by the independent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Adjusted R square

A

Coefficient of determination adjusted for degrees of freedom

Takes into account sample size and the number of independent variables (if number of independent variables is large relative to the sample size then unadjusted R2 may be unreasonably high)

Adjusted R square= 1- ((SSE/n-k-1)/((sum of all values y - mean y)^2/ (n-1))

Aka 1- (MSE/ standard deviation of y)

MSE : mean of the sum of squares for error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Testing validity of multiple regression model

A

H0= independent variables = 0 (means none of the independent variables are related to y, model is invalid)

H1: at least one b value (independent variable) does not equal 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Total variation in y

A

Made up of SSR + SSE

SSR = variation explained by regression model
SSE= unexplained variation

If SSR large relative to SSE model is relatively good

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Mean square

A

Sum of squares / degrees of freedom

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

F statistic

A

Ratio of mean squares as long as underlying population is normally distributed

Ratio of two mean squares is f distributed as long as the underlying population is normal

Shown in the ANOVA table in excel (data regression analysis)

Large value f = model is valid (most of the variation in y is explained by the regression equation)

Small value of f= most of the variation is unexplained

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Rejection region for F statistic

A

F> F(a, k, n-k-1)