w2 Flashcards

1
Q

Simple Linear Regression

A

A single independent variable predicting a single dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

linear equation

A

y = B0 + B1X + e

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Error Term

A

variability of y that can’t be explained by the relationship between x and y; apart of the linear equation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Y hat

A

predicted outcome using the linear regression equation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Extrapolation

A

using a model to predict ŷ using an independent variable outside of the dataset used to create the model; bad

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

SSE

A

Sum of Squared Errors; difference between actual values and predicted values; bad.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

SSR

A

Sum of Squared Regression; difference between predicted values and mean; good.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

SST

A

Total Sum of Squares; difference between actual values and mean ; SSR + SSE.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Coefficient of Determination

A

R squared; SSR/SST; “The percentage of variability in relationship between independent and dependent variability explained by the independent variable.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Multiple Regression Equation

A

y = B0 + B1X1 + B2X2 … + e

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Multiple Regression

A

Multiple independent variables and one dependent variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Interpret coefficient of 0.31 for mileage onto car price

A

For every additional mile of mileage, car price increases by $0.31, ceteris paribus.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Adjusted R squared

A

Penalizes use of large datasets; R squared is naturely inflated with large datasets and this counteracts that effect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

3 tests to analyze correlation

A
  1. Check residuals plots; residuals (y) against each independent variable and the predicted values (x); should be random.
  2. Check that f-test is significant, rules out the null for the model.
  3. Check that t-test is significant, rules out the null for each independent variable.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Multicollinearity

A

problem of strong correlation between two independent variables; cannot exceed 70; fix is to take one of those matching variables out.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly