Regression Flashcards

1
Q

Variable types

A

Independent/explanatory- this is the one we control
Dependent/response- this is the one we measure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Residuals

A

It is the vertical distance of a datapoint to the line of best fit
Residual= Observed- expected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Least squares regression lie

A

y=a+bx
A line of best fit such that-
Sum of residuals= 0
Sum of squares of residuals is as small as possible
Menu 6-2 OPTN 3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Interpretation of a and b

A

a- expected value of y when x is 0
b- expected change in y for every 1x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Outliers and Anomalies

A

An outlier is any data pout with a residual of more than 3 s.d of y.
Do not remove outliers unless they are anomalies which are proven to not belong

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How to predict it

A

Either sub x into the equation
Or, use ŷ (OPTN down 4 5)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Reliability

A

Interpolation- predict within the data (reliable)
Extrapolation- predict outside data
(not reliable)
Small residuals- reliable
Large residuals- not reliable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly