Model Performance Flashcards

1
Q

Coefficient of Determination

A

Also known as r-squared (w/ multiple explanatory variables), this measures how well a linear regression line fits the observed values, a larger number indicates a greater fit. More formally, it is defined as the proportion of variance in the response variable that is predictable from the explanatory variable(s). 0 to 1 measure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Residual standard error (RSE)

A

measures the typical size of the residuals; smaller is better

Measures the difference between predicted and observed response variable; this is always measured in the units of the response variable; for example, an RSE of 20 would mean the typical difference b/w predicted and observed response variable is 20 units

Not sure why this is called error. Found disagreement when looking it up and it isn’t always referred to as such. It really should be called mean error of the residuals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Degrees of Freedom

A

An estimate of the number of independent pieces of information that went into calculating an estimate; how many calculations have the freedom to vary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Root-mean-square error (RMSE)

A

Close to residual squared error (RSE) but doesn’t take into account the # of coefficients

Worse than the RSE for comparisons b/w models; should usually just use RSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Residual

A

Difference between an observed value and the mean value that a model predicts for that observation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly