Interpretation Of Fitted Models Flashcards
Point prediction? (SLRM) and dist?
For an unknown (not observed) response at some x_0
How to form prediction interval (SLRM)
-find dist of μ^^_0 - Y_0
-Standardize μ^^_0 - Y_0 by replacing σ^2 by its estimator to get
a=(1/n + (x_0 - x^-)^2/S_xx)
(1-α)100% PI for Y_0 in SLRM is?
Compare PI to CI for mean response μ_0
PI is wider than CI because to predict a new observation rather than a mean, we need to add variability of additional random error ε_0
Only make predictions for values
Within range of data
To predict a new observation we need to take into account
It’s expectations and also a possible new random error
Point estimator of new observation (below) is (general regression)? And dist?
(1-α)100% PI for Y_0 is? (General regression) vector form
How to write quadratic regression model in vectors
How to compare quadratic regression model w SLRM
-fit Y_i = β_0 + β_1*x_i + β_11 * (x_i)^2 + ε_i
-test H_0: β_11 = 0 against H_1: β_11 != 0
-reject null if quad gives statistically significant better fit
If polynomial model has some higher order terms that are large
Centre x by subtracting it’s mean, eg:
If model is polynomial, which terms do you consider removing first?
Higher order, given that the Lower order is there, eg
-if x_2,i and x^2_2,i are both there, begin w x^2_2,i
Linear form of polynomial regression models
Sequential sum of squares
SS_R - SS_R (without the new variable)
Remember SS_R
Is regression sum of squares (sum((predictions-mean)^2))
How to use sequential sum of squares
Taking SS_R and difference from previous SS_R when adding variables:
If difference is relatively small. Don’t need that variable