Model Selection Flashcards
Cochran’s theorem?
If H_0 (all group means are equal) is true,
then an F statistic can be formed from
SSB (between-group sum of squares) and SSW (within-group sum of squares):
F = (SSB/df_between) / (SSW/df_within)
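What Cochran’s theorem gives here (a sketch; k = number of groups and N = total number of observations are assumed symbols, not from the notes):
Under H_0, SSB/σ^2 ~ χ^2_(k-1) and SSW/σ^2 ~ χ^2_(N-k), independently, so
F = (SSB/(k-1)) / (SSW/(N-k)) ~ F_(k-1, N-k)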
When to transform observations?
If model checking suggests variance is not constant
Commonly used transformations of observations
e.g. ln(y), √y, and 1/y
Box-Cox transformations
Estimate the λ that minimises the standard deviation of the standardised transformed variable
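The usual form of the transformation (standard definition, notation assumed):
y^(λ) = (y^λ - 1)/λ for λ ≠ 0, and y^(λ) = ln(y) for λ = 0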
First transformation of observations to try?
ln(y)
Important thing to remember when transforming observations
All y_i must be >0
If all other transformations fail, try?
Inverse trigonometric functions, in particular:
sin^-1 or tan^-1
F test for deletion of subset of variables:
Extra sum of squares?
Where β_q, …, β_(p-1) are the coefficients of the variables being potentially removed
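A standard definition (using SS_E for each model's residual sum of squares; this is the usual textbook form, assumed here rather than quoted from the notes):
SS_extra = SS_E(reduced model, with β_q, …, β_(p-1) removed) - SS_E(full model)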
F test for deletion of subset of variables:
How to separate the variables in vector (matrix) notation?
F test for deletion of subset of variables:
Find SS_extra in vector (matrix) notation
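One common matrix formulation covering the two cards above (the partition and hat-matrix notation are assumptions, not necessarily the notes' own):
Partition X = (X_1 | X_2) and β = (β_(1)', β_(2)')', where β_(2) = (β_q, …, β_(p-1))' holds the coefficients to be removed.
Then SS_extra = y'(H - H_1)y, with H = X(X'X)^(-1)X' and H_1 = X_1(X_1'X_1)^(-1)X_1'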
F test for deletion of subset of variables:
Null hypothesis? H_1?
H_0: β_q = … = β_(p-1) = 0; H_1: at least one of β_q, …, β_(p-1) ≠ 0,
where β_q, …, β_(p-1) are the coefficients of the variables to be removed
F test for deletion of subset of variables:
Form the F test statistic and reject H_0 at level α
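A standard form of this test (q parameters retained, p in the full model, n observations; the symbols follow the β_q, …, β_(p-1) indexing above but the exact layout is assumed):
F = (SS_extra/(p - q)) / (SS_E(full)/(n - p));
reject H_0 at level α if F > F_(p-q, n-p; α)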
When to use all subsets regression
If there is no natural ordering to explanatory variables
Given p-1 explanatory variables, how many possible models are there?
2^(p-1)
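For instance (purely illustrative arithmetic): with p-1 = 3 explanatory variables x_1, x_2, x_3 there are 2^3 = 8 candidate models, from the intercept-only model up to the model containing all three variables.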
Usual statistics used to compare models?
MS_E
R^2
C_p
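A minimal computational sketch of all-subsets regression reporting these three statistics, assuming numpy only; the function name fit_stats, the simulated data, and all variable names are illustrative, not from the notes:

```python
import itertools
import numpy as np

def fit_stats(X_full, y):
    """Fit every subset of the columns of X_full and report MS_E, R^2 and C_p."""
    n, k = X_full.shape                      # k = p - 1 candidate explanatory variables
    ss_t = np.sum((y - y.mean()) ** 2)       # total sum of squares SS_T

    # sigma^2 is estimated by MS_E from the full model (as on the later cards)
    full = np.column_stack([np.ones(n), X_full])
    resid_full = y - full @ np.linalg.lstsq(full, y, rcond=None)[0]
    sigma2_hat = np.sum(resid_full ** 2) / (n - full.shape[1])

    rows = []
    for r in range(k + 1):
        for subset in itertools.combinations(range(k), r):
            Xs = np.column_stack([np.ones(n)] + [X_full[:, j] for j in subset])
            beta = np.linalg.lstsq(Xs, y, rcond=None)[0]
            ss_e = np.sum((y - Xs @ beta) ** 2)
            p_tilde = Xs.shape[1]                        # parameters, including intercept
            ms_e = ss_e / (n - p_tilde)                  # residual mean square
            r2 = 1 - ss_e / ss_t                         # coefficient of determination
            c_p = ss_e / sigma2_hat - (n - 2 * p_tilde)  # Mallows' C_p (plug-in form)
            rows.append((subset, p_tilde, ms_e, r2, c_p))
    return rows

# Illustrative usage with simulated data
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))
y = 2 + 1.5 * X[:, 0] - X[:, 1] + rng.normal(size=50)
for subset, p_tilde, ms_e, r2, c_p in fit_stats(X, y):
    print(subset, p_tilde, round(ms_e, 3), round(r2, 3), round(c_p, 2))
```

A quick sanity check: for the full model this gives C_(p̃) = p̃ exactly, since SS_E(full)/MS_E(full) = n - p.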
MS_E is?
Residual mean square
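In symbols (with p̃ = the number of parameters in the candidate model, an assumed reading of the notation used below):
MS_E = SS_E/(n - p̃)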
For full model E(MS_E) = ? And why?
σ^2
Because all candidate explanatory variables are included, so the full model is assumed to be unbiased
How to compare models using MS_E?
Plot MS_E^(p̃) against p̃ and choose the model where MS_E is smallest or levels off
R^2 is?
Coefficient of determination
Adjusted R^2?
R^2_adj = 1 - [(n - 1)/(n - p̃)](1 - R^2), which penalises the number of parameters p̃ and so can decrease when a term is added
Adding terms to a model has what effect on R^2?
It never decreases (in practice it almost always increases)
How to determine the number of parameters p̃ using R^2?
Plot R^2_(p̃) against p̃ and choose the point where the plot levels off
C_p
Mallows' statistic
E(SS_E^(p̃)) = ?
Use Mallows' statistic to estimate the MSE of prediction
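A sketch of the standard development behind these two cards (textbook notation, assumed rather than quoted from the notes):
E(SS_E^(p̃)) = (n - p̃)σ^2 + Σ_i [E(ŷ_i) - E(y_i)]^2, the second term being a squared-bias term that vanishes if the p̃-parameter model is unbiased;
then C_(p̃) = SS_E^(p̃)/σ^2 - (n - 2p̃) satisfies E(C_(p̃)) = Γ_(p̃), the scaled mean squared error of prediction E[Σ_i (ŷ_i - E(y_i))^2]/σ^2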
Testing with Mallows' statistic we should choose either
models with small C_(p̃) or models with C_(p̃) close to p̃
C_(p̃) depends on
Unknown σ^2
Estimator of Mallows' statistic
Take MS_E from the full model as the estimator of σ^2
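The resulting plug-in form (a standard version, notation assumed):
Ĉ_(p̃) = SS_E^(p̃)/MS_E(full) - (n - 2p̃)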
Expectation of the estimator of Mallows' statistic
Adjusted C_(p̃)
Derived from the estimator and the expectation of the estimator
When calculating the original R^2 to compare the predicted R^2 (pred-R^2) against, how is the original obtained?
R^2 = 1 - SS_R/SS_T,
where SS_R is the sum of squared residuals (the SS_E used above)
and SS_T is the total sum of squares.
The original R^2 is also called the multiple R-squared
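For comparison, a standard PRESS-based definition of the predicted R^2 referred to above (notation assumed):
pred-R^2 = 1 - PRESS/SS_T, where PRESS = Σ_i (e_i/(1 - h_ii))^2, with e_i the ordinary residuals and h_ii the leverages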