Model Selection Flashcards

1
Q

Cochran’s theorem ?

A

If H_0: all group means are equal, is true
Then
A F can be formed from:
(SSB) group sum of squares, (SSW) within-group sum of squares

F= (SSB/df_between)/(SSW/df_within)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When to transform observations?

A

If model checking suggests variance is not constant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Commonly used transformations of observations

A

And 1/y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Box-Cox transformations

A

This estimates the λ that minimizes sd of standardised transformed variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

First transformations of observation to try?

A

ln (y)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Important thing to remember when transforming observations

A

All y_i must be >0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

If all other transformations fail, try?

A

Trig functions, in particular:
Sin^-1 or Tan^-1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

F test for deletion of subset of variables:
Extra sum of squares?

A

Where β_q,…, β_p-1 are the variables being potentially removed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

F test for deletion of subset of variables:
How to separate variables in vectors

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

F test for deletion of subset of variables:
Find SS_extra in vectors

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

F test for deletion of subset of variables:
Null hypothesis? H_1?

A

Where β_q,…, β_p-1 are variables to be removed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

F test for deletion of subset of variables:
Form F test stat and reject H_0 at α level

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When to use all subsets regression

A

If there is no natural ordering to explanatory variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Given p-1 expl variables, how many possible models are there?

A

2^(p-1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Usual statistics used to compare models?

A

MS_E
R^2
C_p

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

MS_E is?

A

Residual mean square

17
Q

For full model E(MS_E) = ? And why?

A

σ^2
Because all candidate explanatory variables are considered

18
Q

How to model test with MS_E

A
19
Q

R^2 is?

A

Coefficient of determination

20
Q

Adjusted R^2?

A
21
Q

Adding terms to a model has what effect on R^2

A

Always increases

22
Q

How to determine # for p in R^2

A

Plot R^2_(p^~) against p^~ to determine where plot levels off

23
Q

C_p

A

Mallows statistic

24
Q

E(SS^(p^~)_E) =

A
25
Q
A
26
Q

Use mallows stat to estimate MSE of prediction

A
27
Q

Testing With malllows stat we should choose either

A
28
Q

C_(p^~) depends on

A

Unknown σ^2

29
Q

Estimator of mallows stat

A

Take MS_E from full model as estimator of σ^2

30
Q

Expectation of estimator of mallows stat

A
31
Q

Adjusted C_(p^~)

A

Taken from estimator and expectation of estimator

32
Q

When calculating original R^2 to compare predR^2 to, how to get original?

A

R^2 = 1 - (SS_R)/(SS_T)
Where SS_R is Sum of Squared Residuals
And SS_T is total sum of squares

Original is also called Multiple R-squared