Splines Flashcards

Question 1

Q

Spline function definition

Answer

A

[a,b] -> R with m knots (m-2 internal) of degree l if:
- (l-1) order continuously differentiable;
- Within each m-1 interval, different polynomial of degree l.

Question 2

Q

Number of free parameters in splines

Answer

A

d = l + m - 1

Question 3

Q

Truncated power basis

Answer

A

Bj(z) =
- z^j j=1,…l+1
- (z-k_j-1)^l+ j=l+2,…d

Question 4

Q

B-splines definition

Answer

A

Defined recursively
- Values > 0 only between a pair of knots
- Σ Bj(z) = 1 ∀ z
- ∀ z &in; [a,b] only (l+1) bases are > 0.

Question 5

Q

MLE for B-splines

Answer

A

β^ = (Z’Z)^-1Z’y

Question 6

Q

Model selection for splines

Answer

A

LRT only if using truncated power basis for add/removal of knots.
All other cases (B-splines, movement of knots), model selection criteria.

Question 7

Q

P-splines definition

Answer

A

We start with a high number of m-2 internal knots;
We remove some using the penalized log-likelihood pl(β,σ²) = l(β,σ²) - λ/2 J_i(β) = -n/2 ln(σ²) - n/(2σ²) (y-Zβ)’(y-Zβ) - λ/2 β‘K_iβ.
If λ=0 we have B-splines with no penalization.
If λ -> + &infty; we tend to have only constant functions (very smooth ones).

Question 8

Q

Order differences

Answer

A

For B-splines, we can use:
J₁(β) = Σ_j=2^d (β_j - β_j-1)² = βK₁β’
J₂(β) = Σ_j=3^d (β_j - 2β_j-1 + β_j-2)² = βK₂β’

K_{i, dXd} matrix made of differential matrices D^(d)_{1, (d-1)Xd} and D^(d-1)_{1, (d-2)Xd}

Question 9

Q

MLE for P-splines

Answer

A

β^ = (Z’Z - ΛK_i)^-1Z’y
- Λ = λσ²
Equivalent to minimizing the penalized least squares: PLS = (y-Zβ)’(y-Zβ) - Λβ‘K_iβ

Question 10

Q

Fitted values P-splines

Answer

A

f^ = y - Zβ^ = y - Z(Z’Z-ΛK)^-1Z’y = [I - Z(Z’Z-ΛK)^-1Z’]y = [I - S]y
S: Smoother matrix

Question 11

Q

Effective number of parameters P-splines

Answer

A

df(S) = trace(S)
{ edf in output }

Question 12

Q

Estimator for σ² P-splines

Answer

A

[Σ(y_i - f^(z_i))²]/[n - df(S)]
If ML (for AIC and BIC): 1/n * Σ(y_i - f^(z_i))²]

Question 13

Q

Choice of best Λ

Answer

A

CV = 1/n Σ [(y_i - f^(z_i))/(1 - S_ii)]²
GCV = 1/n Σ [(y_i - f^(z_i))/(1 - df(S)/n)]²
AIC or BIC, remembering σ²_ML^ and number of parameters = df(S)+1

Question 14

Q

Inference basis for P-splines

Answer

A

Y|Z ∼ N_n (f, σ²I_n) that leads to:
f^|Z ∼ N_n (Sf, σ²SS)
- P-splines are biased estimators;
- Produced p-values for hypothesis testing are anticonservative: they underestimate the true values.

Question 15

Q

B-splines in R

Answer

A

library(splines)
internal = m-2
location = quantile(z, probs=(1:internal)/(internal+1))
bsp = bs(z, knots=location, degree=l, intercept=F)
#Equidistant knots: bsp = bs(z, df=d, degree=l)
fit = lm(y~bsp) #Or -1 if intercept=T

Question 16

Q

P-splines in R

Answer

Study These Flashcards

A

library(mgcv)
smoothers = s(z, bs=”ps”, k=m, m=c((l-1), i), sp=lambda)
fit = gam(y~smoothers)
#edf = df(S) effective number of parameters
#edf+1 total coefficients

Splines Flashcards

(16 cards)