L7: Non-Linear Regression Flashcards

After working your way through this module, you should be able to: - Explain the limits and constraints of nonlinear methods - Implement and assess different nonlinear models, such as splines, additive models, etc.

1
Q

What is the most basic of non-linear models?

A

The polynomial regression model: we adjust the function of x so that it includes higher degrees of the variables:

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the method behind step functions?

Are these non-linear functions?

A

Yes this is a non-linear function.

The step function will split the data into certain cut-points along the predictor axis (usually with cut-points that are based off of prior knowledge)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are Piecewise Polynomials?

A

This is a spline model type.

It involves different polynomials in regions defined by the ‘knots’. In such a way we have different polynomial functions for X in different parts of its domain.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Name the spline type

Continuous 1st and 2nd derivatives

A

Cubic Spline

A piecewise cubic polynomial with continuous derivatives up to order 2 at each knot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Name the spline type:

Not continuous 1st and 2nd derivatives

Polynomial shape

A

Piecewise Cubic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Name the spline type:

Not continuous 1st and 2nd derivatives

Polynomial shape with sudden change of slop at the knot

A

Continuous Piecewise Cubic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What happens at the boundaries of natural cubic splines?

A

The models become highly variable.

  1. There is less data at the boundaries
  2. Less certainty here

We can add constraints at the ends, e.g. to constrain the outer knots 2nd derivatives to be zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A cubic splint with K knots has how many degrees of freedom? How many parameters?

A

It will have K + 4 parameters / degrees of freedom

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

A natural spline with K knots has how many degrees of freedom?

A

K degrees of freedom

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Explain the following function. What is it? What is the LHS, RHS?

A

This is the smooth spline function.

LHS: This is the Residual Sum of Squares, we minimise this to reduce the error of the model.

RHS: this is the smoothness penalty. This is a penalty that penalises the model if it is too convoluted. (Essentially, the area of the 2nd derivative graph)

Hence, if the area under the graph is large (high amount of change) then the model will be penalised and coerced into being smooth

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Local Regression?

A

Local regression is where localities of data points are given separate functions depending on the position of the neighbourhood.

  1. Gather the fraction of training points closest to xo
  2. Assign weight to each point, within certain distance, far = 0 , close = high
  3. Fit weighted least squares regression of the y based on the neighbourhood by finding B that minimises the local error
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the Generalised Additive Model?

A

The GAM is a general framework for extending a standard linear model by allowing non-linear functions of each of the variables

We go from y = B0 + B1x1 + B2x2 …

to

y = B0 + f1(x1) + f2(x2) …

Such that the f(x) can be a non-linear function of the variable x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly