Adaptive basis function models Flashcards
What is a basis function and give an example of when we can use domain knowledge to employ one to a model?
A map from the input space to the reals. E.g. gravitation. If we were trying to learn Force from a dataset of masses and radii, we could construct a basis function m1m1/r which would be a better feature.
Explain the aim of an adaptive basis function model
We want to learn basis functions from the data. Our prediction functions will be linear combinations of our basis functions.
In an adaptive basis function model, what is the empirical risk we wish to minimise?
The sum of losses for a function in the hypothesis space.
Can we use gradient descent to minimise this risk in general?
In general, no. Only if each basis function is parameterised by weights and the objective function is differentiable with respect to those weights.
Describe the algorithm of forward stagewise additive modelling.
Start with some set of weak/base learners
- Initialise prediction function
- Update with the linear combination of basis functions which minimise the loss
continue
What are some possible weak learners?
Shallow trees
Explain gradient boosting
The update step can be learned by noting that the objective function only depends on the prediction at the training points. We can take the gradient of J with respect to f_i for each point, and this would give the set of points that our update sum of basis functions should be closest to. Then we just minimise the difference over our hypothesis space and use this as the update step.