Lecture 9 - Univariate Optimisation Flashcards
What is optimisation?
Finding the maximum or minimum of a function
What is the objective function for least squares?
The sum of the residuals squared
How to find maximum (log)likelihood?
Propose a model f(x, θ) for the data.
Form a likelihood - L(θ) = the product of f(xi, θ) (OBJECTIVE FUNCTION)
Find the parameter values which maximise the likelihood
What is a systematic line search?
Start by bracketing the maximum (find points [a, b] which are known to contain the maximum)
Divide this interval into a regular set of values x, spaced e apart.
Evaluate function at each x
Choose x that has max g(x)
What are pros and cons of iterative methods?
Pro:
Can be considerably more efficient
Con:
Can be less reliable
What is the general iterative algorithm?
- Start at iteration t=0 begin with an intial guess x(0)
- Start next iteration, t=t+1
- At iteration t, produce an improved guess, x(t) using an updating equation, x(t) = f(x(t-1))
- Evaluate whether new guess is sufficiently accurate using stopping rule
- If yes, stop and return x* = x(t) and g(x*) = g(x(t))
- If no consider whether to report no convergence, otherwise go to back to step 2.
What is the algorithm for the bisection method?
- Start an iteration t=0 with (a(0), b(0))
- Set starting value x(0) = (a(0) + b(0))/2
- Start next iteration t=t+1
- Update (a(t), b(t), x(t)) according to:
- If stopping rule met stop and return x* = x(t)
else go to 3.
What is the bisection method?
Simple example of an iterative method for finding the root g’(x) =0
What is implied if g’(x) is continuous on some interval [a,b] and g’(a)g’(b) <= 0
There is at least one x* in [a,b] for which g’(x) = 0 making g(x) a local optimum.
What is convergence order?
Index of how fast convergence happens
What stopping criteria are not reliable?
Criteria based on how close g’(x) is to 0
What are the two main types of convergence criteria?
Absolute and relative
What is absolute convergence criteria?
Stop when |x(t) - x(t-1)| < e, for some desired maximum imprecision e.
What is the problem with absolute convergence criteria?
There is always the same absolute imprecision regardless of the size of numbers.
What is convergence criterion?
( |x(t) - x(t-1)| / |x(t-1)| ) < e