L7 - Gradient Descent, Convex Optimisation, Bracketing Flashcards

1
Q

What is the purpose of Gradient Descent? What type of function is it?

A

An iterative optimisation function to determine the minimum point of a convex set. It is a Convex Function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How does Gradient Descent work to solve Convex Optimisation?

A

Algorithm iteratively tunes the parameters in order to iterate towards the minimum point. As algorithm nears the minimum point, the jumps towards minimum decrease so as not to overshoot.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

In the context of ML, what does the gradient represent?

A

The relationship between independent and dependent variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is tuned in order to establish the minimum of a convenient function?

A

Parameter of the GD algorithm.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

In ML, what is GD used to establish?

A

Minimum of the models cost function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Upon each iteration of GD, what happens to the size of the steps towards the min. value? Why is this needed?

A

Size of step decreases to safeguard against overshooting the minimum point.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a Convex Optimisation problem?

A

A problem in which we want to find the minimum or maximum of a convex function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What determines if a function is convex?

A

A function is convex if for any 2 points in the domain, a line drawn between the points is always above the function graph.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How many minimum points does a convex function have?

A

1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do we use Jensens Inequality or the Second Derivative Test to determine whether a function is convex?

A
  1. Jensens Inequality - If for any 2 X and Y points in the domain, and for all 0 < t < 1, the line connecting the points is entirely above the function graph.
    1. Second Derivative Test - If the second derivative of the function is non-zero for all X’s in the domain, the function is known to be convex.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the epigraph of a function? How can we used it to determine whether a function is convex?

A

The set of points in the domain the are located above the function graph.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Give the definition of a Convex Set…

A

A set in which for any point in the set, the point if above the function graph.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

At the minimum of a convex function, what is the value of the derivative?

A

0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What techniques do we use to find the minimum of a non-convex function?

A

Downhill Simplex or Bracketing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Explain Bracketing…

A

Method used to find the minimum of a non-convex set in 1-d space. Start by establishing initial points either side of the minimal area (we must know the rough area of minimum). Iteratively decrease the bracket size, converging towards the minimum point. Repeat until convergence or timeout.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Define Jensens Inequality and how it proves a function is convex…

A

States that a function is convex for is any 2 points on the secant line, the points lie above the graph of the function.

17
Q

What is the formula of Jensens Inequality?

A

For any X1 and X2 in the function domain …

tf(X1) + (1-t)f(X2) > f(tX1 + (1-t)X2), for all 0 < t < 1

18
Q

A function is convex iff…

A

It’s epigraph is a convex set

19
Q

What bracketing, there must be zero-crossing, what does this mean?

A

The bracketing points must be either side of 0.

20
Q

When bracketing, what is Bisection?

A

Iteratively narrow the bracket (a,b) to find new points an and b until convergence of minimum point is reached.

21
Q

What value should we shrink the bracket by?

A

0.382 (Golden ratio)

22
Q

What is Brent’s Method?

A

A method of parabolic interpolation in order to approximate the minimum value of a convex function.