Lecture 15 Flashcards

1
Q

What can we tell about our gradient descent if the slope of the tangent line is positive?

A

the minimum must be to the left of the point

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How to we get to the minimum when the slope of the tangent line at f(t) is positive?

A

decrease t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What can we tell about our gradient descent if the slope of the tangent line is negative?

A

the minimum must be to the right of the point

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How to we get to the minimum when the slope of the tangent line at f(t) is negative?

A

increase t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do we begin to minimize f(t)?

A

start with an initial guess t_0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the equation to update t and reach the minimum?

A

t_1 = t_0 - df/dt (t_o) where the derivative will go in the opposite direction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is alpha?

A

the learning rate or step size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is t_0?

A

the initial guess

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the equation for using the update rule?

A

t_i+1 = t_i - alpha df/dt (t_i)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is convergence?

A

when t doesn’t change much

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the formal definition of gradient descent?

A

a numerical method for finding the input to a function f that minimizes the function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why is it called gradient descent?

A

the gradient is the extension of the derivative to functions of multiple variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a numerical method?

A

a technique for approximating the solution to a mathematical problem, usually using a computer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is gradient descent widely used in?

A

machine learning, to train models from linear regression to neural networks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a convex function?

A

one where there is only one minimum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a non-convex function?

A

a function that has multiple minima and maxima