Lecture 4 - Training CNNs Flashcards
Regression
A set of methods for modelling the relationship between a continuous output and input features
Classification
Problem of assigning an input to one of a fixed set of discrete classes
How to measure regression performance
With a loss function such as Mean Absolute Error (MAE) or Mean Squared Error (MSE)
Loss Function: value size when an input is well classified vs not
Well classified = small loss value
Misclassified = large loss value
Negative Log Likelihood Loss
L_p = -log(p), where p is the probability the model assigns to the correct class
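A quick numeric sanity check of this behaviour (plain Python; the probability values are my own illustration):

```python
import math

# NLL is -log of the probability assigned to the correct class.
confident = -math.log(0.9)  # correct class gets high probability -> small loss
uncertain = -math.log(0.1)  # correct class gets low probability  -> large loss
```

The loss grows without bound as the probability assigned to the correct class approaches zero.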
Softmax
Normalises the network output to a probability distribution over the predicted output classes:
softmax(x_i) = exp(x_i) / Σ_j exp(x_j)
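A minimal numpy sketch of that formula; subtracting the max before exponentiating is a standard numerical-stability trick not stated in the card:

```python
import numpy as np

def softmax(x):
    # exp(x_i) / sum_j exp(x_j); shifting by max(x) avoids overflow
    e = np.exp(x - np.max(x))
    return e / e.sum()

probs = softmax(np.array([2.0, 1.0, 0.1]))  # sums to 1; largest logit gets largest probability
```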
Cross entropy loss
-log(softmax(x)_c), i.e. the negative log likelihood of the softmax probability assigned to the correct class c
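Combining the two previous cards, a sketch of cross-entropy computed directly from raw logits (using the log-sum-exp form rather than calling softmax, for numerical stability):

```python
import numpy as np

def cross_entropy(logits, target):
    # -log(softmax(logits)[target]) without forming the probabilities explicitly
    shifted = logits - np.max(logits)
    log_probs = shifted - np.log(np.exp(shifted).sum())
    return -log_probs[target]

loss = cross_entropy(np.array([2.0, 1.0, 0.1]), target=0)
```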
Optimisation
Process of finding the best weights to minimise the loss function
Gradient Descent
Follow the gradient direction with a chosen step size: “walk” towards a minimum.
Learning Rate
Step size of each gradient descent update.
Too low = very slow progress
Too high = instability; may never converge
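A tiny illustration of both failure regimes on the toy loss L(w) = w², whose gradient is 2w (my own example, not from the slides):

```python
def descend(lr, steps=50, w=1.0):
    # repeated gradient descent steps on L(w) = w^2
    for _ in range(steps):
        w = w - lr * (2 * w)  # gradient of w^2 is 2w
    return w

good = descend(lr=0.1)  # |w| shrinks towards the minimum at 0
bad = descend(lr=1.1)   # step overshoots: |w| grows every iteration
```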
Stochastic Gradient Descent
w ← w − α · ∇_w L(x, w)
α here is alpha (the learning rate); the gradient is evaluated on a randomly chosen example or mini-batch
see slides for full equation
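A sketch of SGD fitting y = w·x by least squares, drawing one random sample per update (the data and step count are my own illustration):

```python
import random

data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]  # samples of y = 2x, so the true w is 2
w, alpha = 0.0, 0.05
random.seed(0)
for _ in range(100):
    x, y = random.choice(data)    # "stochastic": one random sample per update
    grad = 2 * (w * x - y) * x    # d/dw of the per-sample loss (w*x - y)^2
    w = w - alpha * grad          # w <- w - alpha * gradient
```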
SGD Weight Decay
Used to prevent the weights from growing too large
w ← w − α(∇_w L(x, w) + γw) + ρ · (last update)
α is alpha (learning rate), γ is gamma; γw is the weight decay regularisation term
SGD Momentum
The ρ · (last update) term
ρ is a coefficient less than 1 that scales the previous update, so each step blends in a decaying memory of earlier steps
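Putting the last two cards together, a sketch of one SGD step with weight decay and momentum (the names sgd_step and vel are my own):

```python
def sgd_step(w, grad, vel, alpha=0.1, gamma=0.01, rho=0.9):
    update = grad + gamma * w   # gradient plus the weight-decay term gamma*w
    vel = rho * vel + update    # blend in rho times the previous update (momentum)
    return w - alpha * vel, vel

# usage: minimise L(w) = (w - 3)^2, whose gradient is 2(w - 3)
w, vel = 0.0, 0.0
for _ in range(200):
    w, vel = sgd_step(w, 2 * (w - 3), vel)
```

Weight decay pulls the answer slightly below the unregularised minimum at 3, since it also penalises large w.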
“Gradient Value” equation: the loss gradient summed over the training examples,
∇_w L = Σ_i ∇_w L(x_i, w)
i.e. the derivative of the loss with respect to the weights, evaluated at each input x_i
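A concrete instance of that sum for the per-sample loss L(x_i, w) = (w·x_i − y_i)² (the data values are my own):

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0])
y = np.array([2.0, 4.0, 6.0])     # generated by y = 2x
w = 0.5
per_sample = 2 * (w * x - y) * x  # dL(x_i, w)/dw for each example
total_grad = per_sample.sum()     # sum_i dL(x_i, w)/dw
```

At the optimum w = 2 every per-sample gradient, and hence the sum, is zero.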