4 Backpropagation Flashcards
What is SGD?
Stochastic gradient descent: gradient descent where each update uses the gradient computed on a single randomly chosen training example (or a small sample) rather than the full training set
What is backpropagation?
A method for efficiently computing the gradients of the loss with respect to all parameters of a neural net, by applying the chain rule backwards through the computational graph
What is (mini) batch gradient descent?
A simple optimization strategy: each update uses the average gradient over a small batch of training examples, a compromise between full-batch gradient descent and SGD (see the sketch below)
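A minimal sketch of the mini-batch update loop (NumPy; the grad_loss function, learning rate lr, and batch_size are illustrative assumptions, not values from these cards):

import numpy as np

def minibatch_sgd(w, X, Y, grad_loss, lr=0.1, batch_size=32, epochs=10):
    """Mini-batch gradient descent: each step uses the average gradient
    over a small randomly sampled batch of training examples."""
    m = X.shape[0]
    for _ in range(epochs):
        idx = np.random.permutation(m)            # shuffle once per epoch
        for start in range(0, m, batch_size):
            batch = idx[start:start + batch_size]
            g = grad_loss(w, X[batch], Y[batch])  # average gradient on the batch
            w = w - lr * g                        # gradient step
    return w

With batch_size=1 this reduces to plain SGD; with batch_size=m it is full-batch gradient descent.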
What is the expected loss?
E_{(x,y)~phat_data}[L(yhat, y)]
The expectation means the average over the training examples.
How do we use maximum likelihood (ML) to optimize w?
If we know the distribution our model assigns, we maximize the log-likelihood:
w_ML = argmax_w L(w) = argmax_w sum_{i=1}^m log p(y_i | x_i)
What is p(y|x) in maximum likelihood?
For least squares (regression) it is Gaussian.
For classification it is multinomial.
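A quick check of why those choices match the losses later in these cards (a sketch, assuming a fixed variance sigma^2 in the Gaussian case): if p(y|x) = N(y; f(x), sigma^2 I), then log p(y|x) = -||y - f(x)||^2 / (2 sigma^2) + const, so maximizing sum_{i=1}^m log p(y_i|x_i) is the same as minimizing the squared error sum_{i=1}^m ||y_i - f(x_i)||^2. If p(y|x) is multinomial with predicted probabilities yhat = f(x) and y is one-hot, then log p(y|x) = sum_k y_k log yhat_k, so maximizing it is the same as minimizing the cross-entropy -sum_k y_k log yhat_k.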
What is cross-entropy?
The statistical divergence between the model's output distribution and the examples in the training set.
Maximizing the log-likelihood (LL) is equivalent to minimizing the cross-entropy (CE):
CE = - sum_{i=1}^m log p(y_i | x_i)
What is cross-entropy also called?
The loss or cost function.
How do we calculate loss in practice?
Take the average (1/m) over the presented examples.
We specify a function f that tries to predict p(y|x); the output is yhat = f(x).
MSE loss: L(yhat, y) = (1/m) sum_{i=1}^m ||y_i - yhat_i||^2
Log loss: L(yhat, y) = -(1/m) sum_{i=1}^m y_i log yhat_i
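A minimal NumPy sketch of both losses, averaged (1/m) over the presented examples; the eps clip is an added numerical-stability assumption:

import numpy as np

def mse_loss(y_hat, y):
    """Mean squared error: (1/m) * sum_i ||y_i - y_hat_i||^2."""
    return np.mean(np.sum((y - y_hat) ** 2, axis=1))

def log_loss(y_hat, y, eps=1e-12):
    """Cross-entropy / log loss: -(1/m) * sum_i y_i . log(y_hat_i)."""
    y_hat = np.clip(y_hat, eps, 1.0)   # avoid log(0)
    return -np.mean(np.sum(y * np.log(y_hat), axis=1))

Here y and y_hat are (m, d) arrays; for the log loss, each row of y is a one-hot target and each row of y_hat a predicted probability vector.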
How do we do backpropagation?
Backpropagate the gradients starting from the loss, applying the chain rule at each node of the computational graph.
Draw a computational graph for a 2-layer NN.
h = f1(a1) = f1(w1 x + b1)
yhat = f2(a2) = f2(w2 h + b2)
Draw your graph on paper.
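A sketch of this graph in code, with the backward pass propagating the gradients from the loss back through every node (W1, b1, W2, b2 are the layer parameters w1, b1, w2, b2 above; sigmoid activations for f1, f2 and a squared-error loss are assumptions, since the cards leave them unspecified):

import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

# Forward pass: h = f1(W1 x + b1), y_hat = f2(W2 h + b2)
def forward(x, W1, b1, W2, b2):
    a1 = W1 @ x + b1
    h = sigmoid(a1)
    a2 = W2 @ h + b2
    y_hat = sigmoid(a2)
    return a1, h, a2, y_hat

# Backward pass: start from dL/dy_hat and apply the chain rule node by node
def backward(x, y, a1, h, a2, y_hat, W2):
    dL_dyhat = 2 * (y_hat - y)               # loss L = ||y_hat - y||^2
    dL_da2 = dL_dyhat * y_hat * (1 - y_hat)  # through f2 (sigmoid)
    dL_dW2 = np.outer(dL_da2, h)             # a2 = W2 h + b2
    dL_db2 = dL_da2
    dL_dh = W2.T @ dL_da2
    dL_da1 = dL_dh * h * (1 - h)             # through f1 (sigmoid)
    dL_dW1 = np.outer(dL_da1, x)             # a1 = W1 x + b1
    dL_db1 = dL_da1
    return dL_dW1, dL_db1, dL_dW2, dL_db2

Example usage with small random parameters:

x, y = np.random.randn(3), np.array([1.0])
W1, b1 = np.random.randn(4, 3), np.zeros(4)
W2, b2 = np.random.randn(1, 4), np.zeros(1)
grads = backward(x, y, *forward(x, W1, b1, W2, b2), W2)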
Gradient descent
Batch gradient descent