3 - Backpropagation in computation graphs Flashcards
Computational graph
eg if f(x,y,z) = (x+y)*z
x = -2
+ (q = 3)
y = 5
* (f = -12)
z = -4
(Imagine lines matching the correct operation)
Backpropagation goes right to left in these
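A minimal sketch of the forward pass through this exact graph (plain Python, values copied from the card above); backprop then walks the same nodes right to left:

```python
# Forward pass through f(x, y, z) = (x + y) * z, using the example values.
x, y, z = -2.0, 5.0, -4.0

q = x + y        # add node: q = 3
f = q * z        # multiply node: f = -12

print(q, f)      # 3.0 -12.0
```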
partial derivative
How much the output changes when one input changes, with the other inputs held fixed
Chain rule
If F(x) = f(g(x)), then F'(x) = f'(g(x))g'(x)
’ means derivative
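A quick numeric sanity check of the chain rule; the functions f(u) = u² and g(x) = 3x + 1 are just my own illustrative choices, not from the course:

```python
# F(x) = f(g(x)) with f(u) = u**2 and g(x) = 3*x + 1
# Chain rule: F'(x) = f'(g(x)) * g'(x) = 2*(3*x + 1) * 3
def F(x):
    return (3 * x + 1) ** 2

x = 0.7
analytic = 2 * (3 * x + 1) * 3                 # chain rule answer
h = 1e-6
numeric = (F(x + h) - F(x - h)) / (2 * h)      # central-difference check

print(analytic, numeric)   # the two values should agree closely
```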
Derivatives: if q = x +y then…
derivative of q w/r to x = 1
derivative of q w/r to y = 1
Derivatives: if f = qz then
derivative of f w/r q = z
derivative of f w/r z = q
General concept for why chain rule is useful in computational graphs
To determine the “effect” of one input on the output, follow the chain from the output to the input
Ie, in the example, to find deriv f w/r x, multiply deriv f w/r q by deriv q w/r x (sketched below)
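Continuing the (x + y) * z example, a sketch of that chain applied as a full backward pass:

```python
# Forward pass (same values as the graph card).
x, y, z = -2.0, 5.0, -4.0
q = x + y            # q = 3
f = q * z            # f = -12

# Backward pass, right to left.
df_df = 1.0                  # gradient of the output w.r.t. itself
df_dq = z * df_df            # deriv f w/r q = z  -> -4
df_dz = q * df_df            # deriv f w/r z = q  ->  3
df_dx = df_dq * 1.0          # chain rule: deriv f w/r q * deriv q w/r x -> -4
df_dy = df_dq * 1.0          # chain rule: deriv f w/r q * deriv q w/r y -> -4

print(df_dx, df_dy, df_dz)   # -4.0 -4.0 3.0
```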
Is a computational graph the same as a neural network?
NO!
A computational graph is much bigger: it breaks the network down into its individual operations, not just layers
Sigmoid derivative
(1 - sig(x))(sig(x))
That’s
(1 - σ(x))σ(x)
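A tiny check (plain Python, x = 0.5 is just an arbitrary test point) that the derivative really is σ(x)(1 - σ(x)):

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

x = 0.5
analytic = sigmoid(x) * (1 - sigmoid(x))                   # σ(x)(1 - σ(x))
h = 1e-6
numeric = (sigmoid(x + h) - sigmoid(x - h)) / (2 * h)      # central-difference check

print(analytic, numeric)   # both ≈ 0.2350
```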
Patterns in backward flow: ADD gate
gradient distributor
Patterns in backward flow: MAX gate
Gradient Router
Patterns in backward flow: MUL gate
gradient switcher
EG
x*y where x=3 and y=-4 and the gradient is 2
would mean that x has -8 (-4 × 2) and y has 6 (3 × 2)
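A sketch of what all three gate patterns do with an upstream gradient, using the numbers from this example (the variable names are just illustrative):

```python
upstream = 2.0
x, y = 3.0, -4.0

# ADD gate: distributor - passes the same gradient to both inputs.
add_dx, add_dy = upstream, upstream              # 2, 2

# MAX gate: router - all of the gradient goes to the larger input.
max_dx = upstream if x > y else 0.0              # 2 (x won)
max_dy = upstream if y > x else 0.0              # 0

# MUL gate: switcher - each input gets the *other* input times the gradient.
mul_dx = y * upstream                            # -4 * 2 = -8
mul_dy = x * upstream                            #  3 * 2 =  6

print(mul_dx, mul_dy)   # -8.0 6.0, matching the card above
```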
Jacobian Matrix
The matrix whose entries are the derivative of each element of z (the output) w/r to each element of x (the input)
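A small numpy sketch of a Jacobian; the function z = x² (elementwise) is my own toy choice, and for an elementwise op the Jacobian comes out diagonal:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0])

def f(x):
    return x ** 2          # elementwise square, so z[i] depends only on x[i]

# Build the Jacobian numerically: J[i, j] = d z[i] / d x[j]
h = 1e-6
n = x.size
J = np.zeros((n, n))
for j in range(n):
    bump = np.zeros(n)
    bump[j] = h
    J[:, j] = (f(x + bump) - f(x - bump)) / (2 * h)

print(np.round(J, 3))
# [[2. 0. 0.]
#  [0. 4. 0.]
#  [0. 0. 6.]]   -> diagonal because the op is elementwise
```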
Steps of training a simple net
Forward pass
Compute loss
Propagate loss backwards
Step the optimiser (ie update parameters)
reset all the gradients to 0
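A minimal training-loop sketch, assuming PyTorch; the model, data, and learning rate are placeholder choices that just make the five steps concrete:

```python
import torch
import torch.nn as nn

# Placeholder model and data, only to make the five steps concrete.
model = nn.Linear(10, 1)
criterion = nn.MSELoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
inputs, targets = torch.randn(32, 10), torch.randn(32, 1)

for epoch in range(100):
    outputs = model(inputs)              # 1. forward pass
    loss = criterion(outputs, targets)   # 2. compute loss
    loss.backward()                      # 3. propagate loss backwards
    optimizer.step()                     # 4. step the optimiser (update parameters)
    optimizer.zero_grad()                # 5. reset all the gradients to 0
```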