Neural nets 2 Flashcards
What problems can SLPs not solve?
XOR-type problems (problems that are not linearly separable)
What is the difference between SLPs and multi-layered perceptrons?
MLPs have hidden layers between the input and output nodes.
What is the benefit of a hidden layer?
A hidden layer of neurons can transform a non-linear problem into a number of simpler, linearly separable problems.
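This decomposition can be made concrete with hand-set weights (chosen for illustration, not learned): XOR is OR-but-not-AND, and each of OR and AND is linearly separable on its own.

```python
def step(z):
    # Threshold activation, as in a simple perceptron
    return 1 if z > 0 else 0

def unit(x1, x2, w1, w2, b):
    # One threshold unit; a single-layer perceptron of this form
    # cannot realise XOR, but it can realise OR and AND
    return step(w1 * x1 + w2 * x2 + b)

def xor_mlp(x1, x2):
    # Hidden layer splits XOR into two linearly separable sub-problems
    h_or = unit(x1, x2, 1, 1, -0.5)    # fires when x1 OR x2
    h_and = unit(x1, x2, 1, 1, -1.5)   # fires when x1 AND x2
    # Output unit: OR but not AND == XOR
    return step(1 * h_or - 1 * h_and - 0.5)

for a in (0, 1):
    for b in (0, 1):
        print(a, b, xor_mlp(a, b))  # prints the XOR truth table
```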
Name a problem with training MLPs
The credit assignment problem: which weights caused the error — the hidden->output layer weights or the input->hidden layer weights?
How do you train MLPs?
Using an error/cost term based on target (t) minus output (y): the weights are adjusted to reduce this error.
What is backpropagation used for?
Backpropagation is used to solve the credit assignment problem. We want to back-chain
the error from the output layer to the input layer to update the weights of the network layer by layer
What are the 8 backpropagation algorithm steps?
1) Present the data vector at the input layer
2) Pass weighted input layer activations to the hidden layer
3) Pass weighted hidden layer activations to the output layer
4) Apply the target value to the output layer
5) Calculate the δo output error values
6) Update the hidden->output layer weights using the δo values
7) For each hidden node, calculate its δh error value using δo
8) Update the input->hidden layer weights using the δh values
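The eight steps above can be sketched on a tiny 2-2-1 network with logistic units. The starting weights, learning rate, and training pair are illustrative values, not from the flashcards; the step numbering in the comments follows the list above.

```python
import math

def f(z):
    # Logistic (sigmoid) activation
    return 1.0 / (1.0 + math.exp(-z))

# Tiny 2-2-1 network with illustrative fixed starting weights (no biases)
w_ih = [[0.5, -0.4], [0.3, 0.8]]   # w_ih[j][i]: input i -> hidden j
w_ho = [0.6, -0.2]                 # hidden j -> output
lr = 0.5
x, t = [1.0, 0.0], 1.0             # one training pair

# Steps 1-3: forward pass
h = [f(sum(w_ih[j][i] * x[i] for i in range(2))) for j in range(2)]
y = f(sum(w_ho[j] * h[j] for j in range(2)))
err_before = 0.5 * (t - y) ** 2

# Steps 4-5: apply the target and compute the output delta
delta_o = (t - y) * y * (1 - y)

# Step 6: update hidden->output weights
w_ho = [w_ho[j] + lr * delta_o * h[j] for j in range(2)]

# Step 7: hidden deltas, back-chained from delta_o
delta_h = [h[j] * (1 - h[j]) * delta_o * w_ho[j] for j in range(2)]

# Step 8: update input->hidden weights
for j in range(2):
    for i in range(2):
        w_ih[j][i] += lr * delta_h[j] * x[i]

# Forward pass again: the error on this example has decreased
h2 = [f(sum(w_ih[j][i] * x[i] for i in range(2))) for j in range(2)]
y2 = f(sum(w_ho[j] * h2[j] for j in range(2)))
print(0.5 * (t - y2) ** 2 < err_before)  # True
```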
Steps 1-3: Forward pass
Steps 4-8: Backward pass
But how do we find the delta (error) values for each layer?
- Use the gradient descent principle
- Use the chain rule (from calculus)
What is a problem with the gradient descent principle?
We can get stuck in a local minimum. We would like to find the global minimum, but with gradient descent we cannot be sure we have found it.
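The local-minimum problem is easy to demonstrate on a toy one-dimensional cost function (chosen for illustration, not from the flashcards): where gradient descent ends up depends entirely on where it starts.

```python
def grad(x):
    # Gradient of f(x) = (x**2 - 1)**2 + 0.2*x, a toy cost function
    # with a local minimum near x = +1 and the global minimum near x = -1
    return 4 * x * (x ** 2 - 1) + 0.2

def descend(x, lr=0.01, steps=2000):
    # Plain gradient descent from starting point x
    for _ in range(steps):
        x -= lr * grad(x)
    return x

print(descend(0.5))    # converges near +1: stuck in the local minimum
print(descend(-0.5))   # converges near -1: finds the global minimum
```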
How do we solve the credit assignment problem?
1) By backpropagating (or back-chaining) network error (and proxy error) terms through the layers of weights in the network using calculus;
2) Each node's activation from the forward pass is retained so that error can be appropriately assigned to the corresponding weights (weights that aren't connected to active nodes can't have caused the error).