Week 9 Flashcards
What is the perceptron?
Consists of a set of weighted connections, the neuron (incorporating the activation function) and an output axon.
The activation function is the Heaviside (threshold) function.
How does a perceptron learn?
Initialise weights and threshold
Present input and desired output
Calculate actual output of the network:
For each input, multiply the input data x_i by its weight w_i
Sum the weighted inputs and pass the result through the activation function
Adapt the weights:
If output correct: w_i(t+1) = w_i(t)
If output 0 but should be 1: w_i(t+1) = w_i(t) + x_i(t)
If output 1 but should be 0: w_i(t+1) = w_i(t) - x_i(t)
Modified version of learning
The weight update can use a learning rate a between 0.0 and 1.0 to slow learning. It multiplies the input data, so w_i(t+1) = w_i(t) + a*x_i(t) (see the sketch below).
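A minimal Python sketch of this training loop, assuming the threshold is folded into a bias term (all names are illustrative, not from the lectures):

```python
import numpy as np

def step(x):
    return 1 if x >= 0 else 0              # Heaviside / threshold activation

def train_perceptron(X, targets, a=0.1, epochs=20):
    w = np.zeros(X.shape[1])               # weights
    b = 0.0                                # bias stands in for the threshold
    for _ in range(epochs):
        for x, t in zip(X, targets):
            y = step(np.dot(w, x) + b)     # actual output
            if y == 0 and t == 1:          # output 0, should be 1: add a*x
                w, b = w + a * x, b + a
            elif y == 1 and t == 0:        # output 1, should be 0: subtract a*x
                w, b = w - a * x, b - a
            # correct output: weights left unchanged
    return w, b

# AND is linearly separable, so this converges
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
w, b = train_perceptron(X, np.array([0, 0, 0, 1]))
```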
What is the Widrow-Hoff learning rule?
Weight updates are proportional to the error made:
delta = desired output - actual output
w(t+1) = w(t) + a * delta * x(t)
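A sketch of the same rule in code, assuming the unit's raw linear output is used during training (as in ADALINE); names are illustrative:

```python
import numpy as np

def train_delta(X, targets, a=0.1, epochs=50):
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(epochs):
        for x, t in zip(X, targets):
            y = np.dot(w, x) + b       # linear output used for the error
            delta = t - y              # desired output - actual output
            w += a * delta * x         # w(t+1) = w(t) + a * delta * x(t)
            b += a * delta
    return w, b
```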
Limitations of the perceptron
A perceptron can only solve linearly separable problems (those where a straight line separates the two classes).
XOR is not linearly separable, so a single perceptron cannot learn it (see the check below).
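A small illustrative check, purely for demonstration: a brute-force grid search over threshold-unit weights finds a separator for AND but none for XOR.

```python
import numpy as np

def separable(targets, grid=np.linspace(-2, 2, 41)):
    X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
    for w1 in grid:
        for w2 in grid:
            for b in grid:
                y = (X @ np.array([w1, w2]) + b >= 0).astype(int)
                if np.array_equal(y, targets):
                    return True
    return False

print(separable(np.array([0, 0, 0, 1])))  # AND -> True
print(separable(np.array([0, 1, 1, 0])))  # XOR -> False
```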
How do we address perceptron limitations?
Add a further layer to make a multi-layer perceptron (MLP):
Input layer
Hidden Layer
Output layer
Two stages of training
Feed forward
Backpropagation
Why use the sigmoid as the activation function?
Smoother response than the threshold function
Steepness of the curve is controlled by a parameter z
Its derivative is easily computed from the function's own output
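A sketch of the sigmoid with steepness parameter z; note how the derivative reuses the forward value:

```python
import numpy as np

def sigmoid(x, z=1.0):
    return 1.0 / (1.0 + np.exp(-z * x))

def sigmoid_deriv(x, z=1.0):
    s = sigmoid(x, z)
    return z * s * (1.0 - s)   # d/dx sigmoid(z*x) = z * s * (1 - s)
```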
What are weights?
Variable strength connections between units
Propagate signals from one unit to the next
Main learning component: weights are the main thing changed during learning.
What is feedforward?
Initialise weights and thresholds to small random values
Present input and desired output
Calculate the actual output by propagating the input forward through each layer
What is backpropagation?
Adapting the weights
Start from the output layer and work backwards
New weight = old weight + learning rate * error signal for pattern p at unit j * output of the sending unit i, i.e. w_ji(t+1) = w_ji(t) + eta * delta_pj * o_pi
How do we compute the error (delta) for different units?
For output units:
delta = sigmoid derivative * (target output - actual output)
For hidden units:
delta = sigmoid derivative * the weighted sum of the deltas of the k units in the layer above
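A self-contained sketch of a one-hidden-layer MLP trained with these rules on XOR (batch updates; the layer sizes, learning rate, and seed are arbitrary illustrative choices):

```python
import numpy as np

rng = np.random.default_rng(0)
sigmoid = lambda x: 1.0 / (1.0 + np.exp(-x))

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
T = np.array([[0], [1], [1], [0]], dtype=float)       # XOR targets

W1, b1 = rng.uniform(-0.5, 0.5, (2, 3)), np.zeros(3)  # input -> hidden
W2, b2 = rng.uniform(-0.5, 0.5, (3, 1)), np.zeros(1)  # hidden -> output
eta = 0.5

for _ in range(10000):
    # Feedforward
    h = sigmoid(X @ W1 + b1)                 # hidden outputs
    y = sigmoid(h @ W2 + b2)                 # network outputs
    # Backpropagation, output layer first
    d_out = y * (1 - y) * (T - y)            # sigmoid' * (target - actual)
    d_hid = h * (1 - h) * (d_out @ W2.T)     # sigmoid' * weighted deltas above
    # Weight updates: eta * delta_j * o_i
    W2 += eta * h.T @ d_out;  b2 += eta * d_out.sum(axis=0)
    W1 += eta * X.T @ d_hid;  b1 += eta * d_hid.sum(axis=0)

print(np.round(y.ravel(), 2))                # typically approaches [0, 1, 1, 0]
```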
Two types of weight updating
Batch updating (faster for training)
All patterns are presented, errors are calculated, then the weights are updated
Online updating
The weights are updated after the presentation of each pattern.
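A sketch contrasting the two regimes, assuming a hypothetical helper grad(w, x, t) that returns the weight change for one pattern:

```python
def batch_epoch(w, patterns, eta, grad):
    # Present all patterns, accumulate the changes, then update once
    total = sum(grad(w, x, t) for x, t in patterns)
    return w + eta * total

def online_epoch(w, patterns, eta, grad):
    # Update the weights after each pattern is presented
    for x, t in patterns:
        w = w + eta * grad(w, x, t)
    return w
```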
What is momentum?
Addition to the weight update function
Encourages the network to make large weight changes if the current weight changes are large
Helps the network avoid local minima in the early stages, as momentum can carry it over small hills in the error surface
New weight change = standard weight update + alpha * (w(t) - w(t-1)), where alpha is the momentum coefficient
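A sketch of one update step with momentum; the caller keeps the previous weight change between steps (names illustrative):

```python
def momentum_step(w, base_update, prev_dw, alpha=0.9):
    # dw = standard weight update + alpha * (w(t) - w(t-1))
    dw = base_update + alpha * prev_dw
    return w + dw, dw   # new weights, plus the change to carry forward
```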
NN properties
Able to learn to relate input variables to a required output, e.g. input car attributes and predict fuel consumption
Is able to generalise between samples
Shows graceful degradation
Classification vs regression
Classification: the function learns to output a class (discrete)
Regression: the function learns to output a value (continuous)
Graceful degradation
In symbolic systems, the removal of one component usually results in total failure
Removing neurons from a NN will not cause failure, though it may reduce performance
This replicates our understanding of fault tolerance in the brain.
What is generalisation like in symbolic AI?
Symbolic systems are programmed rather than learnt, requiring explicit knowledge even when it is not available
They can operate as expert systems in constrained environments, but quickly fail outside them
General-purpose AI systems such as CYC are incredibly difficult to build.
Generalisation in NNs
A NN can learn common patterns in data
Can learn the distinctions between different classes of output
What makes A? What makes B?
Allows them to be noise tolerant
How would you program a computer to recognise characters symbolically?
Why is generalisation useful?
Infer properties of new objects/events
Recognise objects despite changes in orientation
How do we do classification with NNs?
A training set and a test set are needed
The test set is used to test generalisation
The training set consists of measurements plus a class label and is used for learning (a split sketch follows below).
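A simple train/test split sketch: the test set is held out to measure generalisation and is never used for weight updates (function name and fraction are illustrative):

```python
import numpy as np

def train_test_split(X, y, test_fraction=0.25, seed=0):
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(X))          # shuffle before splitting
    n_test = int(len(X) * test_fraction)
    test, train = idx[:n_test], idx[n_test:]
    return X[train], y[train], X[test], y[test]
```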
Data representation issues
Continuous data
Good for NNs
Requires normalisation for some activation functions (see the sketch after this card)
Integer type
Can be entered into a single input unit if ordinal, but is often better handled with the discrete-category approach below
Discrete categories
Each value gets a separate representation in the network, to avoid an implied ordering (which acts as noise that confuses the NN)
Two representations: field type and thermometer type
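A minimal min-max normalisation sketch for continuous inputs, mapping each feature into [0, 1] to suit sigmoid-style activations (assumes every column actually varies):

```python
import numpy as np

def normalise(X):
    lo, hi = X.min(axis=0), X.max(axis=0)
    return (X - lo) / (hi - lo)   # per-feature min-max scaling into [0, 1]
```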
Discrete representation types in NNs
Field type
Thermometer type
What is field vs thermometer type representation?
Field type: each category is represented by a single active unit
Hatchback 1,0,0
Saloon 0,1,0
Estate 0,0,1
Thermometer type: units activate cumulatively with the category's position
Hatchback 1,0,0
Saloon 1,1,0
Estate 1,1,1
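A sketch of both encodings (the category list and function names are illustrative):

```python
import numpy as np

CATEGORIES = ["hatchback", "saloon", "estate"]

def field_encode(value):
    i = CATEGORIES.index(value)
    v = np.zeros(len(CATEGORIES))
    v[i] = 1                       # single unit on (one-hot)
    return v

def thermometer_encode(value):
    i = CATEGORIES.index(value)
    v = np.zeros(len(CATEGORIES))
    v[:i + 1] = 1                  # all units up to the category's position on
    return v

print(field_encode("saloon"))        # [0. 1. 0.]
print(thermometer_encode("saloon"))  # [1. 1. 0.]
```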