Lecture 4 Lecture Notes Flashcards
How do McCulloch and Pitts neurons function?
They sum the firing of incoming neurons multiplied by synapse weights and fire if the sum exceeds a threshold.
What does a perceptron consist of?
A layer of sensory (input) neurons connected by weighted synapses to motor (output) neurons.
What is the main learning rule for perceptrons?
Adjust weights based on the difference between actual and desired outputs.
What is the significance of the bias unit in a perceptron?
It allows the perceptron to create any dividing line needed for classification.
What boolean functions can perceptrons learn?
- AND
- OR
- NAND
- NOR
What is a limitation of perceptrons?
They can only learn linearly separable boolean functions.
Which boolean function cannot be learned by a simple perceptron?
XOR.
What is the equation used in perceptrons for output calculation?
o_i = step(Σ_j W_ij x_j)
What is a common value for the learning rate in perceptrons?
0.1.
What happens if the output neuron incorrectly produces a 1?
Decrease the weights on its active inputs.
What happens if the output neuron incorrectly produces a 0?
Increase the weights on its active inputs.
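The perceptron rule on these cards (increase weights on a false 0, decrease on a false 1, both folded into w += α(y − o)x) can be sketched in a few lines of Python. The AND data, learning rate, and epoch count here are illustrative choices, not from the notes.

```python
import random

# Minimal perceptron sketch; the bias unit is a constant-1 input.
random.seed(0)

def step(u):
    return 1 if u > 0 else 0

def train_perceptron(samples, alpha=0.1, epochs=100):
    weights = [random.uniform(-0.5, 0.5) for _ in range(3)]  # bias + 2 inputs
    for _ in range(epochs):
        for inputs, target in samples:
            x = [1] + list(inputs)  # prepend the bias unit
            output = step(sum(w * xi for w, xi in zip(weights, x)))
            # error > 0: a false 0, so increase weights on active inputs;
            # error < 0: a false 1, so decrease them
            error = target - output
            weights = [w + alpha * error * xi for w, xi in zip(weights, x)]
    return weights

and_data = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
w = train_perceptron(and_data)
```

Swapping in the XOR truth table leaves the final weights misclassifying at least one input no matter how many epochs are run, which is the linear-separability limitation described below.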
For the XOR function, what are the desired outputs for the inputs (0,1) and (1,0)?
1’s
For the XOR function, what are the desired outputs for the inputs (0,0) and (1,1)?
0’s
What type of functions can perceptrons learn?
Boolean functions which are linearly separable
For a 2-variable input, how can the 1’s and 0’s be divided?
With a straight line
For a 3-variable input, how can the 1’s and 0’s be divided?
With a plane
What did Minsky and Papert predict about advanced forms of perceptrons?
They were unlikely to escape the problem of linear separability
What was the impact of Minsky’s reputation on the field of neural networks?
It wiped out the entire field for over a decade
Can multilayer neural networks learn functions beyond linear separable ones?
Yes; with enough hidden units they can approximate any continuous function
What is the perceptron considered in terms of classification?
A binary classification algorithm
What does a neural network do beyond classification?
Learns a continuous function from one multidimensional space to another
What are the two generalizations made to the single-layer perceptron?
- Change the step function to a differentiable function
- Define a formal learning algorithm in terms of gradient descent
Why must the initial weights of a multilayer neural network be small random values?
If all weights are 0, the network cannot learn
What is the delta rule used for in neural networks?
Modifying weights based on their contribution to the final outcome
What is the sigmoid function defined as?
s(u) = 1 / (1 + e^(-bu))
What does the parameter ‘b’ affect in the sigmoid function?
The slope of the curve
What is the derivative of the sigmoid function?
s’(u) = b·s(u)(1 - s(u)), which reduces to s(u)(1 - s(u)) when b = 1
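The sigmoid and its derivative can be checked numerically; note that with the slope parameter b included, the derivative picks up a factor of b (the familiar s(u)(1 − s(u)) form is the b = 1 case). The test value 0.3 is arbitrary.

```python
import math

# Sigmoid with slope parameter b, as defined in the notes.
def sigmoid(u, b=1.0):
    return 1.0 / (1.0 + math.exp(-b * u))

def sigmoid_deriv(u, b=1.0):
    s = sigmoid(u, b)
    return b * s * (1.0 - s)  # for b = 1 this is s(u)(1 - s(u))

# Compare against a central-difference numerical derivative.
eps = 1e-6
numeric = (sigmoid(0.3 + eps) - sigmoid(0.3 - eps)) / (2 * eps)
```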
What do multilayer neural networks allow for in terms of function learning?
Learning any continuous, differentiable function
What is the role of the hidden layer in a multilayer neural network?
It allows for more complex function learning
How is the output of a neuron in a multilayer network defined?
o_i = s(Σ_j W_ij h_j)
What is the purpose of the backpropagation learning rule?
To adjust weights based on the error in output
What does the error metric E represent?
E = 1/2 Σ(y_i - o_i)²
What happens if you increase the number of neurons in the hidden layer too much?
The network may overfit and not generalize well
What is the goal of training a neural network?
To minimize the error metric
True or False: Backpropagation guarantees convergence for all input cases.
False
What effect does lowering the learning rate have on a neural network?
It takes longer to learn but increases the chances of finding the global optimum
What is the overall procedure for training a backpropagation neural network?
- Pick an input/expected output vector pair
- Present the input to the network
- Read the network’s output
- Modify each weight using the delta learning rule
- Repeat
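The steps above can be sketched as a plain-Python training loop on XOR. The layer sizes, learning rate, and epoch count are illustrative, and (as the notes warn) convergence to zero error is not guaranteed, so the sketch only checks that the error metric decreases.

```python
import math
import random

# Two-layer sigmoid network trained by backpropagation (sketch).
random.seed(0)

def s(u):
    return 1.0 / (1.0 + math.exp(-u))

n_in, n_hid, n_out, alpha = 2, 3, 1, 0.5
V = [[random.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hid)]
W = [[random.uniform(-0.5, 0.5) for _ in range(n_hid)] for _ in range(n_out)]

def forward(x):
    h = [s(sum(V[j][k] * x[k] for k in range(n_in))) for j in range(n_hid)]
    o = [s(sum(W[i][j] * h[j] for j in range(n_hid))) for i in range(n_out)]
    return h, o

def error(data):  # E = 1/2 Σ (y_i - o_i)^2, summed over all patterns
    return sum(0.5 * sum((y[i] - forward(x)[1][i]) ** 2 for i in range(n_out))
               for x, y in data)

xor = [([0, 0], [0]), ([0, 1], [1]), ([1, 0], [1]), ([1, 1], [0])]
before = error(xor)
for _ in range(2000):
    for x, y in xor:
        h, o = forward(x)
        # Output-layer deltas: (y - o) o (1 - o)
        do = [(y[i] - o[i]) * o[i] * (1 - o[i]) for i in range(n_out)]
        # Hidden-layer deltas must use W *before* it is updated.
        dh = [h[j] * (1 - h[j]) * sum(do[i] * W[i][j] for i in range(n_out))
              for j in range(n_hid)]
        for i in range(n_out):
            for j in range(n_hid):
                W[i][j] += alpha * do[i] * h[j]
        for j in range(n_hid):
            for k in range(n_in):
                V[j][k] += alpha * dh[j] * x[k]
```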
What is the significance of generalization in neural networks?
It allows the network to apply learned rules to unseen inputs
What is the relationship between the delta rule and gradient descent?
The delta rule moves weights in the direction that reduces error based on the gradient
What property must the error metric have so that E = 0 means zero error?
It must be of a form, such as the sum of squared errors, that is zero exactly when every output matches its target
What happens when the network is caught in a suboptimum solution?
It may fail to converge for some inputs
What is the backpropagation learning rule used for?
To update weights in a neural network based on the error between predicted and actual outputs
The rule helps in minimizing the error during training by adjusting weights accordingly.
What is the formula for updating the weights ΔW_ij?
ΔW_ij = α (y_i - o_i) o_i (1 - o_i) h_j
Where α is the learning rate, y_i is the target output, o_i is the actual output, and h_j is the output of hidden unit j.
What does ΔV_jk depend on in the backpropagation algorithm?
The value of W_ij
This means that W_ij should not be changed until after ΔV_jk is computed.
How is the weight update ΔW_ij expressed in matrix form?
ΔW = α ((y - o) ⊙ o ⊙ (1 - o)) h^T
Here ⊙ denotes element-wise multiplication, and the final product with h^T is an outer product.
What is the purpose of the stopping criterion δ in the error backpropagation algorithm?
To stop the algorithm when the outputs of the neural network aren’t changing significantly anymore
This criterion helps avoid unnecessary iterations once convergence is achieved.
What is the learning rate α used for in weight updates?
It controls how much the weights are adjusted during each update
Smaller values are typically preferred to ensure stable convergence.
What is a Hopfield network used for?
To simulate associative memory
It allows the retrieval of stored patterns based on partial or noisy inputs.
What does Hebb’s Rule state about synaptic strength?
If neuron A fires and neuron B fires in response, the strength of the synapse between them increases
This rule is foundational to understanding learning mechanisms in neural networks.
What is the difference between Hebb’s Rule and the Delta rule?
Hebb’s Rule updates weights based on correlation between nodes, while the Delta rule updates based on the error of the output node
This reflects different learning strategies in neural networks.
What happens if weights in a Hopfield network are set to learn multiple patterns?
The weights are set to the sum of correlations for all items to be stored
This allows the network to retrieve the closest memorized vector when presented with an input.
What is the capacity of a Hopfield network with N neurons?
Approximately 0.138N vector patterns
Exceeding this limit leads to degradation in performance.
What is the first step in the error backpropagation algorithm?
Initialize the matrices V and W with small random values centered at 0
Proper initialization is critical for effective training.
Fill in the blank: The output of a two-layer neural network is computed using the formula _______.
o = s(W h)
Where h is the output vector of the hidden layer and W is the weight matrix.
True or False: The Hopfield network is a fully-connected, feedforward neural network.
False
Hopfield networks are recurrent, meaning they allow connections between neurons that can loop back on themselves.
What is the learning rule for weights in a Hopfield network?
W_ij = (1/N) Σ_p x_i^(p) x_j^(p)
This rule sums the correlations of all input patterns to determine weight strength.
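The Hebbian storage rule and the recall process can be sketched with ±1 units. The two stored patterns here are illustrative (and orthogonal, so recall is clean); the recall loop sweeps the neurons in index order, which is one common update schedule.

```python
# Hopfield network sketch: Hebbian storage and recall with +/-1 units.
N = 8
patterns = [[1, -1, 1, -1, 1, -1, 1, -1],
            [1, 1, 1, 1, -1, -1, -1, -1]]

# W_ij = (1/N) * sum over patterns of x_i x_j; no self-connections.
W = [[0.0] * N for _ in range(N)]
for p in patterns:
    for i in range(N):
        for j in range(N):
            if i != j:
                W[i][j] += p[i] * p[j] / N

def recall(x, sweeps=5):
    # Repeatedly set each neuron to the sign of its weighted input.
    x = list(x)
    for _ in range(sweeps):
        for i in range(N):
            x[i] = 1 if sum(W[i][j] * x[j] for j in range(N)) >= 0 else -1
    return x

# First stored pattern with one bit flipped: recall repairs it.
noisy = [-1, -1, 1, -1, 1, -1, 1, -1]
```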
What happens when a Hopfield network stores more than its capacity?
Degradation occurs, causing similar learned patterns to mix together
This is analogous to how human memory degrades under information overload.
What type of algorithm does a Hopfield network use?
Nearest-neighbor algorithm
What is competitive learning in neural networks?
Neurons compete to categorize inputs based on proximity
The closest neurons to the input become the strongest.
What is the structure of a simple competitive learning network?
Input neurons feed into output neurons, which are self-connected and have inhibitory connections to each other
What does the output of the neurons in a competitive network depend on?
The sum of their inputs weighted by edge weights
What occurs when one output neuron dominates in a competitive network?
It is designated as the winner, while the outputs of other neurons decrease
What is the learning rule for the competitive network?
Weights are adjusted only for the winning output neuron based on the input
Fill in the blank: A neuron that never gains enough strength to win is known as a _______.
Dead unit
What is leaky learning in neural networks?
Updating weights of both winner and loser neurons, albeit to a lesser degree for losers
What is lateral inhibition?
Output neurons inhibit each other to compete for dominance
This concept is derived from neuroscience and helps in pattern recognition.
What is feature mapping in competitive networks?
A type of lateral inhibition that only applies to nearest neighbors of the output neuron
What does a self-organizing map do in neural networks?
Clusters inputs into spatially related categories
What is Kohonen’s algorithm?
A neural network that clusters input data into output without lateral inhibition
What is the initial weight setting in Kohonen’s network?
Small random values
How does Kohonen’s network determine the winning neuron?
By selecting the neuron with the largest output value
What does the neighborhood function L(i, i*) in Kohonen’s algorithm represent?
The proximity of neuron i to the winning neuron i*
It is similar to a bell curve centered at i*.
What happens to the weights of output neurons in Kohonen’s network based on their proximity to the winner?
Weights are modified more for neurons close to the winner and less for those further away
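A sketch of Kohonen's algorithm with a 1-D line of output neurons and a Gaussian neighborhood function L(i, i*) centered on the winner. Here the winner is chosen as the neuron whose weight vector is closest to the input (for normalized weights this matches choosing the largest output); the sizes, learning rate, and σ are all illustrative.

```python
import math
import random

# Self-organizing map sketch: 1-D output line, Gaussian neighborhood.
random.seed(0)
n_out, n_in, alpha, sigma = 10, 2, 0.3, 2.0
W = [[random.random() for _ in range(n_in)] for _ in range(n_out)]

def winner(x):
    # Winning neuron i*: the one whose weight vector is closest to x.
    return min(range(n_out),
               key=lambda i: sum((W[i][k] - x[k]) ** 2 for k in range(n_in)))

def update(x):
    istar = winner(x)
    for i in range(n_out):
        # Neighborhood function: bell curve centered at the winner i*.
        L = math.exp(-((i - istar) ** 2) / (2 * sigma ** 2))
        for k in range(n_in):
            # Neurons near the winner move strongly toward x, distant ones barely.
            W[i][k] += alpha * L * (x[k] - W[i][k])

for _ in range(200):
    update([random.random(), random.random()])
```

Because each step moves a weight by a convex fraction toward an input in [0, 1], the weight vectors stay inside the input range while spreading out over it.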