Lecture 2 Flashcards
Neural networks, connectionism, parallel distributed processing
Based on abstract view of the neuron
The connections determine the function of the network
Connections can be formed by learning and do not need to be programmed
Logic gates
Computers have electronic elements that implement 'logic gates', and with these you can build and run programs
McCulloch-Pitts neuron - assumptions
- The activity of the neuron is an 'all or none' process
a. Activation is either 0 or 1
- A certain fixed number of synapses must be excited within the period of latent addition in order to excite a neuron at any time
- The only significant delay within the nervous system is a synaptic delay
- The activity of any inhibitory synapse absolutely prevents excitation of the neuron at any time
a. This is no longer assumed in current networks; inhibition is now weighted
- The structure of the net does not change with time
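A minimal sketch (not from the lecture) of a unit that follows these assumptions: all-or-none output, a fixed excitation threshold, and absolute inhibition. The threshold of 2 is an illustrative choice that turns the unit into an AND gate, tying in with the logic gates card above.

```python
def mcculloch_pitts(excitatory, inhibitory, threshold):
    # any active inhibitory synapse absolutely prevents excitation
    if any(inhibitory):
        return 0
    # all-or-none output: fire only if enough excitatory synapses are active
    return 1 if sum(excitatory) >= threshold else 0

# with threshold 2 and two excitatory inputs the unit behaves as an AND gate
for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2, "->", mcculloch_pitts([x1, x2], inhibitory=[], threshold=2))
```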
McCulloch-Pitts neuron - 1 major thing they didn't take into account
Real neurons are very noisy
So computation in the brain is fault tolerant, which a standard computational model is not (there, a fault would simply produce an error)
i.e., the brain does not work like a Turing machine
so neural networks abstract strongly from the details of real neurons
neural networks abstract strongly from the details of real neurons
in what way do they differ
- Conductivity delays are neglected
- An output signal is discrete or a real-valued number
- Net input is calculated as the weighted sum of the input signals
- Net input is transformed into an output signal via a simple function
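A minimal sketch (my own, with made-up input signals and weights) of the abstract neuron these points describe: the net input is the weighted sum of the input signals, and a simple squashing function turns it into a real-valued output between 0 and 1.

```python
import math

def net_input(signals, weights):
    # net input = weighted sum of the input signals
    return sum(s * w for s, w in zip(signals, weights))

def sigmoid(net):
    # simple function turning net input into a real-valued output in (0, 1)
    return 1.0 / (1.0 + math.exp(-net))

signals = [0.5, 1.0, 0.0]       # made-up input signals
weights = [0.8, -0.4, 0.3]      # made-up connection weights
print(sigmoid(net_input(signals, weights)))   # output of the unit
```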
Error-correcting learning
form of supervised learning
Perceptron
Original perceptron had only 2 layers (input and output layer)
Limitations of the perceptron
- Only binary values
> remedied by the delta-rule
- Only 2 layers
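A minimal sketch of the delta-rule mentioned above, assuming its common Widrow-Hoff form with a linear, real-valued output; the learning rate, input pattern, and target are made up.

```python
def delta_rule_update(weights, inputs, target, lr=0.1):
    # real-valued (linear) output instead of a binary threshold
    output = sum(x * w for x, w in zip(inputs, weights))
    error = target - output
    # delta rule: weight change = learning rate x error x input activation
    return [w + lr * error * x for w, x in zip(weights, inputs)]

weights = [0.0, 0.0]
for _ in range(50):                                   # keep presenting one pattern
    weights = delta_rule_update(weights, [1.0, 0.5], target=0.8)
print(sum(x * w for x, w in zip([1.0, 0.5], weights)))   # close to the target 0.8
```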
Perceptron convergence theorem
If a pattern set can be represented by a two-layer perceptron, the perceptron learning rule will always be able to find some correct weights
o So if it can, it will.
o Does not say anything about how fast. Could be a slow process. But it will find it in the end.
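A minimal sketch of the theorem in action, assuming the classic perceptron learning rule, a made-up learning rate, and logical AND as a pattern set that a two-layer perceptron can represent; the rule keeps correcting mistakes until some set of correct weights is found.

```python
# Two-layer perceptron (inputs -> one binary output node) trained on AND,
# a pattern set it can represent, so the learning rule must find correct weights
patterns = [([0, 0], 0), ([0, 1], 0), ([1, 0], 0), ([1, 1], 1)]
w = [0.0, 0.0]
bias = 0.0
lr = 0.1

for epoch in range(100):                  # plenty for this tiny problem
    mistakes = 0
    for x, target in patterns:
        output = 1 if sum(xi * wi for xi, wi in zip(x, w)) + bias > 0 else 0
        error = target - output           # learn only from mistakes
        if error != 0:
            mistakes += 1
            w = [wi + lr * error * xi for wi, xi in zip(w, x)]
            bias += lr * error
    if mistakes == 0:                     # every pattern correct: weights found
        print("converged after", epoch + 1, "epochs:", w, bias)
        break
```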
needed for error backpropagation
- Algorithm to train perceptrons with more than 2 layers
- Preferably also one that used continuous and nonlinear activation rules
Characteristics of backpropagation
- Any number of layers
- only feedforward, no cycles
- uses continuous nodes
> activation between 0 and 1
- initial weights are random
- total error never increases
> gradient descent in error space
> so it goes down a little bit or stays the same
backprop trick
We have a node h in the hidden layer
We go to the error signal on the output layer that is calculated for each node
o Error = the difference between the target and the spontaneous output
We take all those errors in the output layer, weight each one by the connection from h to that output node, and add them up. This is the error we have for our hidden node
o Can be positive and negative
Not biologically plausible because axons only work in 1 direction.
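A small numeric sketch (all values made up) of the trick: the output-layer errors are passed back through h's outgoing connections and summed to give h's error, which can be positive or negative. The hidden activation x (1 – hidden activation) factor from the algorithm card below is left out here.

```python
# made-up example: hidden node h feeds two output nodes
output_errors = [0.4, -0.25]      # error signals computed at the output layer
weights_from_h = [0.9, 0.5]       # connections from h to those output nodes

# pass each output error back through its connection and add them up
error_at_h = sum(w * e for w, e in zip(weights_from_h, output_errors))
print(error_at_h)                 # 0.4*0.9 + (-0.25)*0.5 = 0.235
```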
Backpropagation algorithm in rules
- weight change = small constant x error x input activation
- for an output node, the error is
o error = (target activation – output activation) x output activation x (1 – output activation)
o you add this to do gradient descent
- for a hidden node, the error is
o error = weighted sum of to-node errors x hidden activation x (1 – hidden activation)
- weight change and momentum
o weight change = small constant x error x input activation + momentum constant x old weight change
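A minimal sketch that puts these rules together, assuming a tiny 2-input, 3-hidden-node, 1-output sigmoid network trained online on XOR; the learning rate, momentum constant, random seed, and epoch count are illustrative choices, and whether it ends up solving XOR depends on the random initial weights. Printing the total error every thousand epochs ties in with the gradient-descent characteristic above.

```python
import math, random

random.seed(1)

def sigmoid(net):
    return 1.0 / (1.0 + math.exp(-net))

n_in, n_hid = 2, 3            # 2 inputs -> 3 hidden nodes -> 1 output node
lr, momentum = 0.5, 0.9       # "small constant" and momentum constant

# initial weights (and biases) are random, as in the characteristics card
w_ih = [[random.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_hid)]
b_h  = [random.uniform(-1, 1) for _ in range(n_hid)]
w_ho = [random.uniform(-1, 1) for _ in range(n_hid)]
b_o  = random.uniform(-1, 1)

# previous weight changes, kept around for the momentum term
dw_ih = [[0.0] * n_in for _ in range(n_hid)]
db_h  = [0.0] * n_hid
dw_ho = [0.0] * n_hid
db_o  = 0.0

patterns = [([0, 0], 0), ([0, 1], 1), ([1, 0], 1), ([1, 1], 0)]   # XOR

def forward(x):
    hidden = [sigmoid(sum(w_ih[h][i] * x[i] for i in range(n_in)) + b_h[h])
              for h in range(n_hid)]
    output = sigmoid(sum(w_ho[h] * hidden[h] for h in range(n_hid)) + b_o)
    return hidden, output

for epoch in range(5000):
    total_error = 0.0
    for x, target in patterns:
        hidden, output = forward(x)
        total_error += (target - output) ** 2

        # output node: error = (target - output) x output x (1 - output)
        delta_o = (target - output) * output * (1 - output)
        # hidden node: error = weighted sum of to-node errors x hidden x (1 - hidden)
        delta_h = [delta_o * w_ho[h] * hidden[h] * (1 - hidden[h])
                   for h in range(n_hid)]

        # weight change = small constant x error x input activation
        #                 + momentum constant x old weight change
        for h in range(n_hid):
            dw_ho[h] = lr * delta_o * hidden[h] + momentum * dw_ho[h]
            w_ho[h] += dw_ho[h]
            db_h[h] = lr * delta_h[h] + momentum * db_h[h]
            b_h[h] += db_h[h]
            for i in range(n_in):
                dw_ih[h][i] = lr * delta_h[h] * x[i] + momentum * dw_ih[h][i]
                w_ih[h][i] += dw_ih[h][i]
        db_o = lr * delta_o + momentum * db_o
        b_o += db_o

    if epoch % 1000 == 0:
        print(f"epoch {epoch}: total error {total_error:.4f}")

for x, target in patterns:
    print(x, "target", target, "output", round(forward(x)[1], 2))
```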
Disadvantages of backprop
- learning is slow
- new learning will rapidly overwrite old representations unless they are interleaved with the new patterns
- this makes it hard to keep networks up to date with new information
- this also makes it very implausible as a psychological model of human memory
Advantages of backprop
- easy to use
a. few parameters
b. algorithm is easy to implement
- can be applied to a wide range of data
- very popular
- paved the way for deep learning