ANN Flashcards
What is the difference between supervised ANNs and unsupervised ones?
Supervised = trained with labelled inputs; weights are adjusted to fit the correct outputs (inductive learning)
Unsupervised = no labelled targets; learning is done iteratively to satisfy some learning rule
What are the types of ANN topologies?
Feed forward: all weights point forwards (no cycles)
Recurrent: weights can also point backwards, providing immediate feedback
Single / multiple hidden layers
Partially / Fully connected: describes connectivity between nodes across layers
What are the different activation functions? Explain them.
Linear: output = c*input
Threshold: if the weighted sum > 0, output 1; else -1
Sigmoid: a continuous (smooth) version of the threshold with bounded output: between 0 and 1 for the logistic sigmoid, between -1 and 1 for the tanh variant
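A minimal Python sketch of these activations (function names are my own; the logistic sigmoid is bounded between 0 and 1, tanh between -1 and 1):

```python
import math

def linear(x, c=1.0):
    # Linear activation: the output is just a scaled copy of the input.
    return c * x

def threshold(weighted_sum):
    # Threshold activation: 1 if the weighted sum is positive, else -1.
    return 1 if weighted_sum > 0 else -1

def logistic_sigmoid(x):
    # Smooth, continuous version of the threshold, bounded between 0 and 1.
    return 1.0 / (1.0 + math.exp(-x))

def tanh(x):
    # Sigmoid-shaped variant bounded between -1 and 1.
    return math.tanh(x)

print(linear(2.0, c=0.5), threshold(-0.3), logistic_sigmoid(0.0), tanh(0.0))
# 1.0 -1 0.5 0.0
```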
When would you use ANNs?
When:
- Input data is high-dimensional and/or continuous
- Data is noisy
- Long training times are acceptable
- Enough labelled training data is available
- The form of the target function is unknown
- Explaining the result is not important
What are the 3 learning rules? Describe them.
- Hebb’s rule: if 2 connected neurons are simultaneously active, then weight(new) = weight(old) + x1*x2
- Perceptron rule: weight(new) = weight(old) + σ(t - o)x
  where t = target output (0 or 1), o = actual output (0 or 1), x = neuron input, σ = learning rate
  *** assumes a threshold activation
- Delta rule: weight(new) = weight(old) + σ(t - o)x
  *** assumes a linear / continuous activation
  *** t and o (outputs) can be any values
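A hedged sketch of the three single-step updates, using the card's symbols (σ = learning rate, t = target, o = actual output); the function names are my own:

```python
def hebb_update(w_old, x1, x2):
    # Hebb's rule: strengthen the weight when both connected neurons fire together.
    return w_old + x1 * x2

def perceptron_update(w_old, x, t, o, sigma=0.1):
    # Perceptron rule: t and o are 0/1 outputs of a threshold activation.
    return w_old + sigma * (t - o) * x

def delta_update(w_old, x, t, o, sigma=0.1):
    # Delta rule: same form as above, but t and o may be any real values
    # (linear / continuous activation).
    return w_old + sigma * (t - o) * x
```

Note that the perceptron and delta updates share the same formula; the difference lies in how o is produced and what values t and o may take.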
What are perceptrons?
Linear classifiers: a hyperplane (a subspace of one dimension lower than the input space) that divides the space, classifying data points
If weighted sum + bias > the boundary, perceptron outputs a 1
Else, 0.
*** outputs can only be 1 or 0/-1
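A minimal sketch of the decision rule (variable names are illustrative):

```python
def perceptron_output(weights, bias, inputs):
    # Fire (output 1) if the weighted sum plus bias crosses the boundary at 0.
    weighted_sum = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1 if weighted_sum > 0 else 0

print(perceptron_output([0.5, -0.5], 0.1, [1.0, 1.0]))  # 1
```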
What is the point of the bias term?
Speeds up learning by shifting the hyperplane away from the origin; a bias term is not always needed
What are some of the properties of perceptrons?
Can classify inputs as 0 or 1 => can simulate the linearly separable logic gates (AND, OR, NOT); XOR needs more than one perceptron
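For example, an AND gate can be simulated with hand-picked weights [1, 1] and bias -1.5 (values are my own illustration):

```python
def perceptron_output(weights, bias, inputs):
    # Threshold perceptron: output 1 if weighted sum + bias > 0, else 0.
    return 1 if sum(w * x for w, x in zip(weights, inputs)) + bias > 0 else 0

for a in (0, 1):
    for b in (0, 1):
        print(a, b, '->', perceptron_output([1.0, 1.0], -1.5, [a, b]))
# Only (1, 1) produces a 1, i.e. logical AND.
```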
How does perceptron learning work?
Using the perceptron learning rule:
For each of the N training examples:
if o ≠ t:
Update the weights according to the learning rule so that the error is reduced
Stop when the error is acceptable or all N examples have been processed
Note: the perceptron rule adapts the weights only
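A sketch of that loop in Python, assuming a threshold activation and treating the bias as a weight updated alongside the others; the dataset and learning rate are illustrative:

```python
def train_perceptron(data, n_inputs, sigma=0.1, epochs=100):
    w = [0.0] * n_inputs
    b = 0.0
    for _ in range(epochs):
        errors = 0
        for x, t in data:
            # Threshold activation: output 1 if weighted sum + bias > 0, else 0.
            o = 1 if sum(wi * xi for wi, xi in zip(w, x)) + b > 0 else 0
            if o != t:
                # Perceptron rule: nudge weights (and bias) towards the target.
                errors += 1
                w = [wi + sigma * (t - o) * xi for wi, xi in zip(w, x)]
                b = b + sigma * (t - o)
        if errors == 0:  # stop once every example is classified correctly
            break
    return w, b

# Learn the (linearly separable) AND function.
and_data = [([0, 0], 0), ([0, 1], 0), ([1, 0], 0), ([1, 1], 1)]
print(train_perceptron(and_data, n_inputs=2))
```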
What is the fundamental basis of perceptron learning?
Error correction learning => adjust weights until o = t
What is the limitation of single perceptrons? What is the solution?
Can only classify linearly separable spaces
Solution: multi-layer perceptrons (MLPs) = interconnected perceptrons
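A sketch of why layering helps: XOR is not linearly separable for a single perceptron, but XOR(a, b) = AND(OR(a, b), NAND(a, b)) can be built from three perceptrons with hand-picked (illustrative) weights:

```python
def step(weights, bias, inputs):
    # Single threshold perceptron.
    return 1 if sum(w * x for w, x in zip(weights, inputs)) + bias > 0 else 0

def xor(a, b):
    h1 = step([1.0, 1.0], -0.5, [a, b])       # OR unit in the hidden layer
    h2 = step([-1.0, -1.0], 1.5, [a, b])      # NAND unit in the hidden layer
    return step([1.0, 1.0], -1.5, [h1, h2])   # AND of the two hidden outputs

print([xor(a, b) for a, b in ((0, 0), (0, 1), (1, 0), (1, 1))])  # [0, 1, 1, 0]
```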
When should you use the delta learning rule?
When the range of output values we want to produce is continuous, which requires continuous activation functions
*** t and o do not have to be 1 or 0
How does delta learning work?
In the same way as perceptron learning, except that o comes from a continuous activation rather than a threshold
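A sketch of delta-rule training with a linear activation on a continuous target (data, learning rate, and epoch count are illustrative):

```python
def train_delta(data, n_inputs, sigma=0.05, epochs=200):
    w = [0.0] * n_inputs
    b = 0.0
    for _ in range(epochs):
        for x, t in data:
            # Linear activation: o can be any real value.
            o = sum(wi * xi for wi, xi in zip(w, x)) + b
            # Delta rule: same update form as the perceptron rule.
            w = [wi + sigma * (t - o) * xi for wi, xi in zip(w, x)]
            b = b + sigma * (t - o)
    return w, b

# Illustrative data drawn from t = 2x + 1; the learned weight and bias approach 2 and 1.
data = [([x], 2 * x + 1) for x in (0.0, 0.5, 1.0, 1.5, 2.0)]
print(train_delta(data, n_inputs=1))
```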
DLR vs PLR - differences and similarities
Same: both learn through error correction
Different:
PLR can only work with threshold activation functions (outputs 0 or 1)
DLR can work with any differentiable activation function
What is a universal function approximator?
An ANN with (1) at least one hidden layer, (2) enough nodes, and (3) a continuous non-linear activation function can approximate any continuous function
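A structural sketch of such a network: one hidden layer of sigmoid units feeding a linear output. The weights here are random and purely illustrative; the claim is that for a suitable number of hidden nodes and suitable weights, such a network can get arbitrarily close to any continuous function on a bounded domain:

```python
import math
import random

def mlp_forward(x, hidden_w, hidden_b, out_w, out_b):
    # One hidden layer of sigmoid units...
    hidden = [1.0 / (1.0 + math.exp(-(w * x + b)))
              for w, b in zip(hidden_w, hidden_b)]
    # ...combined by a single linear output unit.
    return sum(w * h for w, h in zip(out_w, hidden)) + out_b

random.seed(0)
n_hidden = 8
hw = [random.uniform(-2, 2) for _ in range(n_hidden)]
hb = [random.uniform(-2, 2) for _ in range(n_hidden)]
ow = [random.uniform(-2, 2) for _ in range(n_hidden)]
print(mlp_forward(0.5, hw, hb, ow, 0.0))
```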