Perceptrons and FeedForward ANN's Flashcards
What is a perceptron?
An equation that takes a number of inputs, and gives a single output value
What are the inputs to a perceptron?
The value of the input, multiplied by its weight (how valuable the input is).
What does an activation function do?
It decides whether the output of one perceptron is enough to give as input to the next layer of perceptrons
What are some issues with training a perceptron on some data?
There may be overfitting. This is when the model has perfectly separated the classes, meaning that it cannot be generalised (used on other data - it is useless)
How do we estimate weights?
Initialise weights. Can be all 0, or a random number
Iterate through each sample in the training dataset
- Calculate the predicted output
- Update the weights based on this
What is the relationship between input magnitude and ∆ weight?
The larger the input magnitude, the larger effect it has on ∆ weight
What is learning rate?
It is a measure that controls how fast a network learns
It can be constant or adaptive
What is the issue with the learning rate being too big?
The network may take a long time to reach the global minimum or oscillate around it forever
What is the issue with the learning rate being too small?
The network may become trapped in a local minimum
When do we stop training a network?
When the error falls below an acceptable threshold
When the change in instantaneous error falls below an acceptable threshold
When it has completed a predetermined amount of iterations