Sequence Models Flashcards
Notes on Sequence Models that may help with the exam.
What is a one-line summary of Sequential Learning?
One data item is dependent on those that come before or after it, not independently and identically distributed
What are some common applications of Sequence Learning?
Speech/Voice Recognition
Weather forecasting
Language translation
DNA Sequence Analysis
What are the four types of application scenarios in regards to Recurrent Neural Networks?
One-to-one
One-to-many
Many-to-one
Many-to-many
What is a one-line summary of One-to-one in the context of Recurrent Neural Networks?
Classical feed forward neural network with one input and one output e.g. image classification
What is a one-line summary of One-to-many in the context of Recurrent Neural Networks?
Input image and output words with variable length e.g. Image Captioning
What are some common applications of Many-to-one in the context of Recurrent Neural Networks?
Sentiment Classification
Share Price Predictions
What are some common applications of Many-to-many in the context of Recurrent Neural Networks?
Language Translation - Input and output are with variable length
Video Clip Classification - Input and output have the same length
What is the equation for a Basic Recurrent Neural Network unit?
Current state = activation function * (Previous state and the input vector at present time step)
What challenge do Recurrent Neural Networks face?
It suffers from the vanishing/exploding gradient problem
What are Long Short Term Memory Networks?
They are altered variations of Recurrent Neural Networks, which are specifically designed to capture long term dependencies.
What are Long Short Term Memory networks better at doing compared to Recurrent Neural Networks?
They are better at back propagating the gradient much more efficiently than the standard Recurrent Neural Network
What does a Long Short Term Memory consist of in regards to its two main components?
Long term memory cell states, which are non-learnable
Short term memory hidden states, which are learnable.
What connects the two main blocks of a Long Short Term Memory Network, and determines whether information passes through to either side?
Multiple sigmoid functions act as gates that switch on and off the information passing areas.
What three types of gates exist within the Long Short Term Memory Network Unit?
Forget Gate
Input Gate
Output Gate
What does the Forget Gate determine in LSTMs?
The forget gate determines how much long term memory is retained (% amount)