RNN Flashcards
1
Q
When are RNNs used?
A
For sequential data such as text, speech, video, and time series.
RNNs have memory, i.e., they can remember past inputs, which is why they work well on sequential data.
2
Q
Why RNN over ANN?
A
- The length of the input may vary for sequential data, whereas an ANN requires a fixed input length.
- An ANN would be computationally expensive for sequential data such as text, where the number of words can be very large.
- At test time, sequences may be longer than any seen during training.
- An ANN ignores the order of the input, and hence the semantic meaning of text data.
3
Q
Zero Padding in RNN
A
Pad shorter sequences with zeros up to the maximum sequence length, so that all inputs in a batch have the same shape.
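The padding above can be sketched in plain NumPy (the sequences and lengths here are made-up examples, not from the cards):

```python
import numpy as np

# Hypothetical variable-length sequences of token IDs (illustrative values)
sequences = [[5, 2, 9], [3, 1], [7, 4, 8, 6]]

max_len = max(len(s) for s in sequences)   # longest sequence: 4
padded = np.zeros((len(sequences), max_len), dtype=int)
for i, seq in enumerate(sequences):
    padded[i, :len(seq)] = seq             # left-align tokens, pad the right with zeros

print(padded)
# [[5 2 9 0]
#  [3 1 0 0]
#  [7 4 8 6]]
```

In practice a helper such as Keras's `pad_sequences` does the same job, but the idea is just this: zeros fill the gap so a batch becomes one rectangular array.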
4
Q
Types of RNNs
A
- Many to one - Sentiment Analysis
- One to many - Image Captioning
- Many to many (synchronous) - NER, POS tagging
- Many to many (asynchronous) - Machine translation, Text summarization, Chatbot, Q/A, Speech to text
5
Q
What does the input to an RNN look like?
A
Input shape = (timesteps, no. of features); with batching, (batch size, timesteps, no. of features).
6
Q
Why is an RNN called "recurrent"?
A
Because the same hidden layer (the same set of weights) is applied again and again, once per timestep; the computation recurs over the sequence.
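The recurrence can be shown in a minimal NumPy forward pass (sizes and random weights are illustrative assumptions, not from the cards); note the single pair of weight matrices reused at every step, and the `(timesteps, no. of features)` input shape from the previous card:

```python
import numpy as np

rng = np.random.default_rng(0)
timesteps, n_features, hidden = 4, 3, 5        # illustrative sizes

x = rng.normal(size=(timesteps, n_features))   # one input sequence
W_xh = rng.normal(size=(n_features, hidden))   # input-to-hidden weights
W_hh = rng.normal(size=(hidden, hidden))       # hidden-to-hidden weights
b = np.zeros(hidden)

h = np.zeros(hidden)                           # initial hidden state
for t in range(timesteps):                     # the SAME weights are reused each step
    h = np.tanh(x[t] @ W_xh + h @ W_hh + b)    # the recurrent update

print(h.shape)                                 # final hidden state, shape (5,)
```

The hidden state `h` carries information from earlier timesteps forward, which is the "memory" mentioned in card 1.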
7
Q
Problems with RNNs
A
- Long-term dependencies (vanishing gradient problem) - As the number of timesteps increases, the partial-derivative term in the weight update becomes a product of many factors; with tanh, each derivative factor lies in (0, 1], so the overall gradient shrinks toward zero and early timesteps stop influencing learning.
- Unstable training (exploding gradient problem) - The same product can instead blow up when the factors are greater than 1, e.g., when ReLU is used as the activation.
Exploding gradients can be mitigated by gradient clipping or a smaller learning rate.
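Gradient clipping by norm, the mitigation named above, can be sketched as follows (the function name, threshold, and example gradient are illustrative assumptions):

```python
import numpy as np

def clip_by_norm(grad, max_norm=1.0):
    """Rescale a gradient if its L2 norm exceeds max_norm (norm-based clipping)."""
    norm = np.linalg.norm(grad)
    if norm > max_norm:
        grad = grad * (max_norm / norm)    # shrink, preserving direction
    return grad

g = np.array([3.0, 4.0])                   # L2 norm = 5.0, larger than the threshold
clipped = clip_by_norm(g, max_norm=1.0)
print(np.linalg.norm(clipped))             # ~1.0 after clipping
```

Clipping caps how large a single update can be, so an exploding gradient cannot throw the weights far off; it does not help with vanishing gradients.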