lesson_11_flashcards
What is structured representation in deep learning?
Explicitly representing relationships between elements such as words, pixels, or graph nodes, in order to model compositional structure across domains like language and vision.
What is a scene graph?
A graph-based representation where nodes are objects or object parts, and edges represent relationships like spatial arrangements or actions.
What are recurrent neural networks (RNNs)?
Neural networks designed for sequential data, maintaining a state vector that represents past inputs while processing sequences of arbitrary length.
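A minimal sketch of the state update behind this card, using toy weights and sizes (all values here are illustrative assumptions, not from any specific model): the state h_t = tanh(W_x x_t + W_h h_{t-1} + b) has a fixed size, yet the loop can consume a sequence of any length.

```python
import math

def rnn_step(x, h, W_x, W_h, b):
    """One recurrent update: combine current input x with previous state h."""
    size = len(h)
    return [
        math.tanh(
            sum(W_x[i][j] * x[j] for j in range(len(x)))
            + sum(W_h[i][j] * h[j] for j in range(size))
            + b[i]
        )
        for i in range(size)
    ]

# Toy parameters: 1-dim input, 2-dim state (hypothetical values).
W_x = [[0.5], [-0.3]]           # input -> state weights
W_h = [[0.1, 0.2], [0.0, 0.4]]  # state -> state weights
b = [0.0, 0.1]

h = [0.0, 0.0]                  # initial state
for x_t in ([1.0], [0.5], [-1.0]):
    h = rnn_step(x_t, h, W_x, W_h, b)
print(len(h))  # state stays 2-dimensional regardless of sequence length
```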
What is the vanishing gradient problem in RNNs?
Gradients become too small during backpropagation through time, making it difficult to learn long-term dependencies.
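A toy illustration of the mechanism on this card: backpropagation through time multiplies the gradient by a recurrent Jacobian factor at every step, so a per-step magnitude below 1 (the 0.5 here is an illustrative assumption) shrinks the long-range signal geometrically.

```python
# Backprop through time repeatedly multiplies by the recurrent Jacobian.
jacobian_factor = 0.5   # illustrative per-step derivative magnitude (< 1)

grad = 1.0
grads = []
for step in range(20):
    grad *= jacobian_factor
    grads.append(grad)

print(grads[0])   # 0.5 after one step
print(grads[-1])  # ~9.5e-07 after 20 steps: the long-range gradient has vanished
```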
What is attention in deep learning?
A mechanism that dynamically focuses on the relevant parts of the input, weighting elements by similarity scores to build better feature representations.
What is the softmax function’s role in attention?
It converts similarity scores into probabilities, enabling weighted summations for attention mechanisms.
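A small sketch of this card with made-up scores and values: softmax turns raw similarity scores into a probability distribution, which then weights a sum over value vectors.

```python
import math

def softmax(scores):
    m = max(scores)                       # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

scores = [2.0, 1.0, 0.1]                  # toy query-key similarity scores
weights = softmax(scores)                 # probabilities: sum to 1
values = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Attention output: weighted sum of the value vectors.
output = [sum(w * v[d] for w, v in zip(weights, values)) for d in range(2)]
print(round(sum(weights), 6))  # 1.0 — the weights form a probability distribution
```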
What are transformer architectures?
Models that use attention-based mechanisms, including multi-head attention, to process sequences or unordered sets efficiently.
What is a non-local neural network?
A network that dynamically learns connectivity patterns between data points using attention mechanisms, generalizing beyond local receptive fields.
How are graph neural networks (GNNs) structured?
Nodes represent entities with feature vectors, and edges represent relationships, enabling propagation of information across the graph.
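A sketch of one round of information propagation on a three-node toy graph, using mean aggregation (one common GNN update rule; the graph, features, and `self_weight` mixing factor are illustrative assumptions): each node averages its neighbors' feature vectors and mixes the result with its own.

```python
features = {             # node -> feature vector (toy values)
    "a": [1.0, 0.0],
    "b": [0.0, 1.0],
    "c": [1.0, 1.0],
}
edges = {"a": ["b", "c"], "b": ["a"], "c": ["a"]}  # adjacency lists

def propagate(features, edges, self_weight=0.5):
    """One message-passing round: mean-aggregate neighbors, mix with self."""
    updated = {}
    for node, feats in features.items():
        nbrs = edges[node]
        agg = [sum(features[n][d] for n in nbrs) / len(nbrs)
               for d in range(len(feats))]
        updated[node] = [self_weight * f + (1 - self_weight) * a
                         for f, a in zip(feats, agg)]
    return updated

features = propagate(features, edges)
print(features["a"])  # [0.75, 0.5] — "a" now carries information from "b" and "c"
```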
What is the role of embeddings in GNNs?
They represent nodes or elements as vectors, incorporating local and neighborhood features through attention mechanisms.
What is a sequence-to-sequence (seq2seq) task?
A task where a sequence of inputs is mapped to a sequence of outputs, such as machine translation or speech recognition.
What is the benefit of multi-head attention in transformers?
It allows the model to focus on different aspects of the data simultaneously, improving representation learning.
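A sketch of the idea on this card with toy 4-dim token vectors (all values assumed for illustration): the input is split into two 2-dim heads, each head runs its own scaled dot-product attention so it can attend to a different aspect, and the head outputs are concatenated back to the input width.

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def head_attention(queries, keys, values):
    """Scaled dot-product attention for one head."""
    outs = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(len(q))
                  for k in keys]
        w = softmax(scores)
        outs.append([sum(wi * v[d] for wi, v in zip(w, values))
                     for d in range(len(values[0]))])
    return outs

# Toy 4-dim tokens; head 1 sees dims 0-1, head 2 sees dims 2-3.
tokens = [[1.0, 0.0, 0.0, 1.0], [0.0, 1.0, 1.0, 0.0]]
h1 = [t[:2] for t in tokens]
h2 = [t[2:] for t in tokens]

out1 = head_attention(h1, h1, h1)            # each head attends independently...
out2 = head_attention(h2, h2, h2)
multi = [a + b for a, b in zip(out1, out2)]  # ...then outputs are concatenated
print(len(multi[0]))  # 4 — same width as the input
```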
What is an example of a many-to-many task in sequential modeling?
Speech recognition, where an input sequence of sound waves is mapped to an output sequence of words.
What is the application of scene graphs in computer vision?
Scene graphs can describe spatial relationships in images, aiding tasks like object detection, relationship modeling, and image captioning.
How does attention enhance graph representations?
By weighting neighbors dynamically, attention refines node features, enabling context-aware embeddings.