Week 8 High-Level Vision Continued Flashcards

1
Q

What is the role of attention mechanisms in vision tasks?

A

Attention mechanisms improve performance on high-level vision tasks by focusing on important features.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does scene graph generation identify?

A

Scene graph generation identifies objects, attributes, and their relationships in images.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are graph neural networks (GNNs) used for?

A

GNNs are commonly used for scene graph tasks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is human pose estimation?

A

Human pose estimation predicts key body points for activity recognition.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are GANs used for in vision?

A

GANs produce realistic images from noise for applications like style transfer and image inpainting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is video captioning?

A

Video captioning generates temporal descriptions of dynamic content.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is emotion recognition?

A

Emotion recognition identifies facial expressions and emotions from images or videos.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How does video scene understanding extend high-level vision?

A

Video scene understanding expands high-level vision to dynamic content.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are multimodal approaches in high-level vision?

A

Multimodal approaches combine images, text, and audio for richer understanding.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the importance of self-supervised learning in high-level vision?

A

Self-supervised learning methods improve segmentation and representation tasks by leveraging unlabeled data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly