lesson_9_flashcards

1
Q

What is image segmentation?

A

A computer vision task that assigns a class label to every pixel of an image, producing a pixel-level map of the scene.

2
Q

What is the difference between semantic and instance segmentation?

A

Semantic segmentation assigns a class label to each pixel without separating individual objects of the same class, while instance segmentation also gives each object its own ID.

3
Q

What is the purpose of encoder-decoder architectures in image segmentation?

A

Encoders extract abstract features through downsampling, while decoders upsample to restore spatial resolution for pixel-level classification.

4
Q

What are single-stage object detectors?

A

Models like YOLO and SSD that directly predict bounding boxes and class labels for objects in one pass through the network.

5
Q

What are two-stage object detectors?

A

Models like Faster R-CNN that first propose regions of interest (ROIs), then classify each region and refine its bounding box through regression.

6
Q

What is non-maximum suppression (NMS)?

A

A technique to remove redundant bounding boxes: keep the box with the highest confidence score, discard any remaining box whose IoU with it exceeds a threshold, and repeat on the boxes that are left.
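
A minimal NumPy sketch of greedy NMS may help make the procedure concrete. The function name `nms`, the `(x1, y1, x2, y2)` box layout, and the default threshold of 0.5 are assumptions chosen for illustration, not values taken from the lesson.

```python
import numpy as np

def nms(boxes, scores, iou_threshold=0.5):
    """Greedy NMS. boxes: (N, 4) as (x1, y1, x2, y2); returns kept indices."""
    boxes = np.asarray(boxes, dtype=float)
    scores = np.asarray(scores, dtype=float)
    areas = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])
    order = np.argsort(-scores)  # indices sorted by descending score
    keep = []
    while order.size > 0:
        best, rest = order[0], order[1:]
        keep.append(int(best))
        # Intersection of the best box with every remaining box.
        ix1 = np.maximum(boxes[best, 0], boxes[rest, 0])
        iy1 = np.maximum(boxes[best, 1], boxes[rest, 1])
        ix2 = np.minimum(boxes[best, 2], boxes[rest, 2])
        iy2 = np.minimum(boxes[best, 3], boxes[rest, 3])
        inter = np.clip(ix2 - ix1, 0, None) * np.clip(iy2 - iy1, 0, None)
        union = areas[best] + areas[rest] - inter
        iou = inter / np.maximum(union, 1e-12)
        # Keep only boxes that do not overlap the best box too much.
        order = rest[iou <= iou_threshold]
    return keep
```

With boxes `[[0, 0, 10, 10], [1, 1, 11, 11], [20, 20, 30, 30]]` and scores `[0.9, 0.8, 0.7]`, the second box overlaps the first with IoU 81/119 and is suppressed, so the call returns `[0, 2]`.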

7
Q

What is ROI pooling in object detection?

A

A method that converts each region of interest (ROI), whatever its size, into a fixed-size feature map by dividing it into a grid and max-pooling each cell, so it can be fed into fully connected layers for classification and regression.
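
The grid-and-max-pool idea can be sketched on a single-channel feature map. This is a simplified illustration, not the exact layer from any library: `roi_max_pool` is a hypothetical name, the ROI is assumed to be given in integer feature-map coordinates with exclusive upper bounds, and the ROI is assumed to be non-empty.

```python
import numpy as np

def roi_max_pool(feature_map, roi, output_size=(2, 2)):
    """Max-pool one ROI of a 2D feature map into a fixed output grid."""
    x1, y1, x2, y2 = roi  # assumed integer coords, x2 and y2 exclusive
    region = feature_map[y1:y2, x1:x2]
    out_h, out_w = output_size
    h, w = region.shape
    # Bin edges that split the region into out_h x out_w cells.
    ys = np.linspace(0, h, out_h + 1).astype(int)
    xs = np.linspace(0, w, out_w + 1).astype(int)
    out = np.empty(output_size, dtype=feature_map.dtype)
    for i in range(out_h):
        for j in range(out_w):
            # Guard against empty cells when the ROI is smaller than the grid.
            y_end = max(ys[i + 1], ys[i] + 1)
            x_end = max(xs[j + 1], xs[j] + 1)
            out[i, j] = region[ys[i]:y_end, xs[j]:x_end].max()
    return out
```

ROIs of different sizes all come out with the same `output_size`, which is what lets a fixed-size fully connected head consume them.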

8
Q

What are anchor boxes in object detection?

A

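Predefined bounding boxes of different scales and aspect ratios, tiled across the image as reference shapes; the detector predicts class scores and box offsets relative to them to detect objects of various sizes and positions.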
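
As a rough sketch, anchors for one location can be generated from a list of scales and aspect ratios. The name `make_anchors`, the default scales and ratios, and the convention that the ratio means width divided by height are all assumptions for illustration; real detectors differ in these details.

```python
import numpy as np

def make_anchors(cx, cy, scales=(32, 64), ratios=(0.5, 1.0, 2.0)):
    """Return anchors centered at (cx, cy) as rows of (x1, y1, x2, y2)."""
    anchors = []
    for s in scales:
        for r in ratios:
            # Keep the area at s * s while setting width / height = r.
            w = s * np.sqrt(r)
            h = s / np.sqrt(r)
            anchors.append([cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2])
    return np.array(anchors)
```

Calling `make_anchors(100, 100)` yields six boxes (two scales times three ratios) that share a center; repeating this at every feature-map location gives the full anchor set.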

9
Q

What is mean average precision (mAP)?

A

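A performance metric for object detection: average precision (AP) is the area under the precision-recall curve for one class, and mAP is the mean of AP over all classes (some benchmarks also average over several IoU thresholds).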
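
The sketch below computes AP for one class from ranked detections and then averages over classes. It assumes each detection has already been marked as a true or false positive (for example by IoU matching against ground truth), and it uses all-point interpolation of the precision-recall curve; benchmarks vary in the exact interpolation, so treat this as one possible variant. The function names are hypothetical.

```python
import numpy as np

def average_precision(scores, is_true_positive, num_ground_truth):
    """AP for one class: area under the interpolated precision-recall curve."""
    order = np.argsort(-np.asarray(scores, dtype=float))
    tp = np.asarray(is_true_positive, dtype=float)[order]
    cum_tp = np.cumsum(tp)
    cum_fp = np.cumsum(1.0 - tp)
    recall = cum_tp / num_ground_truth
    precision = cum_tp / (cum_tp + cum_fp)
    # Interpolate: precision at recall r is the max precision at recall >= r.
    precision = np.maximum.accumulate(precision[::-1])[::-1]
    # Sum rectangle areas between consecutive recall values.
    recall = np.concatenate(([0.0], recall))
    return float(np.sum((recall[1:] - recall[:-1]) * precision))

def mean_average_precision(ap_per_class):
    """mAP: the plain mean of the per-class AP values."""
    return float(np.mean(ap_per_class))
```

For scores `[0.9, 0.8, 0.7]` with true-positive flags `[1, 0, 1]` and two ground-truth objects, the interpolated precisions are 1, 2/3, 2/3 at recalls 0.5, 0.5, 1.0, giving AP = 0.5 * 1 + 0.5 * 2/3, about 0.833.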

10
Q

What are transposed convolutions?

A

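Learnable upsampling layers, often loosely called deconvolutions, that increase the spatial resolution of feature maps. They map from a convolution's output shape back to its input shape but do not invert the convolution's values.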
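
A one-dimensional sketch shows the mechanics: each input value stamps a scaled copy of the kernel into the output, shifted by the stride, and overlapping stamps are summed. The name `conv_transpose_1d` and the no-padding setup are assumptions for illustration; in a real network the kernel would be learned.

```python
import numpy as np

def conv_transpose_1d(x, kernel, stride=2):
    """1D transposed convolution without padding."""
    x = np.asarray(x, dtype=float)
    kernel = np.asarray(kernel, dtype=float)
    # Output length for no padding: (n - 1) * stride + kernel_size.
    out = np.zeros((len(x) - 1) * stride + len(kernel))
    for i, value in enumerate(x):
        start = i * stride
        # Each input element adds a scaled copy of the kernel.
        out[start:start + len(kernel)] += value * kernel
    return out
```

With input `[1, 2]`, kernel `[1, 1, 1]`, and stride 2, the output is `[1, 1, 3, 2, 2]`: two inputs become five outputs, and the middle position receives contributions from both stamps.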

11
Q

How do single-stage and two-stage detectors compare?

A

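Single-stage detectors are typically faster but somewhat less accurate, while two-stage detectors are typically slower but more precise, especially on small objects. The gap depends on the specific models being compared.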

12
Q

What is Mask R-CNN?

A

An extension of Faster R-CNN that adds a branch for pixel-level segmentation, enabling instance segmentation.

13
Q

What is the role of skip connections in UNet architectures?

A

They transfer high-resolution feature maps from the encoder to the decoder, preserving spatial information for better segmentation.

14
Q

What is intersection over union (IoU)?

A

A metric used in object detection to measure the overlap between predicted and ground-truth bounding boxes: the area of their intersection divided by the area of their union. Values range from 0 (no overlap) to 1 (identical boxes).
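
For axis-aligned boxes the computation is short. The sketch below assumes boxes are given as `(x1, y1, x2, y2)` with `x1 < x2` and `y1 < y2`; that layout and the function name `iou` are illustrative choices.

```python
def iou(box_a, box_b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    # Corners of the intersection rectangle.
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    # Width and height are clamped at zero when the boxes do not overlap.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

For example, `iou((0, 0, 2, 2), (1, 1, 3, 3))` has intersection area 1 and union area 4 + 4 - 1 = 7, so the result is 1/7, about 0.143.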

15
Q

What is the role of feature pyramids in object detection?

A

They improve multi-scale detection by combining features from different layers, enabling detection of small and large objects.
