lesson_9_flashcards

Question 1

Q

What is image segmentation?

Answer

A

A computer vision task where each pixel of an image is classified into categories, providing a detailed understanding of the scene.

Question 2

Q

What is the difference between semantic and instance segmentation?

Answer

A

Semantic segmentation classifies each pixel without distinguishing instances, while instance segmentation assigns unique IDs to different objects.

Question 3

Q

What is the purpose of encoder-decoder architectures in image segmentation?

Answer

A

Encoders extract abstract features through downsampling, while decoders upsample to restore spatial resolution for pixel-level classification.

Question 4

Q

What are single-stage object detectors?

Answer

A

Models like YOLO and SSD that directly predict bounding boxes and class labels for objects in one pass through the network.

Question 5

Q

What are two-stage object detectors?

Answer

A

Models like Faster R-CNN that first propose regions of interest (ROIs) and then refine them through classification and regression.

Question 6

Q

What is non-maximum suppression (NMS)?

Answer

A

A technique to remove redundant bounding boxes by keeping the box with the highest confidence score in overlapping regions.

Question 7

Q

What is ROI pooling in object detection?

Answer

A

A method to resize regions of interest (ROIs) to a fixed size before feeding them into fully connected layers for classification and regression.

Question 8

Q

What are anchor boxes in object detection?

Answer

A

Predefined bounding boxes of different scales and aspect ratios used to detect objects at various sizes and positions.

Question 9

Q

What is mean average precision (mAP)?

Answer

A

A performance metric for object detection that averages precision across all recall levels and categories.

Question 10

Q

What are transposed convolutions?

Answer

A

Also known as deconvolutions, they upsample feature maps by reversing the process of convolution to increase spatial resolution.

Question 11

Q

How do single-stage and two-stage detectors compare?

Answer

A

Single-stage detectors are faster but less accurate, while two-stage detectors are slower but more precise.

Question 12

Q

What is Mask R-CNN?

Answer

A

An extension of Faster R-CNN that adds a branch for pixel-level segmentation, enabling instance segmentation.

Question 13

Q

What is the role of skip connections in UNet architectures?

Answer

A

They transfer high-resolution feature maps from the encoder to the decoder, preserving spatial information for better segmentation.

Question 14

Q

What is intersection over union (IoU)?

Answer

A

A metric used in object detection to measure the overlap between predicted and ground-truth bounding boxes, with values ranging from 0 to 1.

Question 15

Q

What is the role of feature pyramids in object detection?

Answer

A

They improve multi-scale detection by combining features from different layers, enabling detection of small and large objects.

lesson_9_flashcards

(15 cards)