Object detection Flashcards

Question 1

Q

Whats the problem with moving a sliding window over an entire image and use a classification network for object detection?

Answer

A

To computational expensiv to use a classifier on each pixel (and scale…)

Question 2

Q

What is selective search?

Answer

A

We over-segment an image , and than combine regions with similar features to create region proposals

Question 3

Q

What is the R_CNN method?

Answer

A

Uses selective search for region proposal, SVM for classification and linear regression for localization. Both use CNN features.

Question 4

Q

What is the fast r_CNN method

Answer

A

Uses selective search for proposal, CNN for localization and classification

Question 5

Q

What is a RoI pooling layer?

Answer

A

Converts convo feature maps into a fixed size. Used because region proposals can be of arbitary size.

Question 6

Q

What loss does the fast R_CNN method use?

Answer

A

It uses cross entropy for classification and L1 or L2 for localization.

Question 7

Q

How does the fast R_CNN predict location?

Answer

A

The location is predicted as offsets from the original region proposal

Question 8

Q

What is the smooth L1 loss

Answer

A

It is quadratic (0.5x^2) for small values and linear (abs(x) - 0.5) for values above 1.

Question 9

Q

How are the training batches for fast R_CNN defined?

Answer

A

2 images, 64 region proposals pr. image, 25% foreground, 75% background.

Question 10

Q

What is the main difference between fast R_CNN and faster R_CNN?

Answer

A

Faster R_CNN uses a a proposal CNN. The proposal and and detection networks share feature maps.

Question 11

Q

How does the Region Proposal Network work in faster R_CNN?

Answer

A

It uses a 3x3 sliding window On the convo feature map and proposes a number of bounding boxes with different scales, aspects and anchors.

Question 12

Q

What is Mask R_CNN

Answer

A

It performs segmantion on the faster_RCN objects.

Object detection Flashcards

(12 cards)