Object detection Flashcards

1
Q

What is object detection?

A

Object detection detects where in the image we have objects using a bounding box and classifies them

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How can we do object detection using sliding windows=

A

Do a classification on every sliding window, resulting in classification as a object or background.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why don’t we usually use the sliding window approach for object detection?

A

Too slow

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does the R-CNN ( Region based CNN) work?

A
  1. Get ROI (Regions of interesest) from a sperate algorithm 2. Forward each region trough a convonet
  2. Use SVM for classification and Bbox regression
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the probem for the R-CNN?

A

Training and inference is slow, training is ad-hoc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the SPP methode for object detection?

A
  1. Send the whole image trough a convonet.
  2. Extract ROI from the result.
  3. Use pooling to reduce the size of the ROI.
  4. Use fully connected to svm for classification and fully connected to BB regression.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the main advantage of SPP(Spatial pyramide pooling) over R-CNN?

A

Makes testing phase faster as we only need to use convo once. Training is still slow and ad-hoc, but faster than R-CNN

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the fast R-CNN algorithm?

A
  1. Use convonet on image.
  2. Do RoI proposals
  3. Do RoI pooling
  4. Use FC + linear for regression and FC + linear + softmax for classification
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What kind of loss does the fast R-CNN use?

A

cross entropy for classification and L1 for regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How does the Fast R-CNN RoI pooling work?

A

Divide the project proposal into 7x7 grid and do max pooling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the main advantage of Fast R-CNN over SPP and R-CNN?

A

It’s fully trainable, with “fast” training and inference time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the main difference between Fast R-CNN and faster R-CNN?

A

Faster R-CNN trains a convo net to do proposal selection. The proposal convo net uses classification (object/not) and BB regression loss

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the mask R-CNN?

A

It adds FCN part to the faster R-Cnn to do semantic segmentation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe the YOLO algorithm

A
  1. Split image into grid (e.g. 7x7)
  2. For each cell make two (or more) boundingboxes and predict p(object)
  3. Each cell also predicts a class probability p(class | object)
  4. Combine the bouding boxes and class prediction
  5. Perform NMS
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How are the bounding boxes trained in Yolo?

A

For each cell find the best bounding box, adjust it and increase confidence. For cells without objects and other boxes reduce confidence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly