Object Detection Flashcards

1
Q

When there are multiple bounding boxes for an object in an image, what is Intersection over Union (IoU) ratio?

A

It is Intersection Area / Union area

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

YOLO - In a NxN grid of an image, if each grid box predicts Output = [Pc, Bx, By, Bh, Bw, C1, C2, C3], what does each value stand for

A

Pc -> Probability the grid element contains an object
Bx, By -> midpoint of the object
Bh, Bw -> Height and Width of the object
C - Class of object

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

If multiple grid elements output bounding boxes for the same object, how do we identify the most appropriate bounding box using Non-Max suppression?

A
  • Discard all object with probability < 0.6
  • Pick box with largest Pc
  • Discard any other box with IoU >= 0.5
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is object locaization?

A

It is determining the location and class of a single object in an image.

It is training and predicting [P,bx, by, bh, bz, c1,c2, c3] where P is probability of presence, b is coordinate, and c is the class of object.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is sliding windows for object detection?

A

Using sliding windows of multiple sizes and using each window in a CNN to classify it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Explain Yolo

A

Image is divided into 9 grid cells and object localisation is is applied in all 9 independently.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In Yolo, what if an image is spread across grid cells?

A

We assign the object to grid where its midpoint falls

How well did you know this?
1
Not at all
2
3
4
5
Perfectly