C4W3 Flashcards
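Difference between classification with localisation and object detection?
Classification with localisation assumes a single object: the network outputs its class and one bounding box. Object detection handles multiple objects, possibly of different classes, each with its own box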
Explain the YOLO algorithm?
YOLO stands for You Only Look Once
1. Place a grid over the image
2. Run the network once on the whole image; it outputs one 1x1xN vector per grid cell (object probability, box coordinates, class probabilities)
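
A minimal sketch of how one labelled object could be turned into the target vector of its grid cell. The layout `[pc, bx, by, bh, bw, classes...]` (so N = 5 + number of classes, no anchor boxes), the function name `encode_object`, and the 3x3 grid are assumptions for illustration, not something these cards specify.

```python
def encode_object(cx, cy, w, h, class_id, grid_size=3, num_classes=3):
    """Return (row, col, vector) for one object.

    cx, cy, w, h are fractions of the full image (0..1).
    Assumed vector layout: [pc, bx, by, bh, bw, one-hot classes].
    """
    # The object belongs to the cell that contains its midpoint.
    col = min(int(cx * grid_size), grid_size - 1)
    row = min(int(cy * grid_size), grid_size - 1)

    # Midpoint relative to the cell's top-left corner (0..1).
    bx = cx * grid_size - col
    by = cy * grid_size - row

    # Size in units of one cell; may exceed 1 for large objects.
    bw = w * grid_size
    bh = h * grid_size

    one_hot = [1.0 if k == class_id else 0.0 for k in range(num_classes)]
    return row, col, [1.0, bx, by, bh, bw] + one_hot


row, col, target = encode_object(0.5, 0.5, 0.25, 0.5, class_id=1)
print(row, col, target)
```

Cells that contain no object midpoint would get `pc = 0`, and the remaining entries are then ignored.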
What is non-max suppression?
Removes duplicate detections of the same object
Keeps the box with the highest probability and suppresses overlapping boxes with lower probability
When are anchor boxes used?
When objects overlap, so that one grid cell can predict several objects (one per anchor box shape)
What is region proposal?
Use a segmentation algorithm (as in R-CNN) or another NN (as in Faster R-CNN) to propose candidate regions, then run the conv net only on those regions
What is intersection over union?
Metric for how well a predicted bounding box overlaps the ground-truth box (0 = no overlap, 1 = exact match)
How does intersection over union work?
- Compute the area of the intersection of the two boxes
- Divide by the area of their union
- By convention the prediction counts as correct if IoU >= 0.5
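
The steps above can be sketched in a few lines. The corner format `(x1, y1, x2, y2)` and the function name `iou` are assumptions chosen for the example.

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    # Corners of the overlapping rectangle, if there is one.
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])

    # Width and height are clamped at 0 when the boxes do not overlap.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter

    return inter / union if union > 0 else 0.0


predicted = (1, 1, 3, 3)
ground_truth = (0, 0, 2, 2)
score = iou(predicted, ground_truth)
print(score, "correct" if score >= 0.5 else "incorrect")
```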
What is semantic segmentation?
Assigning a class label to every pixel of the image
Idea of semantic segmentation and U-Net (named after its U-shaped architecture)
Use transpose convolutions, which upsample small feature maps back towards the input resolution
Use symmetric skip connections from the early layers to the later layers of matching resolution
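
A one-dimensional sketch of how a transpose convolution enlarges its input: each input value stamps a scaled copy of the kernel into the output, and overlapping stamps are summed. The function name `conv_transpose_1d` is hypothetical, and the sketch assumes no padding; real layers work on 2D feature maps with learned kernels.

```python
def conv_transpose_1d(x, kernel, stride=2):
    """Transpose convolution of a 1D list with no padding (assumed)."""
    out_len = (len(x) - 1) * stride + len(kernel)
    out = [0.0] * out_len
    for i, value in enumerate(x):
        for k, weight in enumerate(kernel):
            # Place value * kernel starting at position i * stride;
            # overlapping contributions add up.
            out[i * stride + k] += value * weight
    return out


small = [1.0, 2.0]
print(conv_transpose_1d(small, kernel=[1.0, 1.0, 1.0], stride=2))
```

With a stride of 2 and a kernel of length 3, an input of length 2 becomes an output of length 5, which is the "blow-up" the card refers to.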
Steps of non-max suppression?
- Discard all boxes with probability below a threshold
- While boxes remain: pick the box with the highest probability as a prediction, then discard any remaining box with IoU > 0.5 with it
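
A minimal single-class sketch of these steps. The function names, the `(x1, y1, x2, y2)` box format, and the default probability threshold of 0.6 are assumptions for illustration; the cards only fix the IoU threshold of 0.5.

```python
def iou(a, b):
    """Intersection over union of boxes given as (x1, y1, x2, y2)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def non_max_suppression(boxes, scores, score_threshold=0.6, iou_threshold=0.5):
    """Return the indices of the boxes that survive, best first."""
    # Step 1: drop low-probability boxes.
    candidates = [i for i, s in enumerate(scores) if s >= score_threshold]
    candidates.sort(key=lambda i: scores[i], reverse=True)

    keep = []
    while candidates:
        # Step 2: the most confident remaining box becomes a prediction.
        best = candidates.pop(0)
        keep.append(best)
        # Step 3: drop remaining boxes that overlap it too much.
        candidates = [i for i in candidates
                      if iou(boxes[best], boxes[i]) <= iou_threshold]
    return keep


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30), (0, 0, 10, 10)]
scores = [0.9, 0.8, 0.7, 0.3]
print(non_max_suppression(boxes, scores))
```

With several classes, the same procedure would typically be run once per class.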