Object detection Flashcards
Name some difficulties associated with object detection in images
Objects may appear in different poses
Cluttered scenes with occlusions
Intrinsic variability within a class
Non-rigid objects like humans
What are the steps in Dalal-triggs pedestrian detection algorithm
- Extract windows of 64x128 in all relevant positions and scales.
- Compute HoG
- Use support vector machine to score the window
- Remove non-max overlapping detections
Name some important properties of the non-linear functions used in deep learning
Good gradients,
efficient
What is the field of vision regarding Convo nets?
The field of vision for a node is the input nodes that affect the given node in any way.
What is the Relu
A function, f(x) = max(0,x)
What might be the problem if a neural net doesn’t train?
- If the input to the layer (The gradient) is zero, that layer won’t train
- If the weights to a layer are zero, no layer before the layer will change.
- If many layers have weights with norm smaller than one, the change will get smaller and smaller
What solutions might be implemented in a neural net to resolve the problem of diminishing training as we go back through the layers?
- Batch normalization
- ResNet/ Skip connections
- Good weight initialization
Why is the softmax activation function often used in the last layer for classification problems?
The sum of all outputs will be equal to 1 with the softmax, which means the output values can be interpreted as “probabilities” for an object belonging to each class.