Convolutional Neural Network Flashcards
CNN for images and videos
How CNN learn visual features?
It uses __ __ in the input to inform the model.
It uses spatial structure in the input to inform the model.
What is the idea of spatial difference?
It connect __ of input to __ in __ layers.
It connect patches of input to neurons in hidden layers.
How does the patches in input layer connect with each other?
They connect each other by __ __.
They connect each other by sliding window.
Filters input into patches. What is the operation called?
Con.. Op..
Convolution operation.
What are the 3 steps of convolution operation?
- Apply a filter - to extract __ __.
- Use __ __ to extract __ features.
- Spatially share __ across each filters.
- Apply a filter - to extract local features.
- Use multiple filters to extract different features.
- Spatially share parameters across each filters.
What does convolution produce?
It produce __ __.
It produce feature maps.
What are the 3 steps to train CNN?
- __: Apply filters to generate feature maps.
- _____: ReLU
- __: Downsampling operation
- Convolution: Apply filters to generate feature maps.
- Non-linearity: ReLU
- Pooilng: Downsampling operation.
In convolutional layers, what is the local connectivity?
The neuron in __ layer are computed as __ sum between patch __ weighted by __ patch. Each neuron only sees a __ patch.
The neuron in hidden layer are computed as weighted sum between patch input weighted by filter patch. Each neuron only sees a single patch.
What is the computation of layer wise operation?
There are 6 steps.
- Each inputs are neurons in patch of previous layers.
- Apply __ __ __ (filters)
- Do ___ operations
- Add o__.
- Add b__.
- Activate with __.
- Each inputs are neurons in patch of previous layers.
- Apply matrix of weights (filters)
- Do element-wise operations
- Add outputs.
- Add bias.
- Activate with non-linearity
What does pooling do?
Give 2
- Reduce dimensionality of __ & increase dimensionality of __.
- Preserve __ __.
- Reduce dimensionality of features & increase dimensionality of filters.
- Preserve spatial difference.
Mini Conclusion to CNN: What are the 2 major parts of CNN?
- F__ L__
- C__ P__
- Feature Learning
- Class Probabilities
Object Detection involves with detecting objects into boxes to classify each boxes into their own class.
How to effectively detect object?
The naive approach is bad cause it detect too many boxes.
…-…. and Faster …-…
R-CNNs and Faster R-CNNs
What are the 2 operations for semantic segmentation?
Downsampling and upsampling
Downsampling broke down the image into pixels of classes.
Upsampling build these pixels back into an image.