3 Flashcards
What is Convolutional Neural Network (CNN)?
Deep learning model specifically designed for processing grid-like data, such as images, videos, or time-series data. CNNs are highly effective in tasks involving visual data, like image classification, object detection, and segmentation, because they can automatically capture spatial hierarchies and patterns from raw pixel data
How does Convolution work?
We take one layer, pass it through the filter and get the Activation map. Do it for all z dimensions (x-y-z)
What are parameters of the Convolutional map and what they do?
Kernel size (k) - defines the size of the filter
Stride (s) - how many pixels to move?
Padding (p) - how much extend the borders
How to calculate the dimensions of the feature map after going through convolution?
(N-F/S) + 1
How to calculate the number of parameters in the feature map?
(filter’s lenght * filter’s height * image’s width +1) * number of filters
Why use CNNs instead of Fully-Connected layers?
CNNs have less parameters
CNNs exploit the spacial structure of the image
Better generalization due to shared weight structure.
How Pooling and Max Pooling works?
Pooling - reduces the size of the image (resolution) without adding additional learnable parameters
Max Pooling - find a maximum value of a filter. It doesn’t have any parameters