Convolution Flashcards
What is the mathematical operation called convolution?
A sliding-window dot product between a kernel (a small matrix) and the pixel matrix of the image.
Reminder: the dot product is the sum of element-wise products:
a · b = sum(a_i * b_i)
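As a quick check of the reminder above, here is a minimal sketch of the dot product in plain Python. The function name `dot` and the sample vectors are illustrative choices, not part of the original cards.

```python
def dot(a, b):
    """Dot product: the sum of element-wise products of two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x * y for x, y in zip(a, b))


# 1*4 + 2*5 + 3*6 = 4 + 10 + 18 = 32
result = dot([1, 2, 3], [4, 5, 6])
```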
What is the size of the output of a convolution between a picture of N pixels and a kernel of size K?
N - K + 1 along each dimension (assuming a stride of 1 and no padding)
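The size formula can be checked with a small one-dimensional sketch. It assumes a stride of 1 and no padding, and it slides the kernel without flipping it, which is the convention most deep learning libraries use under the name "convolution". The name `convolve_1d` and the sample values are hypothetical.

```python
def convolve_1d(signal, kernel):
    """Slide the kernel over the signal and take a dot product at each position.

    Assumes stride 1 and no padding, so the output has len(signal) - len(kernel) + 1 items.
    """
    n, k = len(signal), len(kernel)
    if k > n:
        raise ValueError("kernel must not be longer than the signal")
    return [
        sum(signal[i + j] * kernel[j] for j in range(k))
        for i in range(n - k + 1)
    ]


# N = 5, K = 3, so the output has 5 - 3 + 1 = 3 values.
# Each window [x, y, z] yields x*1 + y*0 + z*(-1) = x - z.
out = convolve_1d([1, 2, 3, 4, 5], [1, 0, -1])
```

With N = 5 and K = 3 there are exactly three positions where the kernel fits entirely inside the signal, which is where N - K + 1 comes from.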
Feature Map / Activation Map
The output of the convolution between an input and a kernel
Convolution Layer
A convolution layer computes the convolution between each of its k kernels and its input, optionally adding a bias, yielding k outputs (one feature map per kernel).
The learned parameters are the kernels and the biases.
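To make the "k kernels, k outputs" idea concrete, here is a rough sketch of a convolution layer's forward pass on a single-channel 2D input, assuming stride 1 and no padding. The names `conv2d_single` and `conv_layer` and the kernel and bias values are made up for illustration; in a real network these values would be learned.

```python
def conv2d_single(image, kernel, bias=0.0):
    """Convolve one 2D kernel over a 2D image (stride 1, no padding) and add a bias."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(
                image[r + i][c + j] * kernel[i][j]
                for i in range(kh)
                for j in range(kw)
            ) + bias
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


def conv_layer(image, kernels, biases):
    """Apply each kernel with its own bias, producing one feature map per kernel."""
    return [conv2d_single(image, k, b) for k, b in zip(kernels, biases)]


image = [[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]
kernels = [
    [[1, 0], [0, 1]],  # sums the main diagonal of each 2x2 window
    [[0, 1], [1, 0]],  # sums the anti-diagonal of each 2x2 window
]
biases = [0.0, 1.0]

# Two kernels give two 2x2 feature maps (3 - 2 + 1 = 2 per dimension).
feature_maps = conv_layer(image, kernels, biases)
```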
Pooling Layer
Reduces the spatial dimensions of its input by applying a simple filter that has no learned parameters.
Max Pooling: take the maximum value from a given window (for example, a 2x2 window from a 4x4 matrix).
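The 2x2-window-on-a-4x4-matrix example can be sketched as follows. This assumes non-overlapping windows (a stride equal to the window size), which is a common choice but is not stated on the card. The name `max_pool_2d` and the sample matrix are illustrative.

```python
def max_pool_2d(matrix, size=2, stride=2):
    """Take the maximum of each size x size window, moving by `stride` each step."""
    out_h = (len(matrix) - size) // stride + 1
    out_w = (len(matrix[0]) - size) // stride + 1
    return [
        [
            max(
                matrix[r * stride + i][c * stride + j]
                for i in range(size)
                for j in range(size)
            )
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


matrix = [[1, 3, 2, 4],
          [5, 6, 7, 8],
          [9, 2, 1, 0],
          [3, 4, 5, 6]]

# The 4x4 input shrinks to 2x2: one maximum per 2x2 window.
pooled = max_pool_2d(matrix)
```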
Name the components of a CNN
- Convolution Layers
- Pooling Layers
- Fully Connected Layer
The fully connected layer maps the flattened output of the previous layers to the desired dimensionality. For classification, its output is passed through a softmax to obtain a probabilistic output.
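The last stage can be sketched as flatten, then a fully connected layer, then softmax. This is a toy illustration with hand-picked weights, not a trained model. The helper names (`flatten`, `fully_connected`, `softmax`) and all numeric values are assumptions made for the example.

```python
import math


def flatten(feature_maps):
    """Turn a list of 2D feature maps into one flat list of values."""
    return [v for fmap in feature_maps for row in fmap for v in row]


def fully_connected(inputs, weights, biases):
    """Each output is a dot product of the inputs with one weight row, plus a bias."""
    return [
        sum(w * x for w, x in zip(row, inputs)) + b
        for row, b in zip(weights, biases)
    ]


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    m = max(logits)  # subtracting the max keeps exp() numerically stable
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


# One 2x2 feature map flattened to 4 values, then mapped to 3 classes.
features = flatten([[[1.0, 2.0], [3.0, 4.0]]])
weights = [
    [0.1, 0.0, 0.0, 0.0],
    [0.0, 0.1, 0.0, 0.0],
    [0.0, 0.0, 0.1, 0.1],
]
biases = [0.0, 0.0, 0.0]

logits = fully_connected(features, weights, biases)
probs = softmax(logits)
predicted_class = probs.index(max(probs))
```

The number of weight rows sets the output dimensionality, so three rows yield three class scores regardless of how many values the flattened input holds.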