Computer Vision 2 Flashcards
What is imagenette?
Subset of ImageNet dataset created by fast.ai for faster prototyping, contains 10 classes that are easily distinguishable for humans
How pooling works?
Instead of applying a kernel with learnable weights the average or maximum of the values at the kernel position is taken
2 types of pooling
Either specify the kernel size (classical pooling) or the desired output size (adaptive pooling)
What is ResNet block?
key building block used in ResNet (Residual Network), a deep neural network architecture that was introduced to solve the problem of vanishing gradients and to enable the training of very deep networks
What pooling does to feature map?
reducing it
Key features of ResNet block?
Skip connections, predicts the difference (residual) between the optimal mapping and the input
What does ResNet to loss landscape?
Smoothing it: reduces the likelihood of sharp peaks and valleys, allows model to directly propagade the input forward and the gradient backwards, reducing the number of nonlinear transformations
What is stem of the network?
Higher layers have more parameters (more channels) but lower layers perform more computations (greater height and width of the feature map)
What bottleneck layers do?
reducing the number of channels, then, applying the
computationally more expensive convolution, and finally, increasing the number of channels to the
original size again
What is momentum?
optimization technique used to improve gradient descent by adding a fraction of the previous update to the current one
Why use momentum?
helps to accelerate the convergence process and smooth out updates, leading to more efficient and stable training
What is is RMSProp?
optimizer that adapts the learning rate per weight
What is Adam?
Combines ideas of Momentum and RSMProp in an algorithm called adaptive moment estimation
If the number of input channels does not equal the desired number of output channels a true identity path is never possible
True
A convolution with a kernel size of 1 x 1 would not make sense in the stem of a state of the are ResNet, as these convolutions do not reduce the size of the feature map.
True