Computer Vision 2 Flashcards

Question 1

Q

What is imagenette?

Answer

A

Subset of ImageNet dataset created by fast.ai for faster prototyping, contains 10 classes that are easily distinguishable for humans

Question 2

Q

How pooling works?

Answer

A

Instead of applying a kernel with learnable weights the average or maximum of the values at the kernel position is taken

Question 3

Q

2 types of pooling

Answer

A

Either specify the kernel size (classical pooling) or the desired output size (adaptive pooling)

Question 4

Q

What is ResNet block?

Answer

A

key building block used in ResNet (Residual Network), a deep neural network architecture that was introduced to solve the problem of vanishing gradients and to enable the training of very deep networks

Question 5

Q

What pooling does to feature map?

Answer

A

reducing it

Question 6

Q

Key features of ResNet block?

Answer

A

Skip connections, predicts the difference (residual) between the optimal mapping and the input

Question 7

Q

What does ResNet to loss landscape?

Answer

A

Smoothing it: reduces the likelihood of sharp peaks and valleys, allows model to directly propagade the input forward and the gradient backwards, reducing the number of nonlinear transformations

Question 8

Q

What is stem of the network?

Answer

A

Higher layers have more parameters (more channels) but lower layers perform more computations (greater height and width of the feature map)

Question 9

Q

What bottleneck layers do?

Answer

A

reducing the number of channels, then, applying the
computationally more expensive convolution, and finally, increasing the number of channels to the
original size again

Question 10

Q

What is momentum?

Answer

A

optimization technique used to improve gradient descent by adding a fraction of the previous update to the current one

Question 11

Q

Why use momentum?

Answer

A

helps to accelerate the convergence process and smooth out updates, leading to more efficient and stable training

Question 12

Q

What is is RMSProp?

Answer

A

optimizer that adapts the learning rate per weight

Question 13

Q

What is Adam?

Answer

A

Combines ideas of Momentum and RSMProp in an algorithm called adaptive moment estimation

Question 14

Q

If the number of input channels does not equal the desired number of output channels a true identity path is never possible

Question 15

Q

A convolution with a kernel size of 1 x 1 would not make sense in the stem of a state of the are ResNet, as these convolutions do not reduce the size of the feature map.

Question 16

Q

Bottleneck layers do not necessarily have fewer kernels than plain ResNet layers

Answer

Study These Flashcards

A

True

Question 17

Q

The reduction in number of operations from a 9 x 9 kernel to a 3 x 3 kernel is proportionally the same as from a 3 x 3 kernel to a 1 x 1 kernel if one disregards the bias related computations

Answer

Study These Flashcards

A

True

Question 18

Q

When preprocessing the dataset, the image size for batch_tfms must be less than or equal to the image size for item_tfms.

Answer

Study These Flashcards

A

True

Computer Vision 2 Flashcards

(18 cards)