Computer Vision Flashcards

1
Q

What is feature map?

A

Output pixel based on the multiplication and summation for each position of the kernel on the input image

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is kernel?

A

(small) tensor with values that are multiplied with the values of an input tensor at the position of the kernel

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Concept of Mapping a Convolutional Kernel

A

Applying the kernel across the image means sliding it over the coordinate grid of pixels. In default, no part of the kernel can overstep the boundaries of the grid

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Why apply the convolutional kernel?

A

Resulting feature map will be smaller than input

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What padding does?

A

adds virtual number of rows and columns in the height and width dimension filled with zeros

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why apply padding?

A

If the output feature map should have the same size as the input tensor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How many channels used for color images?

A

3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Kernels and Multichannel Inputs?

A

In the case of an input with multiple channels, the kernels processing this input will have a channel dimension with the same number of channels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Rules for model plot statistics

A

Mean and standard deviation should develop smoothly, activations near zero are problematic since they cannot be processed further in a meaningful way

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

gradient estimate is more accurate with…

A

higher batch size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Which activations are problematic since they cannot be processed further in a meaningful way?

A

near zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Which requirements 1cycle training incorporate?

A

start low
End low
Medium high

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Phases of learning rate schedule?

A

– Warmup phase: the learning rate is increased gradually
– Annealing phase: the learning rate is decreased again

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What solves initial distribution with many near zero activations and shift of the distribution during training?

A

Batch normalization

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Why use batch normalization?

A

Speed up training (high learning rate), allows sup-optimal starts (less iterations), adds randomness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is convolution?

A

Sliding the kernel across the input tensor (image) and multiplying overlapping values, then summing the products to get a single output value for that location

17
Q

What higher stride leads to?

A

Smaller output feature map, reduced computational cost

18
Q

What lower stride leads to?

A

More spatial detail for capturing fine features

19
Q

What is stride?

A

Step size with which the kernel moves across the input during the convolutional operation

20
Q

Typical start for convolution?

A

Top left corner