Convolutional neural networks Flashcards
1
Q
Now, imagine that this 4×4×2 volume gets fed into a subsequent layer with 3 filters, each of shape 2×3×2. What will be the shape of the resulting output volume?
A
3×2×3
suppose you have an n1×n2×3 image and you f1×f2×3 and 3 filters with stride s, then the formula is (n1-f1)/s +1 × (n2-f2 )/s+1 × #amount of filters
2
Q
Describe the layers in convet
A
- Repeat following pattern
- Conv layer
- ReLu
- Max pool
- Final layer is a Fully connected layer which determines the output (use softmax for multiclass classification)
3
Q
Describe max pooling
A
- Take the maximum within a region and only keep this element
- Pooling doesnt care where the maximum comes from, therefore makes it more robust to position
4
Q
Mention all of the CNN architectures
A
- Lenet5
- Alexnet
- VGG
- GoogleNet
- ResNet
5
Q
Mention the novelty in each CNN architecture
A
6
Q
Describe the architecture behind AlexNet
A
7
Q
pixel space vs feature space
Give an example of images closei n pixel space and in feature space
A
- Pixel space is how close the RAW
- i.e if pictures share large blue backgroun, then they may be close in pixel space, but not in feature space since the background does not matter for the object
- Feature space is how close the pictures according to the training labels
- What do the pictures have in common in they all share same label
- Abstracting away backround etc..
8
Q
A