Encoder Decoder Networks Flashcards
What tasks can be accomplished by decoding
Classofication
mapping
captioning
translation
generation
recognition
Define decoding
Process of decoding the information represented into a vector whose elements are extracted from predefined alphabet
For what can interpolation maps be used?
Up-sampling from a low-resolution image towards an high resolution image
what is the deep learning optimal approach to up-sampling and why?
Transposed convolution
It does not use a predefined interpolation method. Learning will allow the tuning of the parameters
Transposed convolution is a many-to-one association T or F
F
Transposed convolution has the same connectivity as the normal convolution in the backward direction T or F
T
Transposed convolution is a type of convolution T or F
F
How can we emulate transposed convolution using convolution?
By up-sampling the input by adding zeros between the values in the input matrix in a way that the direct convolution produces the same effect as the transposed convolution
Weights in the transposed convolution are learnable T or F
T
For up-sampling with transposed convolution we need a predefined interpolation method T or F
F
Other names for transposed convolution?
Deconvolution
Fractional strided convolution
Why transposed convolution is called so?
Because if we interpret convolution as a matrix product between a rearranged version of the filter and the image, than TC correspond to a matrix multiplication between the transposed matrix of the rearranged filter and the image to up-sample
How to perform transposed convolution
Match the filter with a single pixel of the image by scalar multiplication. Than repeat for the next pixel, the position of the result in the output is according to the strided. Perform sum on the superposition of the output
Possible output layers for encoder-decoder networks
Linear, binary, softmax
Main feature of U-Net
Skipe connections between correspondent encoding and decoding layer. The outpur of progressive decoding step is concatenated to the correspondent feature map in the encoding layer and the receptive field fo the neuron spans the full depth of the result of the concatenation