Image Segmentation Flashcards
Name 2 traditional image segmentation methods
Clustering: Agglomerative and Divisive
Region Growing
Name 3 properties that can be used for clustering-based image segmentation
Intensity values, Color properties and Texture measurements
Draw a typical NN for semantic segmentation. Make sure to explain how the output shape looks like. Input: HxW C classes
Input -> Fully convolutonal Layers (Sampling and Unsampling, to process the semantic segmentation and return the output in the original size) -> Output: HXWXC
What is instance segmentation and how does it compare to semantic segmentation. Name 2 typical approaches for instance segmentation
Instance segmentation not only classifies each pixel to an object like semantic segmentation, but also marks the instance that belongs it.
2 approaches are Mask R-CNN and Multi-task network cascades.
What is the difference between semantic segmentation and instance segmentation. Name two fields of application for segmentation tasks
Instance segmentation not only classifies each pixel to an object like semantic segmentation, but also marks the instance that belongs it. Application fields: Medical Images and image or video captioning.
Name two segmentation paradigms that are designed in such a way that the number of objects can be obtained. Explain the difference between these paradigm
Instance segmentation focus on classifies each pixel to an object and also says at which instance the object belongs. The semantic segmentation classifies each pixel to an object, but without saying which instance belongs to.
Insert the formula for the Pixel Accuracy, IoU and dice coefficient DSC
Pixel Accuracy = (TP + TN) / (TP + TN + FP + FN)
IoU = TP / (TP + FP + FN)
DSC = 2TP / (2TP + FP + FN)
Atrous Spatial Pyramid Pooling is an es essential part of DeepLab-Segmentation Architecture. Describe or draw an ASPP-block
Input -> Convolutional Kernel 1x1 rate 6 // Convolutional Kernel 3x3 rate 12 // Convolutional Kernel 3x3 rate 18 // Convolutional Kernel 3x3 rate 24 -> Concatenation Output
What problems might ASPP specifically face?
ASPP can face with problems based on degeneration of filters due to 0-padding. That occurs due to increasing rate during the process, and the information of the image is lost by 0-padding (the mean value will also be 0 and the information would be lost)
Name potential solutions for the problems you mentioned above about ASPP
Solution:
- Incorporation of global context
- GAP-Features fed to 1x1 convolution and upsample
How many pooling layers of extent 2x2 does a fully convolutional network need to produce afeature map of 1/32 the scale of the original image?
The pooling operation with a 2x2 kernel reduces the spatial dimensions of the feature map by half in each dimension (HxW → H/2 x W/2).
So, to get (H/32 X W/32) it is necessary 5 pooling layers
If a 2x2 atrous (dilated) convolution kernel with a dilated rate = 1 has 4 parameters, how many parameters does a 3x3 atrous (dilated) convolution kernel with a dilated rate = 2 have?
9 parameters. The dilated rate determines the spacing between the values in the kernel, and does not change the number of parameters.
Write down one advantage of atrous (dilated) convolutionlayers
Increase the image resolution
How does convolution in DeepLab differ from conventional convolutions and what is its advantage?
DeepLab uses atrous (dilated) convolutions, enabling increase the receptive field and maintains the image resolution during the training.
What technique does DeepLabV3 use to deal with degeneration of filters?
DeepLabV3 uses the “atrous spatial pyramid pooling” (ASPP), which incorporates a global context by concatenating parallel atrous convolutions