Image Segmentation Flashcards

1
Q

Name 2 traditional image segmentation methods

A

Clustering: Agglomerative and Divisive
Region Growing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Name 3 properties that can be used for clustering-based image segmentation

A

Intensity values, Color properties and Texture measurements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Draw a typical NN for semantic segmentation. Make sure to explain how the output shape looks like. Input: HxW C classes

A

Input -> Fully convolutonal Layers (Sampling and Unsampling, to process the semantic segmentation and return the output in the original size) -> Output: HXWXC

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is instance segmentation and how does it compare to semantic segmentation. Name 2 typical approaches for instance segmentation

A

Instance segmentation not only classifies each pixel to an object like semantic segmentation, but also marks the instance that belongs it.
2 approaches are Mask R-CNN and Multi-task network cascades.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the difference between semantic segmentation and instance segmentation. Name two fields of application for segmentation tasks

A

Instance segmentation not only classifies each pixel to an object like semantic segmentation, but also marks the instance that belongs it. Application fields: Medical Images and image or video captioning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Name two segmentation paradigms that are designed in such a way that the number of objects can be obtained. Explain the difference between these paradigm

A

Instance segmentation focus on classifies each pixel to an object and also says at which instance the object belongs. The semantic segmentation classifies each pixel to an object, but without saying which instance belongs to.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Insert the formula for the Pixel Accuracy, IoU and dice coefficient DSC

A

Pixel Accuracy = (TP + TN) / (TP + TN + FP + FN)
IoU = TP / (TP + FP + FN)
DSC = 2TP / (2TP + FP + FN)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Atrous Spatial Pyramid Pooling is an es essential part of DeepLab-Segmentation Architecture. Describe or draw an ASPP-block

A

Input -> Convolutional Kernel 1x1 rate 6 // Convolutional Kernel 3x3 rate 12 // Convolutional Kernel 3x3 rate 18 // Convolutional Kernel 3x3 rate 24 -> Concatenation Output

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What problems might ASPP specifically face?

A

ASPP can face with problems based on degeneration of filters due to 0-padding. That occurs due to increasing rate during the process, and the information of the image is lost by 0-padding (the mean value will also be 0 and the information would be lost)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Name potential solutions for the problems you mentioned above about ASPP

A

Solution:
- Incorporation of global context
- GAP-Features fed to 1x1 convolution and upsample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How many pooling layers of extent 2x2 does a fully convolutional network need to produce afeature map of 1/32 the scale of the original image?

A

The pooling operation with a 2x2 kernel reduces the spatial dimensions of the feature map by half in each dimension (HxW → H/2 x W/2).

So, to get (H/32 X W/32) it is necessary 5 pooling layers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

If a 2x2 atrous (dilated) convolution kernel with a dilated rate = 1 has 4 parameters, how many parameters does a 3x3 atrous (dilated) convolution kernel with a dilated rate = 2 have?

A

9 parameters. The dilated rate determines the spacing between the values in the kernel, and does not change the number of parameters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Write down one advantage of atrous (dilated) convolutionlayers

A

Increase the image resolution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How does convolution in DeepLab differ from conventional convolutions and what is its advantage?

A

DeepLab uses atrous (dilated) convolutions, enabling increase the receptive field and maintains the image resolution during the training.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What technique does DeepLabV3 use to deal with degeneration of filters?

A

DeepLabV3 uses the “atrous spatial pyramid pooling” (ASPP), which incorporates a global context by concatenating parallel atrous convolutions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

State one difference that sets apart data augmentation for segmentation and for classification

A

Data augmentation for segmentation must preserve spatial correspondence between input images and their corresponding masks, while classification augmentation does not require such alignment.