Convolutional Neural Network Flashcards

CNN for images and videos

1
Q

How CNN learn visual features?

It uses __ __ in the input to inform the model.

A

It uses spatial structure in the input to inform the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the idea of spatial difference?

It connect __ of input to __ in __ layers.

A

It connect patches of input to neurons in hidden layers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does the patches in input layer connect with each other?

They connect each other by __ __.

A

They connect each other by sliding window.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Filters input into patches. What is the operation called?

Con.. Op..

A

Convolution operation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the 3 steps of convolution operation?

  1. Apply a filter - to extract __ __.
  2. Use __ __ to extract __ features.
  3. Spatially share __ across each filters.
A
  1. Apply a filter - to extract local features.
  2. Use multiple filters to extract different features.
  3. Spatially share parameters across each filters.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does convolution produce?

It produce __ __.

A

It produce feature maps.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the 3 steps to train CNN?

  1. __: Apply filters to generate feature maps.
  2. _____: ReLU
  3. __: Downsampling operation
A
  1. Convolution: Apply filters to generate feature maps.
  2. Non-linearity: ReLU
  3. Pooilng: Downsampling operation.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

In convolutional layers, what is the local connectivity?

The neuron in __ layer are computed as __ sum between patch __ weighted by __ patch. Each neuron only sees a __ patch.

A

The neuron in hidden layer are computed as weighted sum between patch input weighted by filter patch. Each neuron only sees a single patch.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the computation of layer wise operation?

There are 6 steps.

  1. Each inputs are neurons in patch of previous layers.
  2. Apply __ __ __ (filters)
  3. Do ___ operations
  4. Add o__.
  5. Add b__.
  6. Activate with __.
A
  1. Each inputs are neurons in patch of previous layers.
  2. Apply matrix of weights (filters)
  3. Do element-wise operations
  4. Add outputs.
  5. Add bias.
  6. Activate with non-linearity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does pooling do?

Give 2

  1. Reduce dimensionality of __ & increase dimensionality of __.
  2. Preserve __ __.
A
  1. Reduce dimensionality of features & increase dimensionality of filters.
  2. Preserve spatial difference.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Mini Conclusion to CNN: What are the 2 major parts of CNN?

  1. F__ L__
  2. C__ P__
A
  1. Feature Learning
  2. Class Probabilities
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Object Detection involves with detecting objects into boxes to classify each boxes into their own class.
How to effectively detect object?

The naive approach is bad cause it detect too many boxes.

…-…. and Faster …-…

A

R-CNNs and Faster R-CNNs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the 2 operations for semantic segmentation?

A

Downsampling and upsampling

Downsampling broke down the image into pixels of classes.
Upsampling build these pixels back into an image.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly