L9 - Computer Vision and CNN Flashcards

Question 1

Q

What is the aim of Computer Vision?

Answer

A

Use ANN’s to mimc what the brain is doing when humans use their eyes.

Question 2

Q

What is a naive approach to using a deep neural network for image classification?

Answer

A

Stack image pixels into a vector of binary format
1. Learn the vector

Question 3

Q

What are the issues of stacking pixels and running them through a DNN for image classification?

Answer

A

High-dimensional and doesn’t scale
1. Not robust to small changes

Question 4

Q

What did Hubel and Wiesel notice? What neural network did this lead to?

Answer

A

Different neurons fire when vision target changes
1. Convolutional Neural Network

Question 5

Q

What 3 practicalities are CNN’s based on?

Answer

A

Reduces number of input nodes
1. Tolerates small pixel changes with no impact on classification ability
2. Takes advantage of pixel correlations on complex images

Question 6

Q

What is a filter in CNN? How is it learned?

Answer

A

Filter is a low dimension kernel learned via back propagation on the original image.

Question 7

Q

How is the filter applied to the image?

Answer

A

Place the filter ‘over’ the original image, and perform Dot Product between the filter and the original image.
1. Add a bias term
2. Returns a value of either 0 or 1
3. Value is then added to a feature map
4. Move filter one position and repeat
5. Stop when feature map is complete

Question 8

Q

What does convolution mean?

Answer

A

The process of moving the filter over the image and adding the value to the feature map. Hence the name CNN.

Question 9

Q

What is the difference between pre-1980 CNN’s and post-1980 CNN’s?

Answer

A

Post 1980 introduced Deep Convolutional Neural Networks with nested feature maps.

Question 10

Q

Describe the Deep Convolutional Neural Network process…

Answer

A

original 64 x 64 image, with a 5x5 filter.
1. Perform convolutions to obtain 60x60 feature map
2. 60x60 feature map is used as input for the next iteration.
3. Process repeats, creating a deep nesting of feature maps.
4. Each iteration creates a more accurate feature map

Question 11

Q

In a Deep CNN, what are FC Layers?

Answer

A

Fully connected -> Everything connects to everything in each layer

Question 12

Q

In a Deep CNN, what is a Soft Max?

Answer

A

Deep CNN connects down to 10 neurons

Question 13

Q

What are 3 limitations to Deep Learning?

Answer

A

Computationally expensive -> Needs GPU
1. Hard to interpret -> Making them Black Box like
2. Need lots of training data