L9 - Computer Vision and CNN Flashcards
1
Q
- What is the aim of Computer Vision?
A
- Use ANN’s to mimc what the brain is doing when humans use their eyes.
2
Q
- What is a naive approach to using a deep neural network for image classification?
A
- Stack image pixels into a vector of binary format
- Learn the vector
3
Q
- What are the issues of stacking pixels and running them through a DNN for image classification?
A
- High-dimensional and doesn’t scale
- Not robust to small changes
4
Q
- What did Hubel and Wiesel notice? What neural network did this lead to?
A
- Different neurons fire when vision target changes
- Convolutional Neural Network
5
Q
- What 3 practicalities are CNN’s based on?
A
- Reduces number of input nodes
- Tolerates small pixel changes with no impact on classification ability
- Takes advantage of pixel correlations on complex images
6
Q
- What is a filter in CNN? How is it learned?
A
- Filter is a low dimension kernel learned via back propagation on the original image.
7
Q
- How is the filter applied to the image?
A
- Place the filter ‘over’ the original image, and perform Dot Product between the filter and the original image.
- Add a bias term
- Returns a value of either 0 or 1
- Value is then added to a feature map
- Move filter one position and repeat
- Stop when feature map is complete
8
Q
- What does convolution mean?
A
- The process of moving the filter over the image and adding the value to the feature map. Hence the name CNN.
9
Q
- What is the difference between pre-1980 CNN’s and post-1980 CNN’s?
A
- Post 1980 introduced Deep Convolutional Neural Networks with nested feature maps.
10
Q
- Describe the Deep Convolutional Neural Network process…
A
- original 64 x 64 image, with a 5x5 filter.
- Perform convolutions to obtain 60x60 feature map
- 60x60 feature map is used as input for the next iteration.
- Process repeats, creating a deep nesting of feature maps.
- Each iteration creates a more accurate feature map
11
Q
- In a Deep CNN, what are FC Layers?
A
- Fully connected -> Everything connects to everything in each layer
12
Q
- In a Deep CNN, what is a Soft Max?
A
- Deep CNN connects down to 10 neurons
13
Q
- What are 3 limitations to Deep Learning?
A
- Computationally expensive -> Needs GPU
- Hard to interpret -> Making them Black Box like
- Need lots of training data