Week 5 Flashcards
Why don’t we use regular NN for images?
Doesn’t scale:
100x100 pixels = 10k parameters per node
Not robust to small changes (e.g. shifts) in the input
Doesn’t take advantage of spatial correlations between nearby pixels
What are filters in a CNN?
A small grid of values, the size of a patch of the image, which acts as the weights that will be learned by the NN via backpropagation
How do we apply filters in CNNs?
Take the dot product between the filter and a patch of the image, add a bias term, and store the result in a matrix called a feature map. Move the filter 1 pixel across and repeat.
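A minimal NumPy sketch of this sliding dot product (the function name, the tiny image, and the vertical-edge filter are just for illustration):

```python
import numpy as np

def convolve2d(image, kernel, bias=0.0, stride=1):
    """Slide the filter over the image, taking a dot product at each position."""
    kh, kw = kernel.shape
    out_h = (image.shape[0] - kh) // stride + 1
    out_w = (image.shape[1] - kw) // stride + 1
    feature_map = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            patch = image[i*stride:i*stride+kh, j*stride:j*stride+kw]
            feature_map[i, j] = np.sum(patch * kernel) + bias
    return feature_map

# Example: a vertical-edge filter on a tiny 5x5 image
image = np.array([[0, 0, 1, 1, 1]] * 5, dtype=float)
kernel = np.array([[-1, 0, 1],
                   [-1, 0, 1],
                   [-1, 0, 1]], dtype=float)
print(convolve2d(image, kernel))   # 3x3 feature map (N-2 when N=5)
```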
What is a feature map
Map of where the feature indicated by your filter appears
Values > 0: the feature appears here
Values < 0: the feature does not appear here
Limitations of deep learning
DL is data hungry: it needs a LOT of data to work
DL is heavy: you often need GPUs and cloud computing to train it and even to use it
DL is bad at representing uncertainty: it’s easy to trick a neural network into thinking it’s right
Hard to optimise: architecture, learning method…
Hard to interpret: neural networks are black boxes.
What do we do to the feature map once it’s made?
Pass the values through a ReLU function
(i.e. equal to x if x >= 0, and 0 otherwise).
Finds all the filter matches.
relu(x) = max(0,x)
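As a quick NumPy sketch, applying relu(x) = max(0, x) element-wise to a feature map:

```python
import numpy as np

def relu(x):
    # Keep positive values (filter matches), zero out the rest
    return np.maximum(0, x)

feature_map = np.array([[ 2.0, -1.5],
                        [-0.3,  0.7]])
print(relu(feature_map))   # [[2.  0. ] [0.  0.7]]
```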
Why do we need to downsample
Our feature map is (N-2) pixels wide (for a 3x3 filter), where N is the width of our image, so it is nearly as large as the original. This does not scale.
How do we downsample?
Pooling
What is Pooling?
Aggregating,
Summarising,
Downsampling the image
Max-pooling
Average-pooling
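A minimal sketch of 2x2 max-pooling in NumPy (the window size and helper name are illustrative; average-pooling would use .mean() instead of .max()):

```python
import numpy as np

def max_pool(feature_map, size=2):
    """Downsample by taking the max of each non-overlapping size x size window."""
    h, w = feature_map.shape
    out = np.zeros((h // size, w // size))
    for i in range(0, h - size + 1, size):
        for j in range(0, w - size + 1, size):
            out[i // size, j // size] = feature_map[i:i+size, j:j+size].max()
    return out

fm = np.array([[1, 3, 2, 0],
               [4, 2, 1, 5],
               [0, 1, 3, 2],
               [2, 6, 0, 1]], dtype=float)
print(max_pool(fm))   # [[4. 5.] [6. 3.]]
```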
What is stride?
Step size in convolution
What’s a problem with ReLU?
Discards all negative values
What other activation functions do we have?
ReLU
tanh
Sigmoid
Leaky ReLU
Maxout
ELU
Softmax
different activation functions solve different problems
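A NumPy sketch of a few of these (the leaky-ReLU slope of 0.01 and the ELU alpha of 1.0 are common defaults, assumed here):

```python
import numpy as np

def relu(x):        return np.maximum(0, x)
def leaky_relu(x):  return np.where(x > 0, x, 0.01 * x)           # keeps a small negative slope
def elu(x):         return np.where(x > 0, x, 1.0 * (np.exp(x) - 1))
def sigmoid(x):     return 1 / (1 + np.exp(-x))
def tanh(x):        return np.tanh(x)

x = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
for f in (relu, leaky_relu, elu, sigmoid, tanh):
    print(f.__name__, f(x))
```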
What’s a problem with tanh?
Its derivative goes to 0 for large inputs (vanishing gradients, which is bad for backpropagation)
What is softmax?
Uses exponents to normalise the output layer of a NN into probabilities: softmax(x_i) = exp(x_i) / sum_j exp(x_j).
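A minimal NumPy sketch (subtracting the max before exponentiating is a standard trick for numerical stability):

```python
import numpy as np

def softmax(logits):
    # Exponentiate, then normalise so the outputs sum to 1 (a probability distribution)
    exps = np.exp(logits - np.max(logits))   # shift for numerical stability
    return exps / np.sum(exps)

print(softmax(np.array([2.0, 1.0, 0.1])))   # ~[0.66, 0.24, 0.10]
```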
Issues with a large learning rate
The weight updates overshoot the bottom of the error curve, so training can oscillate or diverge.
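A toy sketch of gradient descent on error(w) = w^2 showing the overshoot (the learning-rate values are just illustrative):

```python
# Gradient descent on error(w) = w**2, whose minimum is at w = 0.
# The gradient is 2*w; the update is w -= lr * gradient.
def descend(lr, w=1.0, steps=5):
    for _ in range(steps):
        w -= lr * 2 * w
    return w

print(descend(lr=0.1))   # small lr: w shrinks steadily towards 0
print(descend(lr=1.1))   # large lr: each step overshoots and |w| grows
```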