Deep Learning Flashcards

1
Q

What is batch normalisation?

A

Normalising the output of each layer in order to speed up training

Remove the mean and scale by the standard deviation,

We don’t care about the magnitude of the output just the shape of it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is dataset augmentation?

A

This is a regularisation technique

This is where we create fake data and add it to the training set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why do we add a small amount of noise to the network weights?

A

It is a regularisation technique to prevent over-fitting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is early stopping?

A

A regularisation technique where we stop training early

The number of training samples can then be used as a hyper parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Sparse Representation?

A

A regularisation technique where we force the hidden representation to have more zeroes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are Ensemble Methods?

A

A regularisation technique, where we train multiple models at the same time and let them vote on the final output

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is dropout?

A

A regularisation technique, where we randomly remove a percentage of neurons during training

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Local Connectivity in the context of CNNs?

A

The idea that each neuron inside a layer is only connected to a small region in the previous layer, called a receptive field

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is weight sharing in the context of a CNN?

A

The idea that a single filter (small number of weights) can be used to detect a feature in any part of the image

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is stride in a CNN?

A

The size of the step we take when sliding the filter across the previous layer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the size of the output after convolving a FxF filter on an NxN image using a stride of S?

A

A= (N-F)/S + 1

(A,A)

we need to choose values of N,F and S such that A is an integer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly