Week 1: principles of deep learning in artificial networks Flashcards

Question 1

Q

What is a deep network?

Answer

A

A learning network that transforms or extracts features using multiple nonlinear processing units arranged in multiple layers with hierarchical organisation and different levels of representation and abstraction

Question 2

Q

How do complex outcomes emerge in deep networks?

Answer

A

From interactions between many simple steps

Question 3

Q

What is a representation in a deep network?

Answer

A

The information the computer is given. Each representation is built from an earlier representation that can transfer the features and extract complex features from simpler features

Question 4

Q

What are the 4 operations in a linear-nonlinear layer

Answer

A

Filter/convolve
Threshold/rectification
Pool
Normalise

Question 5

Q

What is the purpose of the filter/convolve operation?

Answer

A

To determine how well each group of nearby pixels matches each of a group of filters

Question 6

Q

How does the filter operation work?

Answer

A

The input is a pixel map

convolution step looks for a pattern in a group of neighbouring pixels that corresponds to the convolution filter
if source pixels follow the filter pattern the results is a high value, if input area is all same brightness result will be 0, if source pixels are opposite to the filter the result will be negative
output is feature maps

Question 7

Q

What is the purpose of the threshold/rectify operation?

Answer

A

Introduce nonlinearity by setting negative activations of units to zero (and maybe set a maximum activation)

Question 8

Q

What is the goal of the ReLU activation function?

Answer

A

To only activate the output feature map if its value reaches a certain threshold

Question 9

Q

What is the purpose of the pooling operation?

Answer

A

Downsample the units to improve computational efficiency

Question 10

Q

How does the pooling operation work?

Answer

A

Typically takes the maximum of a square of 2x2 neighbouring units of the feature map

Question 11

Q

What is the purpose of the normalise operation?

Answer

A

Rescale responses of each feature map to have mean 0 and standard deviation 1 so each feature map contributes similarly to classification

Question 12

Q

Why is normalisation necessary?

Answer

A

The range could be different between feature maps, weighting some more than others

Question 13

Q

What tasks are deep learning useful for?

Answer

A

Useful for achieving tasks that are difficult to describe formally. Tasks that are difficult for computers but intuitive for humans

Question 14

Q

What is the final layer of the network

Answer

A

The final fully-connected layer links pattern of most abstracted, top-level features to required response
-the last feature map is flattened into a line of independent units where each is connected to all the others

Question 15

Q

What does the softmax function do?

Answer

A

Determines the probability of the desired response

Question 16

Q

How does the softmax function work?

Answer

Study These Flashcards

A

Each input image has a score reflecting the match bewegen the top layer’s activation pattern. The score is converted to a probability that this input image falls into each category

Question 17

Q

How are convolution filters and weights related?

Answer

Study These Flashcards

A

The convolution filters form the weights of connections between the nodes in a neural network

Question 18

Q

How does a network learn the weights of the connections?

Answer

Study These Flashcards

A

Using back-propagation error

Question 19

Q

What does back-propagation error do?

Answer

Study These Flashcards

A

Adjusts the filter structure - the link between layers

Question 20

Q

What is the basis for back-propagation error?

Answer

Study These Flashcards

A

The match or conflict of expected and actual outputs

Question 21

Q

Why do filters generally have a single set of weights for all positions in the feature map?

Answer

Study These Flashcards

A

if a feature is useful to compute at one position, it is probably useful at another position
the filter values are weights that need to be learned, it is computationally demanding if the set is too large
the convolution operation is a fast matrix function, if filters are not fixed the convolution operation cannot be used

Question 22

Q

What is a definition of machine learning?

Answer

Study These Flashcards

A

A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P if its performance at tasks in T, as measured by P, improves with experience E

Week 1: principles of deep learning in artificial networks Flashcards

(22 cards)