Week 1: principles of deep learning in artificial networks Flashcards
What is a deep network?
A learning network that transforms or extracts features using multiple nonlinear processing units arranged in multiple layers with hierarchical organisation and different levels of representation and abstraction
How do complex outcomes emerge in deep networks?
From interactions between many simple steps
What is a representation in a deep network?
The information the computer is given. Each representation is built from an earlier representation that can transfer the features and extract complex features from simpler features
What are the 4 operations in a linear-nonlinear layer
Filter/convolve
Threshold/rectification
Pool
Normalise
What is the purpose of the filter/convolve operation?
To determine how well each group of nearby pixels matches each of a group of filters
How does the filter operation work?
The input is a pixel map
- convolution step looks for a pattern in a group of neighbouring pixels that corresponds to the convolution filter
- if source pixels follow the filter pattern the results is a high value, if input area is all same brightness result will be 0, if source pixels are opposite to the filter the result will be negative
- output is feature maps
What is the purpose of the threshold/rectify operation?
Introduce nonlinearity by setting negative activations of units to zero (and maybe set a maximum activation)
What is the goal of the ReLU activation function?
To only activate the output feature map if its value reaches a certain threshold
What is the purpose of the pooling operation?
Downsample the units to improve computational efficiency
How does the pooling operation work?
Typically takes the maximum of a square of 2x2 neighbouring units of the feature map
What is the purpose of the normalise operation?
Rescale responses of each feature map to have mean 0 and standard deviation 1 so each feature map contributes similarly to classification
Why is normalisation necessary?
The range could be different between feature maps, weighting some more than others
What tasks are deep learning useful for?
Useful for achieving tasks that are difficult to describe formally. Tasks that are difficult for computers but intuitive for humans
What is the final layer of the network
The final fully-connected layer links pattern of most abstracted, top-level features to required response
-the last feature map is flattened into a line of independent units where each is connected to all the others
What does the softmax function do?
Determines the probability of the desired response