Deep Learning Flashcards
what characterises supervised learning
labelled data
cost function
reperated epochs
what characterises un-supervised learning
non-labelled data
no cost function
finds representations of data
deep learning what is it
usually labelled data
uses multilayered neural networks, either convolutional or perceptrom based
What are the key components of a neural network
-Some cost function e.g. RMSE
-Forward propagation per layer per node consisting of weight x input + bias
-Following a neuron usually an activation function such as a ReLu or a Softmax
-Backpropagation
Why is backpropagation an efficient method of optimization
It stores inermediate gradients, allowing for more efficient computations of the partial derivatives at each feed forward stage, this allows for fast optimiazation of a neural network (at least relative to computing the derivatives seperately per variable
What is the activation function sigmoid and what are its pros and cons
-it is equal to 1/1+e^x and maps values range from any possible value [-infinity,infinity] to range of [0,1]
pros: differentiable and normalisees ranges
cons: hard to compute
What is the activation function (tanH) and what are its pros and cons
(e^x - e^-x) / ( e^x + e^-x). normalizes values to the range [-1,1]
-differentiable but hard to compute
What is the activation function ReLu and what are its pros and cons
f(x) = max (0,x). easy to compute but non differnetiable
-avoids saturation issues (i.e. when nodes have weights close to -1 or +1)
What is the neural radiance field neural network
Form of stereo vision
-takes as input an image from different perspectives
-outputs a 3d map of the object
This is an example of a depth image
What is the relationship between amount of data and the performance of different ML / AI methods
Large neural network performs best with large data but worst with little data
- small neural network 2nd place in both cases
- traditional ML performs better than rest with little data but worst with most data
What is the general goal of a convolutional neural network
Learning higher-level features from image
Neural networks do this hierarchically i.e. they first learn low level -> mid level -> high level features then train a classifier
what is a general structure of a general CNN up to the feature maps
Data
Convolution
Activation
Pooling
Feature maps
CNN = multi-layer neural network with:
- Local connectivity
–neurone in a layer are only connected to small region of lair before it - They share weight parameters across spatial positions
– Learning shift-invariant filter kernels
what parameters do convolutional kernels have
- Kernel Size
- Stride
- Padding
- Dilation:
what are non-linear activation functions
Leaky ReLu
Parametric ReLu
Exponential Linear Units
Good for when we want to find non-linear relationships
what is a pooling layer
taking the average or max of a section of kernel to reduce kernel size for better efficiency