ANN Flashcards
latent
existing but not yet developed; hidden, concealed
simple perceptron
activation function = step function
1. perform (forward) inference
2. compute loss
3. update weights (remember learning rate)
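A minimal sketch of these three steps for a single perceptron in numpy; the AND dataset, learning rate, and epoch count are illustrative assumptions, not part of the card:

```python
import numpy as np

# Toy data: logical AND (illustrative assumption, not from the card).
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([0, 0, 0, 1])

w = np.zeros(2)   # weights
b = 0.0           # bias
lr = 0.1          # learning rate (illustrative)

for epoch in range(10):
    for xi, yi in zip(X, y):
        # 1. forward inference: step activation on the weighted sum
        y_hat = 1 if xi @ w + b > 0 else 0
        # 2. compute loss (here just the signed error)
        error = yi - y_hat
        # 3. update weights, scaled by the learning rate
        w += lr * error * xi
        b += lr * error

print(w, b)  # learned parameters
```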
multilayer perceptron
use backpropagation and gradient descent to update weights
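A bare-bones numpy sketch of backpropagation + gradient descent for a one-hidden-layer MLP (sigmoid hidden layer, sigmoid output, binary cross-entropy; the random data and layer sizes are illustrative assumptions):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
X = rng.normal(size=(8, 3))                    # 8 samples, 3 features (illustrative)
y = rng.integers(0, 2, size=(8, 1)).astype(float)

W1 = rng.normal(scale=0.1, size=(3, 4))        # input -> hidden
b1 = np.zeros(4)
W2 = rng.normal(scale=0.1, size=(4, 1))        # hidden -> output
b2 = np.zeros(1)
lr = 0.5

for step in range(1000):
    # forward pass
    h = sigmoid(X @ W1 + b1)          # hidden activations
    y_hat = sigmoid(h @ W2 + b2)      # output probability

    # backward pass: with sigmoid + binary cross-entropy,
    # the output-layer error simplifies to (y_hat - y)
    d_out = (y_hat - y) / len(X)
    dW2 = h.T @ d_out
    db2 = d_out.sum(axis=0)
    d_h = d_out @ W2.T * h * (1 - h)  # chain rule through hidden sigmoid
    dW1 = X.T @ d_h
    db1 = d_h.sum(axis=0)

    # gradient descent update
    W2 -= lr * dW2; b2 -= lr * db2
    W1 -= lr * dW1; b1 -= lr * db1
```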
output layer activation/loss
binary classification = sigmoid + binary cross-entropy
multi-label classification (non-mutually exclusive classes) = sigmoid + binary cross-entropy on each output
multiclass classification - mutually exclusive = softmax (normalizes outputs to a probability distribution) + categorical cross-entropy
regression = no activation + MSE
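A sketch of these pairings in numpy; the logits and targets are made-up values for illustration:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    # subtract the max for numerical stability; outputs sum to 1
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

z = np.array([2.0, -1.0, 0.5])          # raw outputs (logits), illustrative

# binary / multi-label: sigmoid per output + binary cross-entropy
p = sigmoid(z)
y = np.array([1.0, 0.0, 1.0])           # one target per output
bce = -(y * np.log(p) + (1 - y) * np.log(1 - p)).mean()

# mutually exclusive classes: softmax + categorical cross-entropy
q = softmax(z)
target = 0                              # index of the true class
cce = -np.log(q[target])

# regression: no activation on z, mean squared error against real targets
```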
hidden layer activation
sigmoid and tanh - saturate for very negative and very positive z, so the gradient goes to ~0 (vanishing gradients)
ReLU = max(0, z) - gradient = 0 for z < 0 ("dying ReLU")
LReLU - nonlinear but piecewise linear, gradient != 0 everywhere
LReLU(z) = z, z > 0
         = az, z <= 0, a = configurable slope
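A numpy sketch comparing the gradients of these activations; the sample z values are illustrative:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def relu(z):
    return np.maximum(0.0, z)

def lrelu(z, a=0.01):             # a = configurable slope (0.01 is a common default)
    return np.where(z > 0, z, a * z)

z = np.array([-100.0, -1.0, 0.5, 100.0])

# sigmoid saturates: its gradient s*(1-s) is ~0 at both extremes
s = sigmoid(z)
print(s * (1 - s))                # ~0 for z = -100 and z = 100

# ReLU gradient is exactly 0 for negative z; LReLU keeps slope a there
print(np.where(z > 0, 1.0, 0.0))  # ReLU gradient
print(np.where(z > 0, 1.0, 0.01)) # LReLU gradient
```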
detecting over/underfitting
underfitting - training and validation errors are both high => increase model complexity
overfitting - training loss decreasing while validation loss increases => remedies: more (i.i.d.) data, decrease model complexity, smaller-magnitude weights (regularization)
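The "smaller-magnitude weights" remedy is commonly implemented as L2 regularization (weight decay): adding lam * sum(W**2) to the loss adds 2 * lam * W to the gradient. A minimal sketch, with lam and lr as illustrative values:

```python
import numpy as np

lr, lam = 0.1, 1e-3                 # learning rate / L2 strength (illustrative)
W = np.random.default_rng(0).normal(size=(4, 2))
dW = np.zeros_like(W)               # stand-in for the gradient from the data loss

# total loss = data_loss + lam * sum(W**2), so the update also
# shrinks each weight toward zero ("weight decay")
W -= lr * (dW + 2 * lam * W)
```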
Universal Approximation Theorem
An MLP with a linear output layer and at least one hidden layer with a squashing activation function (e.g. sigmoid/tanh) can approximate any continuous function on a compact domain to arbitrary accuracy, provided the network is given enough hidden neurons
weight initialization
needs to be random to break the symmetry of the ANN
sample from normal distribution
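A sketch of symmetry-breaking initialization; the He-style fan-in scaling shown at the end is a common refinement that goes beyond what the card states:

```python
import numpy as np

rng = np.random.default_rng(0)
fan_in, fan_out = 128, 64           # layer sizes (illustrative)

# all-zero init would make every hidden neuron compute the same thing;
# random values break that symmetry
W = rng.normal(scale=0.01, size=(fan_in, fan_out))

# common refinement: scale the variance by layer width (He init, for ReLU)
W_he = rng.normal(scale=np.sqrt(2.0 / fan_in), size=(fan_in, fan_out))
```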
learning rate
high learning rate - faster, but can overshoot and miss the minimum
low learning rate - slower, but more reliably converges to a (local) minimum
mini batch gradient descent
compromise between stochastic (fast, high variance) and batch (slow, low variance) gradient descent
not one, not all training examples at a time, but some (4, 8, 16)
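A sketch of mini-batch gradient descent on a toy linear-regression problem; the data, batch size, and learning rate are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
true_w = np.array([1.0, -2.0, 0.5])
y = X @ true_w + 0.1 * rng.normal(size=100)

w, lr, batch_size = np.zeros(3), 0.1, 8    # batch of 8: "some", not 1, not all

for epoch in range(20):
    order = rng.permutation(len(X))        # reshuffle each epoch
    for start in range(0, len(X), batch_size):
        idx = order[start:start + batch_size]
        Xb, yb = X[idx], y[idx]
        grad = 2 * Xb.T @ (Xb @ w - yb) / len(idx)  # MSE gradient on the batch
        w -= lr * grad                     # one update per mini-batch

print(w)  # close to true_w
```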
training procedure
- split into training, validation and test set
- split the training set into mini-batches; update the weights after each mini-batch
- after all mini-batches are processed, one epoch is done; reshuffle and repeat for e.g. 3-5 epochs
- checkpoint every epoch; compare training vs validation loss to detect over/underfitting
- tune hyperparameters on validation set
- report final performance on the test set (e.g. with k-fold cross-validation)
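The whole procedure as a training-loop skeleton; `model.update` and `model.loss` are hypothetical helpers standing in for one gradient step and a loss evaluation:

```python
import numpy as np

def train(model, X, y, epochs=5, batch_size=16, seed=0):
    rng = np.random.default_rng(seed)
    # split into training and validation (test set held out elsewhere)
    idx = rng.permutation(len(X))
    split = int(0.8 * len(X))                    # 80/20 split (illustrative)
    train_idx, val_idx = idx[:split], idx[split:]

    for epoch in range(epochs):                  # e.g. 3-5 epochs
        order = rng.permutation(train_idx)       # reshuffle every epoch
        for start in range(0, len(order), batch_size):
            batch = order[start:start + batch_size]
            model.update(X[batch], y[batch])     # hypothetical: one gradient step

        # checkpoint: compare losses to catch over/underfitting early
        print(epoch,
              model.loss(X[train_idx], y[train_idx]),  # hypothetical helper
              model.loss(X[val_idx], y[val_idx]))
```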
CNN uses
image classification, object detection, object segmentation
CNN technology
apply convolutional (and pooling) layers to reduce the height/width of the feature map while increasing its depth => flatten into a 1-dimensional array => feed into a fully connected ANN
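The height/width shrinkage follows the standard conv output-size formula out = (in + 2*padding - kernel) // stride + 1; a sketch with illustrative layer settings:

```python
def conv_out(size, kernel, stride=1, padding=0):
    # standard formula for the output height/width of a conv layer
    return (size + 2 * padding - kernel) // stride + 1

h = w = 32          # e.g. a 32x32 RGB image
depth = 3
for kernel, stride, out_channels in [(3, 2, 16), (3, 2, 32), (3, 2, 64)]:
    h = conv_out(h, kernel, stride)
    w = conv_out(w, kernel, stride)
    depth = out_channels
    print(h, w, depth)   # height/width shrink, depth grows

flat = h * w * depth     # flatten to a 1-D array for the fully connected ANN
print(flat)
```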
encoder-decoder / autoencoder
CNN on levels
first layers - detect low-level features (vertical/horizontal lines)
deeper layers - detect higher-level concepts (parts of a face)
deepest layers - reconstruct the entire image, highlighting the features most important for classification (e.g. a moustache)
RNN
recurrent neural network - processes sequences by passing a hidden state from one time step to the next (text, speech, time series)
word2vec
learns dense vector embeddings of words so that words appearing in similar contexts get similar vectors (CBOW / skip-gram)