Module 4 Flashcards
1
Q
Gradient descent
A
Repeatedly update parameters a and b by taking small steps in the negative direction of the partial derivative
2
Q
Gradient
A
Vector of all partial derivatives
3
Q
Logistic regression
A
- pass the regression output through a logistic function
- for binary classification
4
Q
Perceptron
A
- algorithm for supervised binary classification
- activation function: threshold function
5
Q
Linear activation
A
Directly passes on the output of the linear layer
6
Q
Sigmoid activation
A
Compresses the output smoothly into the range [0,1]
7
Q
Tanh activation
A
Compresses the output into the range [-1,1]
8
Q
ReLU activation
A
Rectified linear unit
- linear in the positive part
- non-linear overall
9
Q
Softmax activation
A
Scales the inputs into a probability distribution