Introduction to Deep Learning Flashcards

1
Q

In a single perceptron (neuron), what is the formula for calculating a single output?

A

y = g(W0 + XW)
1. Dot product (XW)
2. Add bias (W0)
3. Apply non-linearity (g)
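
The three steps above can be sketched in a few lines of NumPy. The sigmoid used for g here is an assumed choice (any non-linearity works), and the input/weight values are illustrative:

```python
import numpy as np

def g(z):
    # Sigmoid non-linearity -- one common choice for g.
    return 1.0 / (1.0 + np.exp(-z))

def perceptron(x, w, w0):
    # 1. Dot product (XW), 2. add bias (W0), 3. apply non-linearity (g).
    return g(np.dot(x, w) + w0)

x = np.array([1.0, 2.0])
w = np.array([0.5, -0.25])
y = perceptron(x, w, w0=0.1)  # g(1*0.5 + 2*(-0.25) + 0.1) = g(0.1)
```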

2
Q

What are the 3 types of layers in a NN?

A

Input layer, hidden layers, output layer.
The input layer receives data, hidden layers process it, and the output layer produces the final predictions.

3
Q

How do we determine how well the model performs?

__ Function

A

Using a loss function.

4
Q

What are the 2 types of loss functions?

  1. Binary __ loss
  2. Mean __
A
  1. Binary cross entropy loss (Classification)
  2. Mean Squared Error (MSE) (Regression)
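Both losses can be sketched directly from their definitions. The example values are illustrative, and the clipping in BCE is an assumed safeguard against log(0):

```python
import numpy as np

def bce(y_true, y_pred, eps=1e-12):
    # Binary cross entropy: mean of -[y*log(p) + (1-y)*log(1-p)].
    p = np.clip(y_pred, eps, 1 - eps)  # avoid log(0)
    return float(np.mean(-(y_true * np.log(p) + (1 - y_true) * np.log(1 - p))))

def mse(y_true, y_pred):
    # Mean squared error: mean of (y - y_hat)^2.
    return float(np.mean((y_true - y_pred) ** 2))

bce_loss = bce(np.array([1.0, 0.0]), np.array([0.9, 0.2]))
mse_loss = mse(np.array([3.0, -1.0]), np.array([2.5, 0.0]))
```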
5
Q

What is the core problem of training a NN?

Finding the vector __. The __ determine the performance, because they…

A

Finding the weight vector (W). The weights determine the performance because they directly influence the output and thus the loss.

6
Q

What is backpropagation in a simple sense?

Backpropagation uses the __ rule to compute the gradient of the loss …

A

Backpropagation uses the chain rule to compute the gradient of the loss with respect to the weights (W). This allows us to update the weights in the direction that minimizes the loss.
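The chain rule can be verified on a tiny one-weight example. The sigmoid model, squared-error loss, and input values below are all illustrative assumptions; the analytic gradient is checked against a numerical (finite-difference) estimate:

```python
import math

# Tiny model: y = sigmoid(w*x + b), loss L = (y - t)^2.
def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def loss(w, x, b, t):
    return (sigmoid(w * x + b) - t) ** 2

def grad_w(w, x, b, t):
    # Chain rule: dL/dw = dL/dy * dy/dz * dz/dw
    #           = 2*(y - t) * y*(1 - y) * x   (sigmoid derivative is y*(1-y))
    y = sigmoid(w * x + b)
    return 2 * (y - t) * y * (1 - y) * x

w, x, b, t = 0.5, 2.0, -1.0, 1.0
analytic = grad_w(w, x, b, t)
eps = 1e-6
numeric = (loss(w + eps, x, b, t) - loss(w - eps, x, b, t)) / (2 * eps)
```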

7
Q

In summary, what is forward propagation?

A

Forward propagation (FP) refers to the computation of the outputs from the inputs in a NN.
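A minimal forward pass through one hidden layer and one output layer might look like the following sketch; the layer sizes, tanh activation, and random initialization are all illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

def forward(x, W1, b1, W2, b2):
    # Hidden layer: affine transform followed by a non-linearity.
    h = np.tanh(x @ W1 + b1)
    # Output layer: affine transform (no non-linearity here).
    return h @ W2 + b2

W1 = rng.normal(size=(3, 4)); b1 = np.zeros(4)
W2 = rng.normal(size=(4, 1)); b2 = np.zeros(1)
y = forward(np.ones(3), W1, b1, W2, b2)
```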

8
Q

What is the optimization method we can use during backpropagation?

__ (GD)

A

The method is gradient descent.

9
Q

How does GD work?

It sets the ___ __ (LR) …

A

It takes steps against the gradient of the loss, with the step size set by the learning rate (LR); adaptive optimizers adjust the LR to the loss landscape.
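A gradient descent loop in its simplest form can be sketched on a one-dimensional loss; the function f(w) = (w - 3)^2 and the LR value are illustrative choices:

```python
# Gradient descent on f(w) = (w - 3)^2, whose gradient is 2*(w - 3).
# The minimum is at w = 3.
w = 0.0
lr = 0.1  # learning rate (fixed here; adaptive methods would vary it)
for _ in range(100):
    grad = 2 * (w - 3)
    w -= lr * grad  # step against the gradient
```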

10
Q

What are 2 tips for training a NN?

  1. S…
  2. B…
A

Stochastic GD (SGD) and batching

11
Q

Batching is better than SGD. Why?

It uses ___ batch of data to compute.

A

Instead of using a single training point, batching uses a small batch of data to compute each gradient. The gradient estimate is more accurate, and training is faster (batches parallelize well on GPUs).
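Mini-batch gradient descent can be sketched on a toy 1-D regression problem; the data (y = 2x with no noise), batch size, and LR are all illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(42)

# Toy 1-D linear regression: learn w in y = w*x from data generated with w = 2.
X = rng.normal(size=200)
Y = 2.0 * X

w, lr, batch_size = 0.0, 0.1, 32
for _ in range(200):
    # Sample a small batch instead of a single training point.
    idx = rng.choice(len(X), size=batch_size, replace=False)
    xb, yb = X[idx], Y[idx]
    # Gradient of the squared-error loss, averaged over the batch.
    grad = np.mean(2 * (w * xb - yb) * xb)
    w -= lr * grad
```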

12
Q

How do we deal with overfitting in a NN?

Re___

A

Regularization

13
Q

What are the 2 regularization techniques?

  1. D…
  2. E… S…
A
  1. Dropout
  2. Early Stopping
14
Q

How does Dropout work?

  1. Randomly set some ___ to 0.
  2. Drop __% of __ in layers.
A
  • Randomly set some activations to 0.
  • Drop 50% of activations in each layer (a common choice).
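The two bullets above can be sketched as a dropout mask. The "inverted dropout" scaling by 1/(1-p), which keeps the expected activation unchanged, is an assumed (though standard) detail not stated on the card:

```python
import numpy as np

rng = np.random.default_rng(0)

def dropout(activations, p=0.5):
    # Randomly zero out a fraction p of activations (training time only).
    mask = rng.random(activations.shape) >= p
    # Inverted dropout: scale survivors by 1/(1-p) to preserve the mean.
    return activations * mask / (1 - p)

a = np.ones(10)
out = dropout(a, p=0.5)  # each entry is either 0.0 (dropped) or 2.0 (kept, scaled)
```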
15
Q

How does Early Stopping work?

Stop the model from ___ before ___

A

Stop the model from training before overfitting.
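
A minimal early-stopping loop can be sketched as follows; the validation-loss sequence and the `patience` value are illustrative assumptions:

```python
# Early stopping: halt when validation loss hasn't improved
# for `patience` consecutive epochs.
val_losses = [1.0, 0.8, 0.6, 0.5, 0.55, 0.6, 0.7]  # rises after epoch 3: overfitting
patience, best, wait, stop_epoch = 2, float("inf"), 0, None
for epoch, loss in enumerate(val_losses):
    if loss < best:
        best, wait = loss, 0  # improvement: record it and reset patience counter
    else:
        wait += 1
        if wait >= patience:
            stop_epoch = epoch  # stop before overfitting worsens
            break
```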
