Deep Learning Using SAS® Software Flashcards

1
Q

What are the three Deep Learning model variants?

A
  1. Deep fully connected neural networks (DNN)
  2. Convolutional neural networks (CNN)
  3. Recurrent neural networks (RNN)
2
Q

What other languages can run CASL?

A

Python, R, Java, and Lua

3
Q

What is Curriculum Learning?

A

slowly building up learning concepts, i.e., training on simpler examples before progressively harder ones; the shuffle action can be used to randomize the data

4
Q

What method of weight initialization is used in deep learning?

A

a normalized initialization (commonly known as Xavier or Glorot initialization) in which the variance of the hidden weights is a function of the amount of incoming information (fan-in) and outgoing information (fan-out)

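The normalized initialization on this card can be sketched in NumPy; this is an illustrative implementation, not SAS code, and the function name is my own:

```python
import numpy as np

def xavier_uniform(fan_in, fan_out, rng=np.random.default_rng(0)):
    """Xavier/Glorot uniform initialization: the weight variance is
    2 / (fan_in + fan_out), a function of incoming and outgoing units."""
    limit = np.sqrt(6.0 / (fan_in + fan_out))
    return rng.uniform(-limit, limit, size=(fan_in, fan_out))

# Weights for a layer with 256 inputs and 128 outputs
W = xavier_uniform(256, 128)
print(W.shape, round(W.var(), 5))  # variance close to 2 / (256 + 128)
```

Because Uniform(-a, a) has variance a²/3, choosing a = sqrt(6 / (fan_in + fan_out)) yields the desired variance.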
5
Q

What does RMSE stand for?

A

root mean square error

6
Q

What does MAPE stand for?

A

mean absolute percentage error

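The two error measures above can be computed directly; a minimal NumPy sketch (function names are my own):

```python
import numpy as np

def rmse(actual, predicted):
    """Root mean square error: square root of the mean squared residual."""
    actual, predicted = np.asarray(actual, float), np.asarray(predicted, float)
    return np.sqrt(np.mean((actual - predicted) ** 2))

def mape(actual, predicted):
    """Mean absolute percentage error (actual values must be nonzero)."""
    actual, predicted = np.asarray(actual, float), np.asarray(predicted, float)
    return 100.0 * np.mean(np.abs((actual - predicted) / actual))

y_true = [100, 200, 300]
y_pred = [110, 190, 330]
print(round(rmse(y_true, y_pred), 3))  # 19.148
print(round(mape(y_true, y_pred), 3))  # 8.333
```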
7
Q

What is regularization?

A

the process of introducing or removing information to stabilize an algorithm’s understanding of the data

8
Q

What does the dropout regularization method do?

A

Dropout adds noise to the learning process so that the model is more generalizable

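One common formulation of dropout ("inverted" dropout) can be sketched in NumPy; this is an illustrative version, not the SAS implementation:

```python
import numpy as np

def dropout(activations, rate, rng=np.random.default_rng(0)):
    """Inverted dropout: zero out units with probability `rate` and scale
    the survivors by 1/(1 - rate) so the expected activation is unchanged.
    The randomly zeroed units are the injected noise that makes the
    trained model more generalizable."""
    mask = rng.random(activations.shape) >= rate
    return activations * mask / (1.0 - rate)

h = np.ones((4, 8))            # a batch of hidden-layer activations
h_thin = dropout(h, rate=0.5)  # surviving units form a "thinned network"
print(h_thin)
```

Each call draws a new mask, so every training pass trains a different thinned network (see the next card).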
9
Q

How can you improve model generalization?

A

Dropout can improve model generalization

10
Q

What is a thinned network?

A

Each time that units are removed (using dropout), the resulting network is referred to as a thinned network

11
Q

What does the batch normalization regularization method do?

A

The batch normalization operation normalizes the data passed between layers in a neural network, preventing large input values to the combination function that can lead to overfitting of the data.
It normalizes the information back to the linear region of the sigmoid, which is a safe output region, and brings the weight values back to a familiar range.

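The core normalize-scale-shift step of batch normalization can be sketched in NumPy (omitting the running statistics used at inference time); names and shapes here are illustrative:

```python
import numpy as np

def batch_norm(x, gamma=1.0, beta=0.0, eps=1e-5):
    """Normalize each feature over the mini-batch to zero mean and unit
    variance, then apply the learned scale (gamma) and shift (beta).
    This keeps the values passed to the next layer in a familiar range."""
    mean = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mean) / np.sqrt(var + eps)
    return gamma * x_hat + beta

# One feature on a small scale, one on a large scale
batch = np.array([[1.0, 200.0], [3.0, 400.0], [5.0, 600.0]])
out = batch_norm(batch)
print(out.mean(axis=0), out.std(axis=0))  # ~0 and ~1 per feature
```

Note that both features end up on the same scale regardless of their raw magnitudes, which is why weight initialization matters less when batch normalization is used (next card).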
12
Q

Why do weight initializations have less impact on model performance if batch normalization is used?

A

batch normalization standardizes information that is passed between hidden layers

13
Q

When should you use GPUs instead of CPUs in Deep Learning?

A

The use of GPUs should be reserved for larger neural networks. The difference in performance between CPUs and GPUs is negligible in neural networks with a small number of parameters.

14
Q

Why are GPUs effective when modeling neural networks?

A

GPUs are designed to perform many operations in parallel

15
Q

What does Loss refer to in Neural Network output?

A

loss specifies the training error function value

16
Q

What does Validation Loss refer to in Neural Network output?

A

specifies the validation error function value

17
Q

What does Validation Error refer to in Neural Network output?

A

the validation misclassification rate

18
Q

What does Fit Error refer to in Neural Network output?

A

the misclassification rate for the training data

19
Q

What does a sudden increase in the L1 or L2 norm values indicate?

A

Overfitting (because the weights in the model must be getting really large to push those values up)
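The norms on this card are simple to monitor; a minimal NumPy sketch (the helper name is my own):

```python
import numpy as np

def weight_norms(weights):
    """L1 norm (sum of absolute values) and L2 norm (Euclidean length)
    of a flattened weight array; a sudden jump in either value during
    training suggests the weights are blowing up (overfitting)."""
    w = np.ravel(weights)
    return np.sum(np.abs(w)), np.sqrt(np.sum(w ** 2))

l1, l2 = weight_norms([[3.0, -4.0], [0.0, 0.0]])
print(l1, l2)  # 7.0 5.0
```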

20
Q

What does Adam optimization do if the signal-to-noise ratio is high?

A

adjusts the step size in order to take larger steps towards error minima

21
Q

What does Adam optimization do if the signal-to-noise ratio is low?

A

adjusts the step size to move more slowly
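The behavior described on the last two cards can be illustrated with a minimal Adam update; this is a generic sketch of the algorithm, not SAS-specific code:

```python
import numpy as np

def adam_step(grad, m, v, t, lr=0.001, b1=0.9, b2=0.999, eps=1e-8):
    """One Adam update. m and v are running estimates of the gradient's
    mean (signal) and uncentered variance (noise). When the signal-to-noise
    ratio m_hat / sqrt(v_hat) is high, the effective step approaches the
    full learning rate; when it is low, the update moves more slowly."""
    m = b1 * m + (1 - b1) * grad
    v = b2 * v + (1 - b2) * grad ** 2
    m_hat = m / (1 - b1 ** t)  # bias correction for the running mean
    v_hat = v / (1 - b2 ** t)  # bias correction for the running variance
    step = lr * m_hat / (np.sqrt(v_hat) + eps)
    return step, m, v

# Consistent gradients (high signal-to-noise): near-full-size steps
m = v = 0.0
for t in range(1, 101):
    step_hi, m, v = adam_step(1.0, m, v, t)

# Sign-flipping gradients (low signal-to-noise): much smaller steps
m = v = 0.0
for t in range(1, 101):
    step_lo, m, v = adam_step((-1.0) ** t, m, v, t)

print(step_hi, abs(step_lo))
```

With a constant gradient the step converges to the learning rate (0.001), while the alternating gradient keeps the running mean near zero, so the step stays far smaller despite identical gradient magnitudes.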