Deep Learning Using SAS® Software Flashcards
What are the three Deep Learning model variants?
- Deep fully connected neural networks (DNN)
- Convolutional neural networks (CNN)
- Recurrent neural networks (RNN)
What other languages can read CASL?
Python, R, Java, and Lua
What is Curriculum Learning?
Slowly building up learning concepts as training proceeds. Because the order of the data affects what is learned, the shuffle action can be used to randomize the data.
What method of weight initialization is used in deep learning?
a normalized initialization in which the variance of the hidden weights is a function of the amount of incoming information and outgoing information
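One widely used scheme that fits this description is Xavier (Glorot) uniform initialization. The card does not name the scheme, so treat the choice as an assumption. A minimal sketch, where `fan_in` and `fan_out` are hypothetical names for the number of incoming and outgoing connections of a layer:

```python
import math
import random

def xavier_uniform(fan_in, fan_out, seed=0):
    """Return a fan_in x fan_out weight matrix drawn from U(-limit, limit).

    With limit = sqrt(6 / (fan_in + fan_out)), each weight has variance
    2 / (fan_in + fan_out), so the variance depends on both the incoming
    and the outgoing connection counts.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    limit = math.sqrt(6.0 / (fan_in + fan_out))
    return [[rng.uniform(-limit, limit) for _ in range(fan_out)]
            for _ in range(fan_in)]

weights = xavier_uniform(fan_in=4, fan_out=3)
```

Wider layers get a smaller `limit`, which keeps the spread of the weighted sums roughly stable from layer to layer.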
What does RMSE stand for?
root mean square error
What does MAPE stand for?
mean absolute percentage error
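Both metrics can be computed directly from their definitions. The sketch below is illustrative only: the function names and sample values are made up for the example, and `mape` assumes that no actual value is zero.

```python
import math

def rmse(actual, predicted):
    """Root mean square error: square root of the mean squared difference."""
    n = len(actual)
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / n)

def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be nonzero)."""
    n = len(actual)
    return 100.0 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / n

# Hypothetical sample values, chosen only to exercise the two functions.
actual = [100.0, 200.0, 400.0]
predicted = [110.0, 180.0, 400.0]
rmse_value = rmse(actual, predicted)
mape_value = mape(actual, predicted)
```

RMSE is expressed in the units of the target, while MAPE is scale-free, which is why the two are often reported together.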
What is regularization?
the process of introducing or removing information to stabilize an algorithm’s understanding of the data
What does the dropout regularization method do?
Dropout adds noise to the learning process so that the model is more generalizable
How can you improve model generalization?
Dropout can improve model generalization
What is a thinned network?
Each time that units are removed using dropout, the resulting network is referred to as a thinned network.
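The idea of a thinned network can be sketched with a random mask over one layer's activations. This is a minimal illustration, not any particular library's implementation: the `dropout` helper, the sample activations, and the rescaling of surviving units ("inverted dropout") are assumptions made for the example, and `drop_prob` must be less than 1.

```python
import random

def dropout(activations, drop_prob, rng):
    """Zero each unit with probability drop_prob and rescale the survivors.

    Rescaling by 1 / (1 - drop_prob) is an assumption of this sketch; it
    keeps the expected value of each activation unchanged.
    """
    keep_prob = 1.0 - drop_prob
    mask = [1 if rng.random() < keep_prob else 0 for _ in activations]
    thinned = [a * m / keep_prob for a, m in zip(activations, mask)]
    return thinned, mask

rng = random.Random(0)
hidden = [0.5, -1.2, 0.8, 2.0, -0.3, 1.1]  # hypothetical hidden-layer outputs
for step in range(3):
    thinned, mask = dropout(hidden, drop_prob=0.5, rng=rng)
    print(step, mask)  # each mask corresponds to one thinned network
```

Because a fresh mask is drawn on every training step, the model is effectively trained as many different thinned networks that share weights.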
What does the batch normalization regularization method do?
The batch normalization operation normalizes data being passed between layers in a neural network to prevent large input values in the combination function, which can lead to overfitting of the data.
Batch normalization brings the information back to the linear region of the sigmoid, which is a safe output region.
Batch normalization brings the values passed between layers back to a familiar range.
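The standardizing step can be sketched for a single unit across one mini-batch. This is a simplified illustration under stated assumptions: `gamma` and `beta` stand for the scale and shift that are normally learned during training, `eps` is a small constant for numerical stability, and none of these names come from the cards.

```python
import math

def batch_norm(values, gamma=1.0, beta=0.0, eps=1e-5):
    """Standardize one unit's values over a mini-batch, then scale and shift."""
    n = len(values)
    mean = sum(values) / n
    var = sum((v - mean) ** 2 for v in values) / n  # population variance
    return [gamma * (v - mean) / math.sqrt(var + eps) + beta for v in values]

# Hypothetical large inputs to a combination function.
batch = [250.0, 300.0, 150.0, 100.0]
normalized = batch_norm(batch)
```

With the default `gamma` and `beta`, the outputs have mean close to 0 and variance close to 1, so large inputs are pulled back toward the region where a sigmoid is approximately linear.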
Why do weight initializations have less impact on model performance if batch normalization is used?
Batch normalization standardizes the information that is passed between hidden layers, so the scale of that information depends less on the initial weight values.
When should you use GPUs instead of CPUs in Deep Learning?
The use of GPUs should be reserved for larger neural networks. The difference in performance between CPUs and GPUs is negligible in neural networks with a small number of parameters.
Why are GPUs effective when modeling neural networks?
GPUs are designed to perform many operations in parallel
What does Loss refer to in Neural Network output?
Loss is the value of the training error function.