Machine Learning with Viya® 3.4® Lesson 4: Neural Networks Flashcards
What is the neural network term for a parameter estimate or slope?
Weight estimate
What is the mathematical transformation that is applied to a linear combination of the input values in a MLP or Neural Network referred to as?
activation function
Which neural network architecture is best for modeling nonstationary data?
A skip-layer perceptron is the best architecture to use with nonstationary data.
Which hyperparameters or set of hyperparameters in Model Studio are used to control weight decay?
the L1 and L2 hyperparameters control weight decay
What is the purpose of the Minibatch size option in a Neural Network mode?
It defines the number of training observations to calculate the model error and update the model coefficients.
Can neural networks can select inputs in a similar fashion as a tree-based model?
neural network cannot select inputs like a tree-based model
To which optimization method does the minibatch option in a Neural Network apply?
The Minibatch size option defines the number of training observations to use in the SGD instead of using all training observations.
Why is a neural network referred to as universal approximators?
Neural networks are called universal approximators because they can model any input-output relationship, no matter how complex.
What terms refers to a parameter estimate or slope that is associated with an input in a neural network?
Weight estimate is the neural network term for a parameter estimate or slope.
Which activation function is commonly used in the target layer when modeling a binary target?
The logistic function is the target layer activation function (or target layer link function) that is typically used with a binary target.
What is the limitation of modeling with a neural network?
Neural networks are generally considered to be “black boxes.” Because they are minimally interpretable, at best, neural networks are most useful in pure prediction scenarios.
What is the value of a standardized variable called?
Z-score or standard score
What is standardization?
rescaling your data to have a mean of 0 and a standard deviation of 1
When is the identity target activation function appropriate when modeling a neural network?
when the target error function is normally distributed
What is the neural network term for an intercept estimate?
Bias estimate
When early stopping is used to build a neural network model, which data partition does Model Studio use to select the final model?
The validate partition
What may be affected when multi-scaled variables are used in a multivariate analysis?
Model stability and parameter estimate precision are influenced during multivariate analysis when multi-scaled variables are used (i.e., a variable that ranges between 0 and 100 will outweigh a variable that ranges between 0 and 1.)
What occurs during a neural network’s learning process?
Numerical optimization is an important part of the learning process.
Why might you apply weight decay when building a neural network model?
Weight decay is one of two main methods used to avoid overfitting when building a neural network model
What is the partition validation method?
With partition, you specify proportions to use for randomly assigning observations to each role.
The Neural Network node can use weight decay to avoid overfitting. How are the L1 and L2 regularizations applied?
L1 penalizes the absolute value of the weights.
L2 penalizes the squared weights.
What is an activation function?
a mathematical transformation that is applied to a linear combination of the input values.