Formulas Flashcards
What is the formula for mean squared error?
What is the sigmoid function?
What is the formula for the hyperbolic tangent function?
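For reference on the three cards above, here is a minimal Python sketch. It assumes the standard textbook definitions; the function names are illustrative, and some courses define the mean squared error with an extra factor of 1/2, so check against your own notes.

```python
import math

def mean_squared_error(targets, predictions):
    """MSE = (1/N) * sum_i (y_i - yhat_i)^2 (assumed convention, no 1/2 factor)."""
    n = len(targets)
    return sum((y - p) ** 2 for y, p in zip(targets, predictions)) / n

def sigmoid(x):
    """sigma(x) = 1 / (1 + exp(-x)). Not guarded against overflow for very negative x."""
    return 1.0 / (1.0 + math.exp(-x))

def tanh(x):
    """tanh(x) = (e^x - e^-x) / (e^x + e^-x). Not guarded against overflow for large |x|."""
    return (math.exp(x) - math.exp(-x)) / (math.exp(x) + math.exp(-x))
```

A useful identity to remember alongside these cards is tanh(x) = 2 * sigmoid(2x) - 1, which the sketch satisfies up to floating-point error.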
What are the two equations used to update the weights with momentum?
What is the equation for the running average of the gradients used in Adam?
What is the equation for the squared gradients used in Adam?
How is each parameter updated when using Adam?
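The four optimizer cards above can be sketched for a single scalar parameter as follows. This is one common convention, not necessarily the one used in your course: the momentum step here folds the learning rate into the velocity, and the Adam step includes the usual bias correction. All names and default hyperparameters are assumptions made for illustration.

```python
import math

def momentum_step(w, velocity, grad, lr=0.01, beta=0.9):
    """Momentum: v <- beta * v - lr * g, then w <- w + v (assumed convention)."""
    velocity = beta * velocity - lr * grad
    w = w + velocity
    return w, velocity

def adam_step(theta, m, v, grad, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update at step t (t starts at 1)."""
    # Running average of the gradients (first moment).
    m = beta1 * m + (1 - beta1) * grad
    # Running average of the squared gradients (second moment).
    v = beta2 * v + (1 - beta2) * grad ** 2
    # Bias correction, since m and v are initialised at zero.
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    # Parameter update.
    theta = theta - lr * m_hat / (math.sqrt(v_hat) + eps)
    return theta, m, v
```

With zero-initialised moments, the first Adam step moves the parameter by roughly the learning rate regardless of the gradient's magnitude, which is a quick sanity check for the bias-correction terms.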
What is Bayes' rule?
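As a small worked illustration of the card above, the sketch below applies P(A|B) = P(B|A) P(A) / P(B), with P(B) expanded by the law of total probability. The function name and the diagnostic-test numbers in the usage note are hypothetical.

```python
def bayes_posterior(prior, likelihood, likelihood_given_not):
    """Return P(A|B) given P(A), P(B|A), and P(B|not A)."""
    # Evidence P(B) via the law of total probability.
    evidence = likelihood * prior + likelihood_given_not * (1.0 - prior)
    return likelihood * prior / evidence
```

For example, `bayes_posterior(0.01, 0.9, 0.05)` models a condition with a 1% prior, a test that detects it 90% of the time, and a 5% false-positive rate; the posterior comes out to 2/13, a little over 15%.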
What is the formula for the entropy of a discrete probability distribution?
What is the formula for the KL-divergence between two discrete probability distributions?
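The two discrete-distribution cards above can be checked numerically with a sketch like the following. It assumes distributions are given as lists of probabilities over the same outcomes, uses the convention 0 * log 0 = 0, and assumes q_i > 0 wherever p_i > 0; the function names are illustrative.

```python
import math

def entropy(p, base=math.e):
    """H(p) = -sum_i p_i * log(p_i); terms with p_i = 0 contribute nothing."""
    return -sum(pi * math.log(pi, base) for pi in p if pi > 0)

def kl_divergence(p, q):
    """KL(p || q) = sum_i p_i * ln(p_i / q_i), in nats.

    Assumes q_i > 0 wherever p_i > 0; otherwise the divergence is infinite
    and this sketch raises ZeroDivisionError.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
```

Note that the KL-divergence is not symmetric: `kl_divergence(p, q)` and `kl_divergence(q, p)` generally differ.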
What is the formula for the entropy of a continuous probability distribution?
What is the formula for the KL-divergence between two continuous probability distributions?
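For the two continuous cards above, the standard definitions replace the sums of the discrete case with integrals over densities. The symbols p and q below are assumed to denote probability density functions on the same space.

```latex
% Differential entropy of a density p
h(p) = -\int p(x) \, \ln p(x) \, dx

% KL-divergence from q to p
D_{\mathrm{KL}}(p \,\|\, q) = \int p(x) \, \ln \frac{p(x)}{q(x)} \, dx
```

Unlike discrete entropy, differential entropy can be negative.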
What is the entropy of a univariate Gaussian distribution?
What is the entropy of a d-dimensional Gaussian distribution?
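For the two Gaussian entropy cards above, the standard closed forms are given below, in nats. The notation is assumed: \sigma^2 is the variance of the univariate Gaussian and \Sigma is the covariance matrix of the d-dimensional one.

```latex
% Univariate Gaussian N(\mu, \sigma^2)
h = \tfrac{1}{2} \ln\!\left( 2 \pi e \, \sigma^2 \right)

% d-dimensional Gaussian N(\mu, \Sigma)
h = \tfrac{d}{2} \ln(2 \pi e) + \tfrac{1}{2} \ln \det \Sigma
  = \tfrac{1}{2} \ln\!\left( (2 \pi e)^d \det \Sigma \right)
```

In both cases the entropy depends only on the spread (variance or covariance), not on the mean.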
What is the KL-divergence between two d-dimensional Gaussian distributions?
What is the Wasserstein distance between two multivariate Gaussian distributions?
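For the two cards above, the standard closed forms are given below. The notation is assumed: the two Gaussians are N(\mu_1, \Sigma_1) and N(\mu_2, \Sigma_2) in d dimensions, and the Wasserstein card is taken to mean the 2-Wasserstein distance, which is the variant with a well-known closed form for Gaussians.

```latex
% KL-divergence from N(\mu_2, \Sigma_2) to N(\mu_1, \Sigma_1)
D_{\mathrm{KL}}\!\left( \mathcal{N}_1 \,\|\, \mathcal{N}_2 \right)
  = \tfrac{1}{2} \left[
      \operatorname{tr}\!\left( \Sigma_2^{-1} \Sigma_1 \right)
      + (\mu_2 - \mu_1)^{\top} \Sigma_2^{-1} (\mu_2 - \mu_1)
      - d
      + \ln \frac{\det \Sigma_2}{\det \Sigma_1}
    \right]

% Squared 2-Wasserstein distance
W_2^2\!\left( \mathcal{N}_1, \mathcal{N}_2 \right)
  = \lVert \mu_1 - \mu_2 \rVert_2^2
  + \operatorname{tr}\!\left(
      \Sigma_1 + \Sigma_2
      - 2 \left( \Sigma_2^{1/2} \Sigma_1 \Sigma_2^{1/2} \right)^{1/2}
    \right)
```

Both expressions are zero when the two Gaussians have the same mean and covariance.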
What is the cross-entropy error for a binary classification task?
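A minimal sketch of the binary cross-entropy from the card above, assuming targets in {0, 1}, predicted probabilities in [0, 1], and averaging over the N examples (some texts sum instead of averaging). The function name and the clipping constant `eps` are illustrative choices, added only to avoid log(0).

```python
import math

def binary_cross_entropy(targets, probs, eps=1e-12):
    """E = -(1/N) * sum_i [ y_i * ln(p_i) + (1 - y_i) * ln(1 - p_i) ]."""
    total = 0.0
    for y, p in zip(targets, probs):
        # Clip the prediction away from 0 and 1 so the logarithms stay finite.
        p = min(max(p, eps), 1.0 - eps)
        total += y * math.log(p) + (1 - y) * math.log(1 - p)
    return -total / len(targets)
```

A prediction of 0.5 for every example gives an error of ln 2, which is a handy baseline: a useful classifier should score below it.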