Chapter 9 Gradient Descent For Machine Learning Flashcards
HOW DOES PLOTTING COST VS TIME HELP US EVALUATE THE ALGORITHM? P41
A well-performing gradient descent run decreases the error (cost) in each iteration. If the cost does not decrease, try reducing the learning rate.
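As a rough illustration of this check, the sketch below runs batch gradient descent on a simple linear regression and records the cost after every iteration. The tiny dataset, the function names (`gradient_descent_costs`, `is_decreasing`) and both learning rates are assumptions made up for this example, not values from the book.

```python
def gradient_descent_costs(xs, ys, learning_rate, iterations):
    """Fit y = b0 + b1*x by batch gradient descent.

    Returns the list of costs (half mean squared error), one per iteration.
    """
    b0, b1 = 0.0, 0.0
    n = len(xs)
    costs = []
    for _ in range(iterations):
        # Prediction errors with the current coefficients.
        errors = [(b0 + b1 * x) - y for x, y in zip(xs, ys)]
        # Gradient of the cost with respect to each coefficient.
        grad_b0 = sum(errors) / n
        grad_b1 = sum(e * x for e, x in zip(errors, xs)) / n
        b0 -= learning_rate * grad_b0
        b1 -= learning_rate * grad_b1
        # Cost after the update; this is the value you would plot.
        cost = sum(((b0 + b1 * x) - y) ** 2 for x, y in zip(xs, ys)) / (2 * n)
        costs.append(cost)
    return costs


def is_decreasing(costs):
    """True if the cost never rises from one iteration to the next."""
    return all(later <= earlier for earlier, later in zip(costs, costs[1:]))


# Hypothetical toy data, chosen only for illustration.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [1.0, 3.0, 3.0, 2.0, 5.0]

good = gradient_descent_costs(xs, ys, learning_rate=0.01, iterations=50)
bad = gradient_descent_costs(xs, ys, learning_rate=0.5, iterations=50)

print("small learning rate, cost decreasing:", is_decreasing(good))
print("large learning rate, cost decreasing:", is_decreasing(bad))
```

Plotting `good` and `bad` against the iteration number with any plotting library gives the cost-over-time curve the card describes. On this toy data the run with the larger learning rate produces a rising cost, which is the signal to reduce it.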
HOW MANY PASSES ARE NEEDED FOR STOCHASTIC GRADIENT DESCENT? P41
SGD often doesn’t need more than 1-10 passes through the training set to converge on good or good-enough coefficients.
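A minimal sketch of what a "pass" means in practice: each epoch below visits every training example once and updates the coefficients after each example. The dataset, the learning rate of 0.01, the fixed visiting order and the function name `sgd_linear_regression` are assumptions for illustration. In practice the examples are usually shuffled before each pass.

```python
import math


def sgd_linear_regression(xs, ys, learning_rate, epochs):
    """Fit y = b0 + b1*x with stochastic gradient descent.

    Returns the coefficients and the training RMSE measured after each epoch.
    """
    b0, b1 = 0.0, 0.0
    rmse_per_epoch = []
    for _ in range(epochs):
        # One pass (epoch): update after every single training example.
        for x, y in zip(xs, ys):
            error = (b0 + b1 * x) - y
            b0 -= learning_rate * error
            b1 -= learning_rate * error * x
        # Measure how well the current coefficients fit the training set.
        squared = [((b0 + b1 * x) - y) ** 2 for x, y in zip(xs, ys)]
        rmse_per_epoch.append(math.sqrt(sum(squared) / len(xs)))
    return b0, b1, rmse_per_epoch


# Hypothetical toy data, chosen only for illustration.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [1.0, 3.0, 3.0, 2.0, 5.0]

b0, b1, rmse_per_epoch = sgd_linear_regression(xs, ys, learning_rate=0.01, epochs=10)
for epoch, rmse in enumerate(rmse_per_epoch, start=1):
    print(f"epoch {epoch}: RMSE = {rmse:.3f}")
```

Watching the per-epoch RMSE level off is one simple way to judge when additional passes have stopped helping.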
IS LINEAR REGRESSION SENSITIVE TO WHETHER THE VARIABLES HAVE A GAUSSIAN DISTRIBUTION? P47
Yes. Linear regression will make more reliable predictions if your input and output variables have a Gaussian distribution. You may get some benefit from using transforms (e.g. log or Box-Cox) on your variables to make their distribution look more Gaussian.
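To make the effect of such a transform concrete, the sketch below compares the skewness of a right-skewed variable before and after a log transform. Skewness is used here only as one convenient indicator of how far a distribution is from the symmetric Gaussian shape. The sample values and the helper name `skewness` are assumptions for illustration, and the log transform requires strictly positive values.

```python
import math


def skewness(values):
    """Sample skewness: third central moment over the cubed standard deviation.

    A symmetric distribution such as the Gaussian has skewness near 0.
    """
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Hypothetical right-skewed variable (a few very large values).
raw = [1.0, 2.0, 2.0, 3.0, 4.0, 6.0, 9.0, 15.0, 30.0, 80.0]

# Log transform; only valid because every value is positive.
logged = [math.log(v) for v in raw]

raw_skew = skewness(raw)
log_skew = skewness(logged)
print(f"skewness before transform: {raw_skew:.2f}")
print(f"skewness after log transform: {log_skew:.2f}")
```

A skewness closer to zero after the transform suggests a more symmetric, more Gaussian-looking variable, though it does not by itself prove normality.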