Deep Learning Flashcards

Question

0301 Specifying a model

Answer 1

Now you'll get to work with your first model in Keras, and will immediately be able to run more complex neural network models on larger datasets compared to the first two chapters. To start, you'll take the skeleton of a neural network and add a hidden layer and an output layer. You'll then fit that model and see Keras do the optimization so your model continually gets better. As a start, you'll predict workers wages based on characteristics like their industry, education and level of experience. You can find the dataset in a pandas dataframe called df. For convenience, everything in df except for the target has been converted to a NumPy matrix called predictors. The target, wage_per_hour, is available as a NumPy matrix called target. For all exercises in this chapter, we've imported the Sequential model constructor, the Dense layer constructor, and pandas.

Answer 2

You're now going to compile the model you specified earlier. To compile the model, you need to specify the optimizer and loss function to use. In the video, Dan mentioned that the Adam optimizer is an excellent choice. You can read more about it as well as other keras optimizers here, and if you are really curious to learn more, you can read the original paper that introduced the Adam optimizer. In this exercise, you'll use the Adam optimizer and the mean squared error loss function. Go for it!

Answer 3

You're at the most fun part. You'll now fit the model. Recall that the data to be used as predictive features is loaded in a NumPy matrix called predictors and the data to be predicted is stored in a NumPy matrix called target. Your model is pre-written and it has been compiled with the code from the previous exercise.

Answer 4

df.describe() total number of entries in the data. The maximum age in the data is 80.

Answer 5

You'll now create a classification model using the titanic dataset, which has been pre-loaded into a DataFrame called df. You'll take information about the passengers and predict which ones survived. The predictive variables are stored in a NumPy array predictors. The target to predict is in df.survived, though you'll have to manipulate it for keras. The number of predictive features is stored in n_cols. Here, you'll use the 'sgd' optimizer, which stands for Stochastic Gradient Descent. You'll learn more about this in the next chapter!

Answer 6

https://en.wikipedia.org/wiki/Stochastic_gradient_descent

Answer 7

The trained network from your previous coding exercise is now stored as model. New data to make predictions is stored in a NumPy array as pred_data. Use model to make predictions on your new data. In this exercise, your predictions will be probabilities, which is the most common way for data scientists to communicate their predictions to colleagues.

Answer 8

Learning rate too low. Learning rate too high. Poor choice of activation function.

Answer 9

It's time to get your hands dirty with optimization. You'll now try optimizing a model at a very low learning rate, a very high learning rate, and a "just right" learning rate. You'll want to look at the results after running this exercise, remembering that a low value for the loss function is good. For these exercises, we've pre-loaded the predictors and target values from your previous classification models (predicting who would survive on the Titanic). You'll want the optimization to start from scratch every time you change the learning rate, to give a fair comparison of how each learning rate did in your results. So we have created a function get_new_model() that creates an unoptimized model to optimize.

Answer 10

Now it's your turn to monitor model accuracy with a validation data set. A model definition has been provided as model. Your job is to add the code to compile it and then fit it. You'll check the validation score in each epoch.

Answer 11

https://keras.io/optimizers/

Answer 12

Now that you know how to monitor your model performance throughout optimization, you can use early stopping to stop optimization when it isn't helping any more. Since the optimization stops automatically when it isn't helping, you can also set a high value for epochs in your call to .fit(), as Dan showed in the video. The model you'll optimize has been specified as model. As before, the data is pre-loaded as predictors and target. optimization will automatically stop when it is no longer helpful, it is okay to specify the maximum number of epochs as 30 rather than using the default of 10 that you've used so far. Here, it seems like the optimization stopped after 7 epochs.

Answer 13

Now you know everything you need to begin experimenting with different models! A model called model_1 has been pre-loaded. You can see a summary of this model printed in the IPython Shell. This is a relatively small network, with only 10 units in each hidden layer. In this exercise you'll create a new model called model_2 which is similar to model_1, except it has 100 units in each hidden layer. After you create model_2, both models will be fitted, and a graph showing both models loss score at each epoch will be shown. We added the argument verbose=False in the fitting commands to print out fewer updates, since you will look at these graphically instead of as text. Because you are fitting two models, it will take a moment to see the outputs after you hit run, so be patient. The blue model is the one you made, the red is the original model. Your model had a lower loss value, so it is the better model. Nice job!

Answer 14

You've seen how to experiment with wider networks. In this exercise, you'll try a deeper network (more hidden layers). Once again, you have a baseline model called model_1 as a starting point. It has 1 hidden layer, with 50 units. You can see a summary of that model's structure printed out. You will create a similar network with 3 hidden layers (still keeping 50 units in each layer). This will again take a moment to fit both models, so you'll need to wait a few seconds to see the results after you run your code.

Answer 15

Increasing the number of units in each hidden layer would be a good next step to try achieving even better performance.

Answer 16

https://www.datacamp.com/community/tutorials/deep-learning-jupyter-aws

Answer 17

https: //www.datacamp.com/courses/advanced-deep-learning-with-keras-in-python https: //www.datacamp.com/courses/convolutional-neural-networks-for-image-processing https://www.datacamp.com/tracks/machine-learning-with-python

Answer 18

You've reached the final exercise of the course - you now know everything you need to build an accurate model to recognize handwritten digits! We've already done the basic manipulation of the MNIST dataset shown in the video, so you have X and y loaded and ready to model with. Sequential and Dense from keras are also pre-imported. To add an extra challenge, we've loaded only 2500 images, rather than 60000 which you will see in some published results. Deep learning models perform better with more data, however, they also take longer to train, especially when they start becoming more complex. If you have a computer with a CUDA compatible GPU, you can take advantage of it to improve computation time. If you don't have a GPU, no problem! You can set up a deep learning environment in the cloud that can run your models on a GPU. Here is a blog post by Dan that explains how to do this - check it out after completing this exercise! It is a great next step as you continue your deep learning journey. Ready to take your deep learning to the next level? Check out Advanced Deep Learning with Keras in Python to see how the Keras functional API lets you build domain knowledge to solve new types of problems. Once you know how to use the functional API, take a look at "Convolutional Neural Networks for Image Processing" to learn image-specific applications of Keras.

Deep Learning Flashcards

Deep Learning (42 cards)