All Flashcards
What is TensorFlow?
TensorFlow is an open-source, high-performance library for numerical computation of any kind, not just machine learning.
It is a scalable and multi-platform programming interface for implementing and running machine learning algorithms, including convenience wrappers for deep learning.
What is a tensor?
It is an N-dimensional array.
What is tf.GradientTape used for?
TensorFlow records all operations executed inside the context of a tf.GradientTape onto a tape, then uses that tape to compute gradients via reverse-mode automatic differentiation.
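A minimal sketch of recording a computation on the tape and asking it for a gradient:

import tensorflow as tf

x = tf.Variable(3.0)
with tf.GradientTape() as tape:
    y = x * x  # operations inside the context are recorded onto the tape
dy_dx = tape.gradient(y, x)  # the tape is replayed backwards to compute dy/dx = 2x
print(dy_dx.numpy())  # 6.0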
How does TensorFlow represent numeric computations?
Using a Directed Acyclic Graph (or DAG)
Which API is used to build performant, complex input pipelines from simple, reusable pieces that feed your model's training or evaluation loops?
tf.data.Dataset
What is the tf.data API?
- The tf.data API introduces the tf.data.Dataset abstraction, which represents a sequence of elements in which each element consists of one or more components. For example, in an image pipeline an element might be a single training example with a pair of tensor components representing the image and its label.
- The Dataset API helps you create input functions for your model that load data progressively, throttling it.
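A minimal sketch of a Dataset pipeline built from simple, reusable pieces (the in-memory tensors are hypothetical stand-ins for a real data source):

import tensorflow as tf

features = tf.random.uniform((1000, 10))
labels = tf.random.uniform((1000,), maxval=2, dtype=tf.int32)

dataset = (
    tf.data.Dataset.from_tensor_slices((features, labels))  # each element: (feature_row, label)
    .shuffle(buffer_size=1000)
    .batch(32)
)

for batch_features, batch_labels in dataset.take(1):
    print(batch_features.shape, batch_labels.shape)  # (32, 10) (32,)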
Why are there two functions for mapping, map and flat_map?
map performs a simple one-to-one transformation, while flat_map performs a one-to-many transformation and flattens the results back into a single dataset (see the sketch below).
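A small illustration of the difference on a made-up dataset:

import tensorflow as tf

ds = tf.data.Dataset.from_tensor_slices([1, 2, 3])

# map: exactly one output element per input element.
doubled = ds.map(lambda x: x * 2)
print(list(doubled.as_numpy_iterator()))  # [2, 4, 6]

# flat_map: each input element yields a whole Dataset, which is flattened into one stream.
repeated = ds.flat_map(lambda x: tf.data.Dataset.from_tensors(x).repeat(2))
print(list(repeated.as_numpy_iterator()))  # [1, 1, 2, 2, 3, 3]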
What does prefetching allow you to do?
Prefetching allows subsequent batches to be prepared as soon as the previous batches have been sent off for computation.
How can you increase processing speed in TensorFlow?
By combining prefetching with multi-threaded loading and preprocessing, you can achieve very good performance by making sure each of your GPUs or CPUs is constantly busy.
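A sketch of both techniques together via tf.data.AUTOTUNE (the source tensors and preprocess function are hypothetical):

import tensorflow as tf

images = tf.random.uniform((256, 28, 28), maxval=255, dtype=tf.int32)
labels = tf.random.uniform((256,), maxval=10, dtype=tf.int32)

def preprocess(image, label):
    # Hypothetical per-example preprocessing (cast and scale to [0, 1]).
    return tf.cast(image, tf.float32) / 255.0, label

dataset = (
    tf.data.Dataset.from_tensor_slices((images, labels))
    .map(preprocess, num_parallel_calls=tf.data.AUTOTUNE)  # multi-threaded preprocessing
    .batch(32)
    .prefetch(tf.data.AUTOTUNE)  # prepare upcoming batches while the accelerator is busy
)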
How are categorical columns represented in TensorFlow?
Categorical columns are represented in TensorFlow as sparse tensors.
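A small illustration of the idea: a one-hot encoded category stored as a tf.SparseTensor, which keeps only the non-zero entry:

import tensorflow as tf

# Category index 2 out of a 5-category vocabulary, one-hot encoded but stored sparsely.
one_hot_sparse = tf.sparse.SparseTensor(indices=[[0, 2]], values=[1.0], dense_shape=[1, 5])
print(tf.sparse.to_dense(one_hot_sparse).numpy())  # [[0. 0. 1. 0. 0.]]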
What are embedding vectors used for?
As the number of categories of a feature grows large, it becomes infeasible to train a neural network using one-hot encodings. Use an embedding column to overcome this limitation.
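A minimal sketch of the idea using the Keras Embedding layer (rather than the legacy feature-column API); the category count and dimensions are made up:

import tensorflow as tf

num_categories = 100_000
embedding_dim = 16  # each category is mapped to a dense 16-dimensional vector

embedding = tf.keras.layers.Embedding(input_dim=num_categories, output_dim=embedding_dim)

category_ids = tf.constant([[3], [42_517], [99_999]])  # integer-encoded categories
dense_vectors = embedding(category_ids)
print(dense_vectors.shape)  # (3, 1, 16) instead of a (3, 1, 100000) one-hot tensor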
What are the purposes of embedding vectors?
- finding nearest neighbors in the embedding space
- as input into a machine learning model for a supervised task
- for visualization of concepts and relations between categories (can use TensorBoard)
What is the Keras preprocessing API for?
- Combined with TensorFlow, the Keras preprocessing layers API allows TensorFlow developers to build Keras-native input processing pipelines.
- With Keras preprocessing layers, you can build and export models that are truly end-to-end.
List available preprocessing layers in Keras.
The available preprocessing layers include:
- text preprocessing
- numerical features preprocessing
- categorical features preprocessing
- image preprocessing
- image data augmentation
Which Keras preprocessing layers are used for categorical values?
- tf.keras.layers.CategoryEncoding
- tf.keras.layers.Hashing
- tf.keras.layers.StringLookup
- tf.keras.layers.IntegerLookup
What does tf.keras.layers.IntegerLookup do?
It turns integer categorical values into an encoded representation that can be read by an Embedding layer or a Dense layer.
What does tf.keras.layers.StringLookup do?
It turns string categorical values into an encoded representation that can be read by an Embedding layer or a Dense layer.
What does tf.keras.layers.Hashing do?
It performs categorical feature hashing, also known as “the hashing trick.”
What does tf.keras.layers.CategoryEncoding do?
It turns integer categorical features into one-hot, multi-hot, or count dense representations.
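A minimal sketch combining two of the layers above, StringLookup and CategoryEncoding, on a made-up color feature:

import tensorflow as tf

colors = tf.constant([["red"], ["green"], ["blue"], ["green"]])

# StringLookup: map strings to integer indices (index 0 is reserved for out-of-vocabulary values).
lookup = tf.keras.layers.StringLookup(vocabulary=["red", "green", "blue"])
indices = lookup(colors)

# CategoryEncoding: turn the integer indices into one-hot vectors.
encoder = tf.keras.layers.CategoryEncoding(
    num_tokens=lookup.vocabulary_size(), output_mode="one_hot"
)
print(encoder(indices).numpy())  # one row per example, one column per token (including OOV)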
Which preprocessing layers have an internal state that is computed from the data?
- TextVectorization (which holds a map between string tokens and integer indices)
- StringLookup and IntegerLookup (which hold a mapping between input values and integer indices)
- Normalization (which holds the mean and standard deviation of the features)
- Discretization (which holds information about the value bucket boundaries)
Note
These layers are non-trainable.
What does it mean that these layers are non-trainable?
Their state is not set during training. It must be set before training, either by initializing them from a precomputed constant or by adapting them on data.
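A minimal sketch of setting a Normalization layer's state by adapting it on data before training:

import tensorflow as tf

normalizer = tf.keras.layers.Normalization()
data = tf.constant([[1.0], [2.0], [3.0], [4.0]])
normalizer.adapt(data)  # computes the mean and variance from the data; not learned during training
print(normalizer(data).numpy())  # standardized values with mean ~0 and standard deviation ~1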