All Flashcards

1
Q

List stages of experimentation and prototyping.

A
  • problem refinement,
  • data selection,
  • data exploration,
  • feature engineering,
  • model prototyping, which covers algorithm selection, model training, hyperparameter tuning, and model evaluation.
2
Q

What solutions are available for experimentation?

A

a low-code or no-code solution

3
Q

What if this is a case of one-off development with no need to develop a retraining pipeline?

A

The validated model and its associated metadata and artifacts are registered with the model registry.

4
Q

What is referred to as training operationalization?

A

If the model needs to be retrained repeatedly in the future, an automated training pipeline is also developed; building this pipeline is what training operationalization refers to.

5
Q

What happens when the model is deployed to its target environment as a service?

A

It serves predictions to various consumers in the following forms:
- online inference, which is real time, served as a REST API;
- streaming inference, which is near real time, such as in an event-processing pipeline;
- batch inference, which is offline and usually integrated with your ETL processes;
- embedded inference, which runs on an embedded system.

6
Q

From which sources can Data Catalog catalog data assets?

A
  • BigQuery datasets, tables, and views;
  • Pub/Sub topics;
  • Dataproc Metastore services, databases, and tables;
  • non-GCP data assets: Hive, Oracle, SQL Server, Teradata, Redshift, MySQL, PostgreSQL, Looker, Tableau.
7
Q

What is Dataplex for?

A

Dataplex’s intelligent data fabric enables organizations to centrally manage, monitor, and govern their data across data lakes, data warehouses, and data marts with consistent controls, thus providing access to trusted data and empowering analytics at scale.

8
Q

List advantages of Dataplex.

A
  • Gives you the freedom to store your data wherever you want at the right price and performance.
  • Lets you choose the best analytics tools for the job, including Google Cloud and open-source analytics technologies such as Apache Spark and Presto.
  • Lets you enforce consistent controls across your data to ensure unified security and governance.
  • Provides built-in data intelligence that uses Google’s best-in-class AI/ML capabilities to automate much of the manual toil around data management and give you access to higher-quality data.
9
Q

What is one of the core tenets of Dataplex?

A

Letting you organize and manage your data in a way that makes sense for your business without data movement or duplication.

10
Q

Dataplex provides built-in one-click templates for common data-management tasks.

A

True

11
Q

What is one of the biggest differentiators for Dataplex?

A

Its data-intelligence capabilities, which use Google’s best-in-class AI/ML technologies.

12
Q

What is Analytics Hub for?

A

Analytics Hub lets you exchange data analytics assets across organizations to address challenges of data reliability and cost. You can exchange data, ML models, or other analytics assets and easily publish or subscribe to shared datasets in an open, secure, and privacy-safe environment.

It is a convenient way to build a data ecosystem.

13
Q

List roles in Analytics Hub.

A
  • a data publisher,
  • an exchange administrator,
  • a data subscriber.
14
Q

List Analytics Hub components.

A
  • a publisher project,
  • a subscriber project,
  • the exchange.
15
Q

What are data exchanges?

A

Exchanges are collections of data and analytics assets designed for sharing.

16
Q

What are BigQuery shared datasets?

A

Shared datasets are collections of tables and views in BigQuery that are defined by a data publisher and make up the unit of cross-project or cross-organizational sharing.

17
Q

What is recommended to use with large volumes of unstructured data?

A
  • With large volumes of unstructured data, consider using Dataflow, which uses the Apache Beam programming model.
  • You can use Dataflow to convert the unstructured data into binary data formats like TFRecord, which can improve the performance of data ingestion during training (see the sketch below).
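
A minimal sketch of that conversion, assuming a hypothetical bucket path and a one-feature schema in which each input line becomes one tf.train.Example:

```python
import apache_beam as beam
import tensorflow as tf

def to_tf_example(line: str) -> bytes:
    # Assumed schema: a single bytes feature named "text" per record.
    feature = {
        "text": tf.train.Feature(
            bytes_list=tf.train.BytesList(value=[line.encode("utf-8")])
        )
    }
    return tf.train.Example(
        features=tf.train.Features(feature=feature)
    ).SerializeToString()

# Pass DataflowRunner pipeline options to run this on Dataflow instead of locally.
with beam.Pipeline() as pipeline:
    (
        pipeline
        | "Read" >> beam.io.ReadFromText("gs://my-bucket/raw/*.txt")
        | "ToExample" >> beam.Map(to_tf_example)
        | "Write" >> beam.io.WriteToTFRecord("gs://my-bucket/tfrecords/data")
    )
```
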
18
Q

When is Dataproc recommended?

A
  • if your organization has an investment in an Apache Spark code base and skills,
  • if it has existing implementations that use Hadoop with Spark to perform ETL.
19
Q

Use one-off Python scripts for smaller datasets that fit into memory.

A
  • True
  • If you need to perform transformations that are not expressible in Cloud SQL or that are for streaming, you can use a combination of Dataflow and the pandas library.
20
Q

Autoscaling is supported in both Dataflow and Dataproc.

A

True

21
Q

Dataprep can also be considered a data preprocessing option: data can be cleaned, structured, enriched, and validated with Dataprep by Trifacta.

A

True

22
Q

List two important types of TensorFlow parameters.

A
  • learning rate controls the size of the step in the weight space,
  • batch size controls the number of samples that the gradient is calculated on.

Model performance is very sensitive to learning rate and batch size (a minimal sketch of setting both follows).
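
A minimal sketch, assuming a toy regression model and random data, of where the two parameters are set in Keras:

```python
import numpy as np
import tensorflow as tf

# Toy data (assumption): 1024 samples with 8 features each.
x_train = np.random.rand(1024, 8).astype("float32")
y_train = np.random.rand(1024, 1).astype("float32")

model = tf.keras.Sequential([tf.keras.layers.Dense(1)])
model.compile(
    optimizer=tf.keras.optimizers.SGD(learning_rate=0.01),  # step size in weight space
    loss="mse",
)
model.fit(x_train, y_train, batch_size=64, epochs=5)  # samples per gradient step
```
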

23
Q

What is the most common architecture for distributed training?

A

Data parallelism.

24
Q

Describe Data parallelism.

A
  • In data parallelism, you run the same model and computation on every device,
  • but you train each of them using different training samples.
  • Each device computes loss and gradients based on the training samples it sees.
  • The model’s parameters are then updated using these gradients.
  • The updated model is used in the next round of computation.
25
Q

What approaches are used to update the model using gradients from various devices?

A
  • the asynchronous parameter server approach,
  • the synchronous allreduce approach (see the sketch below).
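
A minimal sketch of the synchronous all-reduce approach in TensorFlow, assuming a toy model and dataset; tf.distribute.MirroredStrategy replicates the model across the local devices and all-reduces gradients each step:

```python
import numpy as np
import tensorflow as tf

# Toy dataset (assumption); each replica sees a different slice of every global batch.
dataset = tf.data.Dataset.from_tensor_slices(
    (np.random.rand(1024, 8).astype("float32"),
     np.random.rand(1024, 1).astype("float32"))
).batch(64)

strategy = tf.distribute.MirroredStrategy()
with strategy.scope():  # variables created here are mirrored across devices
    model = tf.keras.Sequential([tf.keras.layers.Dense(1)])
    model.compile(optimizer="sgd", loss="mse")

# Gradients are all-reduced across replicas before the shared weights are updated.
model.fit(dataset, epochs=5)
```
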
26
Q

List advantages of using a training service rather than training directly in your notebook instance.

A
  • Instead of training your model directly within your notebook instance, you can submit a training job from your notebook. The training job automatically provisions computing resources and deprovisions them when the job is complete.
  • The training service can help to modularize your architecture: put your training code into a container so that it operates as a portable unit.
  • The training code can export the trained model file, enabling other AI services to work with it in a decoupled manner.
  • The training service also supports reproducibility: each training job is tracked with its inputs, outputs, and the container image used.
  • The training service also supports distributed training, which means that you can train models across multiple nodes in parallel (a job-submission sketch follows).
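
A minimal sketch of submitting such a job with the Vertex AI SDK; the project, bucket, script path, and container image are assumptions (check the available prebuilt training images for your framework version):

```python
from google.cloud import aiplatform

aiplatform.init(
    project="my-project",             # assumption
    location="us-central1",
    staging_bucket="gs://my-bucket",  # assumption
)

# The service provisions the machines, runs the containerized training code,
# and deprovisions everything when the job completes.
job = aiplatform.CustomTrainingJob(
    display_name="demo-training",
    script_path="task.py",  # your training code (assumption)
    container_uri="us-docker.pkg.dev/vertex-ai/training/tf-cpu.2-12:latest",
)
job.run(replica_count=1, machine_type="n1-standard-4")
```
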
27
Q

What if a learning rate is too large?

A

A large learning rate value may result in the model learning a sub-optimal set of weights too fast or an unstable training process.

28
Q

What can happen if the learning rate is too small?

A

Training may take a long time.

29
Q

Larger batch sizes require smaller learning rates.

A

True

30
Q

List hyperparameter tuning approaches that Vertex Vizier offers.

A
  • grid search
  • random search
  • Bayesian optimization (the default; see the sketch below)
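
A minimal sketch of a tuning job with the Vertex AI SDK, assuming a training container that reports a metric named `accuracy`; Vizier's Bayesian optimization is used when no other search algorithm is specified:

```python
from google.cloud import aiplatform
from google.cloud.aiplatform import hyperparameter_tuning as hpt

aiplatform.init(project="my-project", location="us-central1",
                staging_bucket="gs://my-bucket")  # assumptions

# One trial = one run of this custom job (the container image is an assumption).
worker_pool_specs = [{
    "machine_spec": {"machine_type": "n1-standard-4"},
    "replica_count": 1,
    "container_spec": {"image_uri": "gcr.io/my-project/trainer:latest"},
}]
custom_job = aiplatform.CustomJob(
    display_name="trial-job", worker_pool_specs=worker_pool_specs
)

hp_job = aiplatform.HyperparameterTuningJob(
    display_name="demo-tuning",
    custom_job=custom_job,
    metric_spec={"accuracy": "maximize"},  # the training code must report this metric
    parameter_spec={
        "learning_rate": hpt.DoubleParameterSpec(min=1e-4, max=1e-1, scale="log"),
        "batch_size": hpt.DiscreteParameterSpec(values=[32, 64, 128], scale=None),
    },
    max_trial_count=20,
    parallel_trial_count=4,
)
hp_job.run()
```
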
31
Q

What are the advantages of the bayesian optimization hyperparameter tuning method?

A
  • takes into account past evaluations when choosing which hyperparameter set to evaluate next.
  • typically requires fewer iterations to get the optimal set of hyperparameter values
  • limits the number of times a model needs to be trained
32
Q

What is Vertex Vizier?

A

Vertex Vizier is a black-box optimization service that helps you tune hyperparameters in complex machine learning models.

33
Q

Which algorithm is useful if you want to specify a quantity of trials that is greater than the number of points in the feasible space?

A

Grid Search

34
Q

What is a specific characteristic of batch prediction?

A

Batch prediction is asynchronous, which means that the model will wait until it processes all of the prediction requests before returning a CSV file or a BigQuery table with prediction values.
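
A minimal sketch with the Vertex AI SDK; the model resource name and Cloud Storage paths are assumptions:

```python
from google.cloud import aiplatform

aiplatform.init(project="my-project", location="us-central1")

model = aiplatform.Model(
    "projects/123/locations/us-central1/models/456"  # assumption
)
job = model.batch_predict(
    job_display_name="demo-batch",
    gcs_source="gs://my-bucket/instances.jsonl",
    gcs_destination_prefix="gs://my-bucket/predictions/",
    sync=False,  # the job runs asynchronously in the background
)
job.wait()  # results land under the destination prefix once all requests are processed
```
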

35
Q

What are specifics of online prediction?

A
  • useful if your model is part of an application and parts of your system depend on a quick prediction turnaround;
  • synchronous (real time), which means that it quickly returns a prediction but accepts only one prediction request per API call (see the sketch below).
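
A minimal sketch of a synchronous online call with the Vertex AI SDK, assuming an already-deployed endpoint and a hypothetical feature payload:

```python
from google.cloud import aiplatform

aiplatform.init(project="my-project", location="us-central1")

endpoint = aiplatform.Endpoint(
    "projects/123/locations/us-central1/endpoints/789"  # assumption
)
# One request per call; the response comes back synchronously.
response = endpoint.predict(instances=[{"feature_a": 1.0, "feature_b": "x"}])
print(response.predictions)
```
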
36
Q

What do you need in order to use a custom container to serve predictions from a custom-trained model?

A

You must provide Vertex AI with a Docker container image that runs an HTTP server.
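
A minimal sketch of such a server, assuming Flask; Vertex AI injects the port and route paths through the AIP_* environment variables, and the echo logic stands in for real model inference:

```python
import os
from flask import Flask, request, jsonify

app = Flask(__name__)

@app.route(os.environ.get("AIP_HEALTH_ROUTE", "/health"))
def health():
    # Vertex AI polls this route to decide whether the container is ready.
    return "ok", 200

@app.route(os.environ.get("AIP_PREDICT_ROUTE", "/predict"), methods=["POST"])
def predict():
    instances = request.get_json()["instances"]
    # Placeholder: echo the inputs back; replace with real model inference.
    return jsonify({"predictions": instances})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=int(os.environ.get("AIP_HTTP_PORT", "8080")))
```
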

37
Q

List Vertex AI BigQuery data source requirements.

A
  • BigQuery data source tables cannot be larger than 100 gigabytes.
  • you must use a multi-regional BigQuery dataset in the US or EU locations.
  • If the table is in a different project, you must provide the BigQuery Data Editor role to the Vertex AI service account in that project.
38
Q

List Vertex AI data source requirements for CSV files

A
  • The first line of the data source must contain the names of the columns.
  • Each data source file cannot be larger than 10 gigabytes. You can include multiple files, up to a maximum combined size of 100 gigabytes.
  • If the Cloud Storage bucket is in a different project than the one where you use Vertex AI, you must provide the Storage Object Creator role to the Vertex AI service account in that project.
39
Q

What is Vertex AI model monitoring?

A

Vertex AI model monitoring is a service that helps you manage the performance of your models:
- lets you detect drift in data quality,
- identify skew in training versus serving data,
- monitor feature attribution,
- use the UI to visualize monitoring metrics.

40
Q

What is the baseline for skew detection?

A

the statistical distribution of the feature’s values in the training data.

41
Q

What is the baseline for drift detection?

A

the statistical distribution of the feature’s values seen in production in the recent past.

42
Q

What happens when a feature is monitored for training-serving skew or prediction drift?

A
  • model monitoring computes the statistical distribution of the latest feature values seen in production.
  • this statistical distribution is then compared against a baseline distribution by computing a distance score that determines how similar the production feature values are to the baseline.
  • when the distance score between the two statistical distributions exceeds a certain threshold, model monitoring identifies that as skew or drift (an illustrative sketch follows).
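
An illustrative sketch of the idea (not the Vertex AI implementation itself), using Jensen-Shannon divergence, which Vertex AI model monitoring uses for numerical features, on synthetic data:

```python
import numpy as np
from scipy.spatial.distance import jensenshannon

def distance_score(baseline, production, bins=20):
    # Histogram both samples over a shared range, then compare the distributions.
    lo = min(baseline.min(), production.min())
    hi = max(baseline.max(), production.max())
    p, _ = np.histogram(baseline, bins=bins, range=(lo, hi))
    q, _ = np.histogram(production, bins=bins, range=(lo, hi))
    return jensenshannon(p, q)  # normalizes the histograms internally

baseline = np.random.normal(0.0, 1.0, 10_000)    # training distribution (synthetic)
production = np.random.normal(0.5, 1.2, 10_000)  # shifted serving distribution

THRESHOLD = 0.3  # choosing this is part of "fine-tuning alert thresholds"
if distance_score(baseline, production) > THRESHOLD:
    print("Alert: possible skew or drift for this feature")
```
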
43
Q

What are Vertex AI pipelines?

A

Vertex AI pipelines are portable and scalable ML workflows that are based on containers and Google Cloud services.

44
Q

What is recommended when you use TensorFlow in an ML workflow that processes terabytes of structured data or text data?

A

We recommend that you build your pipeline using TFX.

For other use cases, we recommend that you build your pipeline using the Kubeflow Pipelines SDK (a minimal sketch follows).
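
A minimal sketch of a Kubeflow Pipelines (v2) pipeline compiled for Vertex AI Pipelines; the component body and names are assumptions:

```python
from kfp import compiler, dsl

@dsl.component
def say_hello(name: str) -> str:
    # Each component runs as its own container step in the pipeline.
    return f"Hello, {name}!"

@dsl.pipeline(name="demo-pipeline")
def pipeline(name: str = "Vertex"):
    say_hello(name=name)

# Produces a job spec that Vertex AI Pipelines can run, e.g. via
# aiplatform.PipelineJob(display_name="demo", template_path="pipeline.json").run()
compiler.Compiler().compile(pipeline, "pipeline.json")
```
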

45
Q

What are the best practices for model monitoring?

A
  • skew detection,
  • fine-tuning alert thresholds,
  • using feature attributions to detect data drift or skew
  • tracking outliers.
46
Q

What are the specifics of model monitoring?

A

It works for structured data, such as numerical and categorical features, but not for unstructured data, such as images.