12. Model Monitoring, Tracking, and Auditing Metadata Flashcards by KK Cheng

What are two types of drifts?

Concept drift
Data drift

How well did you know this?

Not at all

Perfectly

What is concept drift?

The relationship between input variables and predicted variables change.

How well did you know this?

Not at all

Perfectly

How do you prevent model deterioration?

Model monitoring, i.e., monitor input data and evaluate the model with the same metrics during the training phase.

How well did you know this?

Not at all

Perfectly

What is data drift?

Input data change, e.g., statistical distribution, schema, feature definition

How well did you know this?

Not at all

Perfectly

What can Vertex AI model monitoring monitor?

Training-serving skew: Feature distribution differences between production and training.
Prediction drift: Input’s statistical distribution changes in production over time.

How well did you know this?

Not at all

Perfectly

What are the two types of data that can be monitored?

Categorical features and numerical values

How well did you know this?

Not at all

Perfectly

How do you calculate baselines?

Baseline for skew detection: The statistical distribution of the feature’s values in the training data
Baseline for drift detection: The statistical distribution of the feature’s values in the production data (recent past)
Distribution calculations for categorical and numerical features (bin): The count or percentage of instances of each possible value.

How well did you know this?

Not at all

Perfectly

How does Vertex AI monitor drift and skew?

It compares the baselines and the equivalent latest values in the production.
Categorical features: L-infinity distance
Numerical features: Jensen-Shannon divergence
Vertex AI takes as an anomaly if the distance score hits a pre-defined threshold.

How well did you know this?

Not at all

Perfectly

What are the factors for effective monitoring?

Sampling rate
Monitoring frequency
Alerting thresholds
Number of models in an endpoint

How well did you know this?

Not at all

Perfectly

How do you monitor input schemas?

The input values are part of the payload of the prediction requests. You can specify a schema when you configure model monitoring.

How well did you know this?

Not at all

Perfectly

What are the two types of schema?

Automatic schema: Model monitoring will analyze and detect the schema.
Custom schema: User specified in Open API format

How well did you know this?

Not at all

Perfectly

What are the three types of schema formats?

Object: key/value pairs
Array: array-like format
String: csv-string

How well did you know this?

Not at all

Perfectly

What is the reason for logging?

Monitor input trends
Auditing

How well did you know this?

Not at all

Perfectly

What are the models supported by Vertex AI logging?

AutoML models (tabular & image)
Custom-trained models
Logging can be enabled during model deployment or endpoint creation

How well did you know this?

Not at all

Perfectly

What are the three types of logging?

Container Logging: stdout and stderr for debugging
Access Logging: Time stamp and latency for each request to Cloud Logging
Request-Response Logging: sample of the online prediction requests and responses.

How well did you know this?

Not at all

Perfectly

Where can you change log settings?

Study These Flashcards

Create an endpoint
deploy a model to an endpoint

What is parameters, artifacts and metrics of an ML experiment called?

Study These Flashcards

Metadata

What are the limitations for model monitoring and logging?

Study These Flashcards

Both services use BigQuery table
Model monitoring is enabled, you can’t enable request-response logging.
Request-response logging is enabled first, then model monitoring. The request-response logging can’t be modified.

What are the uses of metadata?

Study These Flashcards

Detect model degradation
Compare different sets of hyperparameters
Track the lineage of the ML artifacts
Rerun an ML workflow with the same parameters
Track downstream usage of artifacts for audit purposes

Hints: Whales Dive, Humpbacks Love Abyss.

What is Metadata store?

Study These Flashcards

It is the top-level container for all the metadata resources, i.e., one by organization

How do you manage ML Metadataschema?

Study These Flashcards

Most common types of resources stored have predefined schemas called system schemas.

What are Metadata resources?

Study These Flashcards

Artifacts: Pieces of data created by or consumed by an ML workflow, e.g., datasets, models, input files, training logs, metrics
Context: A group of artifacts and executions that can be queried for identifying the best model.
Execution: A step in a ML workflow.
Events: An event connects artifacts and executions.
Metadataschema: Specifies the schema to be used by the particular types of data like artifact or execution.

Hints: Apples Can Entertain Every Monkey.

What are the functions of Vertex AI Experiments?

Study These Flashcards

Track the steps for an experiment run (preprocessing, embedding, training)
Track input like algorithms, hyperparameters, dataset, etc.
Track output of these steps like models, metrics, checkpoints, etc

What are automatically generated when you use Vertex AI Pipelines?

Study These Flashcards

The model metadata and artifacts are automatically stored in the metadata store for lineage tracking, e.g., dataset summary, model evaluation metrics, metadata on certain executions.

How do you use Vertex AI Debugging?

Install interactive Bash shell in the training container Run the custom training Make sure the user having right permissions Enable enableWebAccess API You can use interactive shell to do the followings: Check permission Visualize Python execution with profiling tools Analyze performance of training node using Perf Check CPU or GPU usage

12. Model Monitoring, Tracking, and Auditing Metadata Flashcards

(25 cards)