13. Maintaining ML Solutions Flashcards

Question 1

Q

What are the steps in ML?

Answer

A

Data:
Extraction (from sources)
Analysis (EDA)
Preparation (transform and feature engineering)
Model:
Training (get the best model)
Evaluation (assess the model quality)
Validation (meet a predefined performance metrics)
Deployment (online & batch):
Serving (RESTful endpoint)
Monitor (Detect anomalies, drift & skew)

Hints:
Data: Elephants Are Playful
Model: Tigers Enjoy Vegetation During Sunny Mornings

Question 2

Q

What are the three levels of MLOps?

Answer

A

Level 0: Manual Phase
Level 1: Strategic automation phase
Level 2: CI/CD automation, transformational phase

Question 3

Q

What are the key features of Level 0?

Answer

A

Manual
ML and MLOps are different teams
No CI/CD/CT
No deploying an entire ML system

Question 4

Q

What are the key features of Level 1?

Answer

A

Orchestrated experimentation
CT
Experiment-operational symmetry
Modular components
CD
Pipeline deployment

Question 5

Q

What are the considerations for triggering retraining?

Answer

A

Training costs
Training time
Delayed training
Scheduled training

Question 6

Q

What are the key features of Level 2?

Answer

A

Pipeline
CI/CD

Question 7

Q

What are the triggers for retraining?

Answer

A

Absolute threshold
Rate of degradation

Question 8

Q

What are the problems for not having a centralised feature store?

Answer

A

Non-reusable: Features created not shared
Governance: Features created by different sources not governed
Cross-collaboration: Features not being shared continue to go separately.
Training and serving differences: Differences may exist between training and serving data.
Productizing features: Lack of automation in features used in experimentation.

Question 9

Q

What is model versioning for?

Answer

A

Deploy an additional model to the existing model.

Question 10

Q

What are the two key features of Feature Store?

Answer

A

Process large feature sets quickly
Access the features with low latency for real-time and batch predictions.

Question 11

Q

Is Vertex AI Feature Store a managed service and scale dynamically?

Question 12

Q

What model does Feature Store use to store all the data?

Answer

A

Time-series

Question 13

Q

What is the hierarchy of featurestore?

Answer

A

Featurestore > EntityType > Feature

Question 14

Q

What are the two types of ingestions supported by Feature Store?

Answer

A

Batch and streaming ingestion, e.g., BigQuery to Feature Store.

Question 15

Q

What are the two types of retrieving supported by Feature Store?

Answer

A

Batch and online.

Question 16

Q

What are the best practices to use IAM security?

Answer

A

Least privilege
Actively manage service accounts and service account keys
Enable auditing
Check policy management

Question 17

Q

What service do you use to manage permissions to perform various operations?

Answer

A

Identity and Access Management (IAM)

Question 18

Q

What is the specific uses of IAM in Vertex AI?

Answer

A

Google automatically creates several service accounts for Google Cloud Projects. They may have more permissions than required. Use custom service accounts.

Question 19

Q

What is Access Transparency in Vertex AI?

Answer

A

You need logs to track what content and who is accessing it. They may be legal and compliance requirements.
There are two types of access logs. Cloud Audit logs are logs of users from your organisation and Access Transparency logs are logs of Google personnel.

Question 20

Q

What are the common training errors?

Answer

A

Input data not transformed or encoded
Tensor shape mismatched
Out of memory errors because of instance size

Question 21

Q

What are the common serving errors?

Answer

A

Input data not transformed or encoded
Signature mismatched

Question 22

Q

What are the ways to prevent and reduce training and serving errors?

Answer

A

Compute statistics
Infer schema
Detect anomalies

Question 23

Q

What does Vertex AI provide to debug training for both pre-built and custom containers?

Answer

A

Interactive shell

Question 24

Q

What can you inspect with interactive shell during training?

Answer

A

Run tracing and profiling tools
Analyze GPU utilization
Validate IAM permissions for the container