DL-04 - MLOps Flashcards

1
Q

DL-04 - MLOps

What are the steps of the DevOps cycle? (7, PCVPRCM)

A

1) Plan
2) Create
3) Verify
4) Package
5) Release
6) Configure
7) Monitor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

DL-04 - MLOps

What are the goals of MLOps? (4, EMDQ)

A
  • Faster experimentation
  • Faster model development
  • Faster deployment of updated models
  • Quality assurance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

DL-04 - MLOps

Name the steps of the MLOps lifecycle? (8, EDDTRDIM)

A

1) EDA (Exploratory data analysis)
2) Data Prep
3) Develop
4) (Re-)Train
5) Review
6) Deploy
7) Inference
8) Monitor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

DL-04 - MLOps

What happens in CI in MLOps? (2)

A
  • When you update your code/data, the ML pipeline reruns.
  • Integration is run before release, e.g. performance/error tests
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

DL-04 - MLOps

What are some things you should monitor in MLOps? (2)

A
  • Performance over time
  • Data drift
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

DL-04 - MLOps

What happens in CD in MLOps? (2 examples)

A
  • Test model in staging environment.
  • Soft release to some portion of customers, canary release.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

DL-04 - MLOps

What is data drift?

A

Data drift is the change in data distribution over time, affecting the model’s performance and accuracy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

DL-04 - MLOps

What is a canary release?

A

A canary release is a deployment strategy where a new model or application version is gradually rolled out to a small percentage of users to test its performance and identify potential issues before a full-scale release.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

DL-04 - MLOps

What are the most important things about DVC? (5, VRCPS)

A
  • Version control for data
  • Reproducibility
  • collaboration
  • pipeline management
  • remote storage integration.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

DL-04 - MLOps

What are the most important things about CML? (4, ACSR)

A
  • automation
  • collaboration
  • scalability
  • reproducibility
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

DL-04 - MLOps

How does CML mitigate the problem of people working on the same code?

A

By enabling collaboration through version control and automated machine learning pipelines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

DL-04 - MLOps

How can you ensure reproducibility in deep learning projects?

A

By tracking what logic, dataset, and pipeline were used to get a certain model/performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

DL-04 - MLOps

In Github, what is a Self-hosted runner?

A

A Self-hosted runner is an installable application that you can host on your own hardware to run GitHub Actions and Workflows.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

DL-04 - MLOps

What does a self-hosted runner for Github allow you to do?

A

Run GitHub Actions workflows on your own infrastructure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly