DL-04 - MLOps Flashcards
DL-04 - MLOps
What are the steps of the DevOps cycle? (7, PCVPRCM)
1) Plan
2) Create
3) Verify
4) Package
5) Release
6) Configure
7) Monitor
DL-04 - MLOps
What are the goals of MLOps? (4, EMDQ)
- Faster experimentation
- Faster model development
- Faster deployment of updated models
- Quality assurance
DL-04 - MLOps
Name the steps of the MLOps lifecycle? (8, EDDTRDIM)
1) EDA (Exploratory data analysis)
2) Data Prep
3) Develop
4) (Re-)Train
5) Review
6) Deploy
7) Inference
8) Monitor
DL-04 - MLOps
What happens in CI in MLOps? (2)
- When you update your code/data, the ML pipeline reruns.
- Integration is run before release, e.g. performance/error tests
DL-04 - MLOps
What are some things you should monitor in MLOps? (2)
- Performance over time
- Data drift
DL-04 - MLOps
What happens in CD in MLOps? (2 examples)
- Test model in staging environment.
- Soft release to some portion of customers, canary release.
DL-04 - MLOps
What is data drift?
Data drift is the change in data distribution over time, affecting the model’s performance and accuracy.
DL-04 - MLOps
What is a canary release?
A canary release is a deployment strategy where a new model or application version is gradually rolled out to a small percentage of users to test its performance and identify potential issues before a full-scale release.
DL-04 - MLOps
What are the most important things about DVC? (5, VRCPS)
- Version control for data
- Reproducibility
- collaboration
- pipeline management
- remote storage integration.
DL-04 - MLOps
What are the most important things about CML? (4, ACSR)
- automation
- collaboration
- scalability
- reproducibility
DL-04 - MLOps
How does CML mitigate the problem of people working on the same code?
By enabling collaboration through version control and automated machine learning pipelines.
DL-04 - MLOps
How can you ensure reproducibility in deep learning projects?
By tracking what logic, dataset, and pipeline were used to get a certain model/performance.
DL-04 - MLOps
In Github, what is a Self-hosted runner?
A Self-hosted runner is an installable application that you can host on your own hardware to run GitHub Actions and Workflows.
DL-04 - MLOps
What does a self-hosted runner for Github allow you to do?
Run GitHub Actions workflows on your own infrastructure.