Developing Machine Learning Solutions Flashcards

1
Q

End to end machine learning lifecycle process

A

Business goal identification

ML problem framing

Data processing (data collection, data preprocessing, and feature engineering)

Model development (training, tuning, and evaluation)

Model deployment (inference and prediction)

Model monitoring

Model retraining
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Bias

A

The gap between your predicted value and the actual value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Variance

A

Dispersion of your predicted values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Classification metrics

A

1) Accuracy 2) Precision 3) Recall 4) F1 4) AUC-ROC

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Regression metrics

A

1) Mean squared error
2) R squared

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Confusion matrix

A

For a classification model. A confusion matrix can help classify why and how a model gets something wrong.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Model Accuracy or Score Equation

A

Correct predictions/Total number of predictions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Precision equation

A

Positive predictions that are correct/Total number of positive predictions. Great metric when the cost of false positives is high

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Recall (sensitivity) equation

A

True positive count/sum of true positive and false negatives

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

AUC-ROC

A

For classification models. show what the curve for true positive compared to false positive looks like at various thresholds. AUC-ROC uses sensitivity (true positive rate) and specificity (false positive rate)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is feature engineering?

A

Feature engineering transforms data into features or inputs that will be valuable for the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Machine learning solution without code

A

SageMaker Canvas

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is MLOps

A

MLOps is a set of practices and principles that aims to manage the entire lifecycle of machine learning systems, from model development and training to deployment, monitoring, and maintenance. It provides a structured approach to streamlining the ML workflow, ensuring reliability, scalability, and reproducibility.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the machine learning lifecycle?

A

the end-to-end process of developing, deploying, and maintaining machine learning models from identifying the business problem to the deploying and monitoring the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly