M3 Flashcards

1
Q

What are the two phases in ML

A

Training phase and Inference phase

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the purpose of EDA

A

To maximize insight into a data set, uncover underlying structure, extract important variables, detect outliers and anomalies, test underlying assumptions, develop parsimonious models and determine optimal factor settings.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the attributes related to data quality

A

Accuracy, completeness, consistent, and timely

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the difference between three data analysis approaches?

A

The difference is the sequence and focus of the intermediate steps.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are three main data analysis approaches, and explain each?

A

classical analysis, the data collection is followed by the imposition of a model, normality, linearity, for and the analysis, estimation and testing that follows are focused on the parameters of that model.

EDA: For EDA, the data collection is not followed by a model imposition.Rather, it is followed immediately by analysis with a goal of inferring what model would be appropriate.

Bayesian analysis, the analyst attempts to answer research questions about unknown parameters using probability statements based on prior data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Two types of supervised problems?

A

Regression and Classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What type of loss functions do LR and classification use usually?

A

regression models usually use means-squared error as their loss function whereas classification models tend to use cross entropy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

two techniques used to prevent overfitting in a loss function

A

Regularization and early stopping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is a loss function?

A

It quantifies the error between your model’s predictions and the true target values. By minimizing this loss function during training, you are essentially fine-tuning your model to make accurate predictions, which is the primary objective of machine learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is regularization in machine learning?

A

regularization is a technique used to prevent overfitting and improve the generalization ability of a model. The regularization term is typically controlled by a hyperparameter called the regularization strength, denoted as “λ” (lambda). The higher the value of λ, the stronger the regularization effect, and the more the model’s parameters are constrained.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Auto ML?

A

No data science expertise is required to use
AutoML. AutoML lets you create and train a model with minimal technical effort.
codeless
Must target one of the autoMLs predefined objectives(classification, regression)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is r square evaluation metric?

A

R squared is the square of the Pearson correlation coefficient between the observed and predicted values.
The R squared value ranges from zero to one, where a higher value indicates a higher-quality model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Explain AUC

A

PR AUC, this is the area under the precision-recall PR curve. This value ranges from zero to one, where a higher value indicates a higher-quality model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Explain ROC AUC

A

this is the area under the receiver operating characteristic ROC curve. It is plotted as TP rate vs FP rate for different thresholds.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Explain Log loss

A

Log loss, this is the cross-entropy between the model predictions and the target values.
This ranges from zero to infinity, where a lower value indicates a higher-quality model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Three steps in making a recommendation system with BQ ML

A

prepare training data in BigQuery, train a recommendation system with BigQuery ML, and use the predicted recommendations in production.

17
Q

Which of these BigQuery supported classification models is most relevant for predicting binary results, such as True/False?

DNN Classifier (TensorFlow)

AutoML Tables

Logistic Regression

XGBoost

A

Logistic Regression

18
Q

what is a scaling parameter used to set the step size of the gradient descent called?

A

learning_rate

19
Q

Why retraining the model could output different results even if all the setting are the same?

A

The loss surface could have more than 1 minima (a non convex surface, without a global minima)

20
Q

What are the three steps of a training loop?

A

Calculate derivative, take a step, check loss.

21
Q

How can you speed up training time of a model?

A
  1. Number of data points that we collect the derivative on (the derivative comes from our loss function, and our loss function composes the error of a number of predictions together.So this method essentially reduces the number of data points that we feed into our loss function at each iteration of our algorithm.
    The reason that this might still work is that it’s possible to extract samples from our training data that on average balance each other out.sampling strategy selects from our training set with uniform probablity. This is called mini-batch gradient descent)
  2. frequency with which we check the loss.
22
Q

What do they mean by batch-size

A

the size of samples in mini batch gradient descent

23
Q

What are the two consequences of inappropriate minima

A

Doesn’t reflect the relationship between features and label
won’t generalize well

24
Q

Difference between loss function and performance metrics

A

Loss function is during the training, whereas PM is after
LF is harder to understand
PM is directly connected to business goals

25
Q

What is a type 1 error

26
Q

What is a Type 2 error

27
Q

Why can’t we use the validation set only to report the model performace

A

Because you used the validation data set to choose when to stop the training, so now it is no longer independent.

28
Q

What is bootstrapping or cross-validation?