Introduction to Machine Learning Flashcards
If the business case is to predict fraud detection, which is the correct Objective to choose in Vertex AI?
Segmentation
Forecasting
Regression/Classification
Clustering
Regression/Classification
Which of the following metrics can be used to find a suitable balance between precision and recall in a model?
F1 Score
ROC AUC
Log Loss
PR AUC
PR AUC, this is the area under the precision-recall PR curve.
F1 score, this is the harmonic mean of precision and recall. F1 is a useful metric if you’re looking for a balance between precision and recall and there’s an uneven class distribution.
ROC AUC, this is the area under the receiver operating characteristic ROC curve.
Log loss, this is the cross-entropy between the model predictions and the target values.
MAE, MAPE, RMSE, RMSLE and R2 are all available as test examples in the Evaluate section of Vertex AI and are common examples of what type of metric?
Decision Trees Progression Metrics
Linear Regression Metrics
Clustering Regression Metrics
Forecasting Regression Metrics
Linear Regression Metrics
If a dataset is presented in a Comma Separated Values (CSV) file, which is the correct data type to choose in Vertex AI?
Video
Tabular
Text
Image
Tabular
For a user who can use SQL, has little Machine Learning experience and wants a ‘Low-Code’ solution, which Machine Learning framework should they use?
Python
BigQuery ML
AutoML
Scikit-Learn
BigQuery ML
What is the default setting in AutoML Tables for the data split in model evaluation?
80% Training 10% Validation, 10% Testing
80% Training, 15% Validation, 5% Testing
80% Training, 5% Validation, 15% Testing
70% Training, 20% Validation, 10% Testing
80% Training 10% Validation, 10% Testing
What does the Feature Importance attribution in Vertex AI display?
How much each feature impacts the model, expressed as a decimal
How much each feature impacts the model, expressed as a ratio
How much each feature impacts the model, expressed as a ranked list
How much each feature impacts the model, expressed as a percentage
How much each feature impacts the model, expressed as a percentage
Which of the following are stages of the Machine Learning workflow that can be managed with Vertex AI?
All of the options.
Train an ML model on your data.
Deploy your trained model to an endpoint for serving predictions.
Create a dataset and upload data.
All of the options.
What is the main benefit of using an automated Machine Learning workflow?
It reduces the time it takes to develop trained models and assess their performance.
It makes the model run faster.
It deploys the model into production.
It makes the model perform better.
It reduces the time it takes to develop trained models and assess their performance.
Which of the following are advantages of BigQuery ML when compared to Python based ML frameworks?
All of the options
BigQuery ML automates multiple steps in the ML workflow
BigQuery ML custom models can be created without the use of multiple tools
Moving and formatting large amounts of data takes longer with Python based models compared to model training in BigQuery
All of the options
Which of these BigQuery supported classification models is most relevant for predicting binary results, such as True/False?
DNN Classifier (TensorFlow)
AutoML Tables
XGBoost
Logistic Regression
Logistic Regression
Where labels are not available, for example where customer segmentation is required, which of the following BigQuery supported models is useful?
Time Series Anomaly Detection
Recommendation - Matrix Factorization
Time Series Forecasting
K-Means Clustering
K-Means Clustering
For Classification or Regression problems with decision trees, which of the following models is most relevant?
XGBoost
AutoML Tables
Wide and Deep NNs
Linear Regression
XGBoost
What are the 3 key steps for creating a Recommendation System with BigQuery ML?
Prepare training data in BigQuery, specify the model options in BigQuery ML, export the predictions to Google Analytics
Import training data to BigQuery, train a recommendation system with BigQuery ML, tune the hyperparameters
Prepare training data in BigQuery, train a recommendation system with BigQuery ML, use the predicted recommendations in production
Prepare training data in BigQuery, select a recommendation system from BigQuery ML, deploy and test the model
Which of the following loss functions is used for classification problems?
MSE
Cross entropy
None of the options are correct.
Both MSE & Cross entropy
Cross entropy