Training ML Flashcards

1
Q

Custom Training: Can manually optimize model performance with hyperparameter tuning?

A

Yes. You can tune the model during each training run for experimentation and comparison.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

AutoML: Data science expertise needed?

A

No.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

AutoML: Time to trained model?

A

Lower. Less data preparation is required, and no development is needed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

AutoML: Limits on machine learning objectives?

A

Yes, you must target one of AutoML’s predefined objectives.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

BigQueryML: Can manually optimize model performance with hyperparameter tuning?

A

Yes. BigQuery ML supports hyperparameter tuning when training ML models using CREATE MODEL statements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

AutoML: Limits on data size?

A

Yes. AutoML uses managed datasets; data size limitations vary depending on the type of dataset.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the possible solutions to working with a sparse dataset?

A
  1. Use a model that supports training with sparse datasets (e.g. Wide&Deep, Autoencoder) 2. Remove features with low variance (e.g. use Lasso or L1 regularization) 3. Use a Dimensionality reduction method (e.g. PCA) to make sparse datasets dense, extracting principal components with the most variance.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Custom Training: Limits on data size?

A

For unmanaged datasets, no. Managed datasets have the same limits as managed dataset objects created in and hosted by Vertex AI and are used to train AutoML models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Custom Training: Programming ability needed?

A

Yes, to develop the training application.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

BigQueryML: Data science expertise needed?

A

No.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

BigQueryML: Limits on data size?

A

Yes. BigQuery ML enforces appropriate quotas on a per-project basis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the challenges with sparse datasets?

A

Overfitting, High memory usage, Computational Complexity, Inaccurate results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

BigQueryML: Programming ability needed?

A

SQL programming ability required to build, evaluate, and use the model in BigQuery ML.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

AutoML: Can manually optimize model performance with hyperparameter tuning?

A

No. AutoML does some automated hyperparameter tuning, but you can’t modify the values used.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Custom Training: Can control aspects of the training environment?

A

Yes. You can specify aspects of the environment such as Compute Engine machine type, disk size, machine learning framework, and number of nodes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

BigQueryML: Limits on machine learning objectives?

A

Yes.

17
Q

Custom Training: Limits on machine learning objectives?

A

No.

18
Q

BigQueryML: Time to trained model?

A

Lower. Model development speed is increased since you don’t need to build the infrastructure required for batch predictions or model training, as BigQuery ML leverages the BigQuery computational engine.

19
Q

AutoML: Programming ability needed?

A

No, AutoML is codeless.

20
Q

Custom Training: Data science expertise needed?

A

Yes, to develop the training application and also to do some of the data preparation like feature engineering.

21
Q

What is an example of a sparse dataset?

A

Netflix recommendation system. One-hot encoded categorical dataset for ‘Have you watched this movie before?’ out of 100 movies.

Think why this is sparse.

22
Q

BigQueryML: Can control aspects of the training environment?

A

No.

23
Q

AutoML: Can control aspects of the training environment?

A

Limited. For image and tabular datasets, you can specify the number of node hours to train for, and whether to allow early stopping of training.

24
Q

Custom Training: Time to trained model?

A

Higher. More data preparation is required, and training application development is needed.

25
Q
A
26
Q
A