Path3.Mod1.b - Automated Machine Learning - Featurization and Models Flashcards

1
Q

Differences between Training Data, Validation Data and Test Data

A
  • Training Data: Used to train your models
  • Validation Data: Used to tune hyperparameters.
  • Test Data: AutoML’s recommended model is tested at the end with this data if provided (preview only)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The problem with Validation Data in AutoML

A

AutoML uses the same data per iteration of Hyperparameter tuning, creating a model evaluation bias as the model continues to improve and fit to the Validation Data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define:
- Feature Engineering
- Featurization wrt Feature Engineering

A
  • Using domain knowledge to create features to help ML learn better.
  • The collective techniques of scaling and normalization coupled with Feature Engineering.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define Customizing Featurization and how to enable it

A

Additional feature engineering techniques (data manipulation) such as encoding and transforms. These can be enabled through ML Studio and the Python SDK

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define Ensemble Models

A

The combination of multiple models to improve machine learning results and predictive performance (ensemble learning).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

V WA, St heter

The two types of Ensemble Model Combining Methods and which uses a Meta-Model

A
  • Voting: predicts on weighted average of either predicted class probabilities (classification) or regression targets.
  • Stacking: combines heterogenous models and trains a meta-model based on the output from the individual models.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

LR EN

The two default Ensemble Meta-Models (and for which type of ML task)

A
  • LogisticRegression: for Classfication tasks
  • ElasticNet: for Regression and Forcasting tasks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define ONNX Format and its usage

A

Open Neural Network Exchange, an open format built for representing ML Models. You can convert your models to ONNX format to run on a variety of platforms

https://onnx.ai/

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

The ONNX Runtime does not support C#, only Python based models. Ergo you cannot run models in C# applications (T/F).

A

False. The ONNX Runtime also supports C#, so you can use ONNX formatted models in your C# apps

https://onnx.ai/

How well did you know this?
1
Not at all
2
3
4
5
Perfectly