SageMaker - Built-In Algorithms Flashcards
Does SageMaker handle the entire ML workflow?
Yes
What is a SageMaker notebook?
It is a managed Jupyter notebook instance that you spin up from the console.
Can you use scikit-learn, Spark, and TensorFlow from a SageMaker notebook?
Yes
Can you launch servers from your SageMaker notebook?
Yes
What is the SageMaker Input Mode S3 File Mode?
It is the default. It copies all of the data from S3 to the Docker container before training starts. This is fine for smaller datasets, but not for large ones.
What is the SageMaker Input Mode S3 Fast File Mode?
It streams the data from the S3 source instead of copying it first. It was introduced as a replacement for Pipe Mode.
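To make the difference between the S3 input modes concrete, here is a minimal sketch of one channel entry as it might appear in the InputDataConfig of a CreateTrainingJob request. The helper name s3_channel, the bucket, and the prefix are hypothetical placeholders; only the dictionary keys and the mode names are meant to mirror the API.

```python
# Sketch of one InputDataConfig channel for a SageMaker training job.
# The helper, bucket name, and prefix below are hypothetical placeholders.

VALID_S3_INPUT_MODES = {"File", "FastFile", "Pipe"}


def s3_channel(name, s3_uri, input_mode="File"):
    """Build a channel dict that reads from S3 using the given input mode."""
    if input_mode not in VALID_S3_INPUT_MODES:
        raise ValueError(f"unknown input mode: {input_mode!r}")
    return {
        "ChannelName": name,
        "InputMode": input_mode,
        "DataSource": {
            "S3DataSource": {
                "S3DataType": "S3Prefix",
                "S3Uri": s3_uri,
                "S3DataDistributionType": "FullyReplicated",
            }
        },
    }


# File mode (the default) copies the data before training starts.
file_channel = s3_channel("train", "s3://example-bucket/train/")

# Fast File mode streams objects from S3 as the training code reads them.
fast_channel = s3_channel("train", "s3://example-bucket/train/", "FastFile")
```

Switching modes only changes the InputMode value; the S3 location stays the same.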
What is the SageMaker Input Mode S3 Express One Zone Mode?
It is a high-performance storage class in a single AZ. It works with the other S3 input modes.
What is the SageMaker Input Mode Amazon FSx for Lustre?
This is for HPC workloads and offers hundreds of GB/s of throughput. It is really meant for large datasets.
What is the SageMaker Input Mode EFS Mode?
It uses Amazon EFS as the file system for the source data.
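The two file-system sources above (FSx for Lustre and EFS) are described to a training job with a file system data source instead of an S3 one. The sketch below assumes a hypothetical helper file_system_channel and made-up file system IDs and paths; the training job would typically also need VPC settings so it can reach the file system, which are not shown here.

```python
# Sketch of an InputDataConfig channel backed by a file system.
# The helper name, file system IDs, and directory paths are hypothetical.

VALID_FILE_SYSTEM_TYPES = {"EFS", "FSxLustre"}


def file_system_channel(name, file_system_id, file_system_type, directory_path):
    """Build a channel dict that mounts EFS or FSx for Lustre read-only."""
    if file_system_type not in VALID_FILE_SYSTEM_TYPES:
        raise ValueError(f"unknown file system type: {file_system_type!r}")
    return {
        "ChannelName": name,
        "DataSource": {
            "FileSystemDataSource": {
                "FileSystemId": file_system_id,
                "FileSystemType": file_system_type,
                "FileSystemAccessMode": "ro",  # training only needs to read
                "DirectoryPath": directory_path,
            }
        },
    }


lustre_channel = file_system_channel(
    "train", "fs-0123456789abcdef0", "FSxLustre", "/fsx/train"
)
efs_channel = file_system_channel(
    "train", "fs-0fedcba9876543210", "EFS", "/datasets/train"
)
```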
What is the Linear Learner Model in SageMaker?
It fits a linear model to the training data. It is used for numeric predictions (regression) and classification.
If your model training is taking too much time to get started, what can you do?
Use Pipe Mode, which streams the data instead of copying it first.
Does Linear Learner require normalized data?
Yes. This can be done in advance or within the model.
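As an illustration of normalizing in advance, the sketch below standardizes one feature column to zero mean and unit standard deviation using only the standard library. The function name standardize and the sample values are made up for this example. To normalize within the model instead, Linear Learner exposes a normalize_data hyperparameter; check the algorithm documentation for its accepted values.

```python
from statistics import mean, pstdev


def standardize(column):
    """Rescale a list of numbers to zero mean and unit standard deviation."""
    mu = mean(column)
    sigma = pstdev(column)
    if sigma == 0:
        # A constant column carries no information; map it to zeros.
        return [0.0 for _ in column]
    return [(x - mu) / sigma for x in column]


# Hypothetical feature column on a scale much larger than the other features.
ages = [20.0, 30.0, 40.0]
scaled_ages = standardize(ages)
```

Applying the same mean and standard deviation at inference time is the part that is easy to forget when normalizing outside the model.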
What kind of regularization does Linear Learner support?
L1 and L2.
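A small worked sketch of what the two penalties add to a loss: L1 penalizes the sum of absolute weights, and L2 penalizes the sum of squared weights. The function names and numbers are illustrative only, and the exact scaling inside Linear Learner (where the corresponding hyperparameters are l1 and wd) may differ from this textbook form.

```python
def l1_penalty(weights, strength):
    """L1 term: strength times the sum of absolute weights."""
    return strength * sum(abs(w) for w in weights)


def l2_penalty(weights, strength):
    """L2 term: strength times the sum of squared weights."""
    return strength * sum(w * w for w in weights)


def regularized_loss(data_loss, weights, l1=0.0, l2=0.0):
    """Add both penalty terms to an already computed data loss."""
    return data_loss + l1_penalty(weights, l1) + l2_penalty(weights, l2)


# Illustrative weights and strengths, not tuned values.
weights = [0.5, -2.0, 0.0]
loss = regularized_loss(1.0, weights, l1=0.1, l2=0.01)
```

L1 tends to push unimportant weights to exactly zero, while L2 shrinks all weights toward zero without eliminating them.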
What is XGBoost?
It is a boosted group of decision trees. New trees are made to correct the errors of the previous trees.
How can you prevent overfitting when using XGBoost?
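Use the subsample or eta hyperparameters. subsample trains each tree on a random fraction of the rows; eta shrinks each tree's contribution.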
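The boosting idea and the two hyperparameters above can be sketched in plain Python. This is a deliberately simplified toy, not XGBoost itself: it uses one-split trees (stumps) on a single feature and plain residuals, whereas the real library uses regularized trees and second-order gradients. All function names and data here are made up for illustration.

```python
import random


def fit_stump(xs, residuals):
    """Pick the threshold on a 1-D feature that minimizes squared error."""
    best = None
    for threshold in sorted(set(xs)):
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        if not left or not right:
            continue
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        error = sum((r - left_mean) ** 2 for r in left)
        error += sum((r - right_mean) ** 2 for r in right)
        if best is None or error < best[0]:
            best = (error, threshold, left_mean, right_mean)
    if best is None:
        # No valid split (all x equal): predict the mean residual everywhere.
        m = sum(residuals) / len(residuals)
        return (xs[0], m, m)
    return best[1:]


def predict_stump(stump, x):
    threshold, left_value, right_value = stump
    return left_value if x <= threshold else right_value


def boost(xs, ys, rounds=50, eta=0.3, subsample=0.8, seed=0):
    """Fit stumps sequentially, each one on the residuals of the ensemble so far."""
    rng = random.Random(seed)
    base = sum(ys) / len(ys)
    preds = [base] * len(xs)
    stumps = []
    n_sample = min(len(xs), max(2, int(subsample * len(xs))))
    for _ in range(rounds):
        # subsample: each tree only sees a random fraction of the rows.
        idx = rng.sample(range(len(xs)), n_sample)
        residuals = [ys[i] - preds[i] for i in idx]
        stump = fit_stump([xs[i] for i in idx], residuals)
        stumps.append(stump)
        # eta: shrink the new tree's correction before adding it.
        preds = [p + eta * predict_stump(stump, x) for p, x in zip(preds, xs)]
    return base, eta, stumps


def predict(model, x):
    base, eta, stumps = model
    return base + eta * sum(predict_stump(s, x) for s in stumps)


def mse(ys, preds):
    return sum((y - p) ** 2 for y, p in zip(ys, preds)) / len(ys)


xs = [float(i) for i in range(10)]
ys = [x * x for x in xs]
model = boost(xs, ys)
baseline_mse = mse(ys, [sum(ys) / len(ys)] * len(ys))
boosted_mse = mse(ys, [predict(model, x) for x in xs])
```

A smaller eta or subsample makes each round more conservative, which usually calls for more rounds but tends to generalize better.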
Is XGBoost memory or CPU bound?
Memory.
What is LightGBM?
A gradient boosting decision tree framework, similar to XGBoost.