ML Implementations and Operations Flashcards
What is the name of the library used for making containers compatible with Sagemaker?
sagemaker-containers
What is the file structure of a training container on Sagemaker?
/opt/ml
├── input
│ ├── config
│ │ ├── hyperparameters.json
│ │ └── resourceConfig.json
│ └── data
│ └── <channel_name>
│ └── <input></input>
├── model
│
├── code
│ └──
</channel_name>
│
└── output
└── failure
What is the file structure of a docker image?
WORKDIR
* nginx.conf
* predictor.py
* serve/
* train/
* wsgi.py
What are production variants of a model?
They variants of a model that you can test on live traffic through the usage of weighted traffic redistribution. Allows you to perform A/B testing
True or False: Shadow Testing is an example of a production variant of a model
True
What is Sagemaker Neo?
It is a service that optimizes machine learning model execution for specific devices, allowing them to run on edge devices
True or False: Sagemaker Neo is composed of a singular compiler that optimizes your code
False, it is composed of a compiler and a runtime
What other AWS Service would you use with Sagemaker Neo to deploy a model to the Edge?
IoT Greengrass
True or False: Sagemaker Neo optimized models can be deployed to an HTTP endpoint
False, only HTTPS
True or False: There are no restirction to which instance a Sagemaker Neo optimized model can be deployed to
False, it must be deployed to the same type of instance as the one where it was optimized
True or False: IoT Greengrass uses Lambda inference applications
True
True or False: You can use KMS integrations to encrypt all the data on Sagemaker Notebooks and jobs
False, all notebook data can be encrypted, but on jobs only data /opt/ml/ and /tmp can be encrypted
True or False: When running a training job, inter-node training communication may be optionally encrypted with no impacts on performance (AKA inter-container traffic encryption)
False, it can result in increased training cost and time for deep learning
True or False: You can run Sagemaker jobs inside VPCs, but Notebooks are internet enabled by default. If this is disabled, your notebook needs an interface endpoint (PrivateLink) or NAT
Gateway, and allow outbound connections, for training and hosting to
work. The same is true for inference and training images.
True
What on Sagemaker can Cloudwatch log and monitor?
-Ground Truth (active workers, how much they are doing)
-Instance health metrics
-Invocations and latency of endpoints
What are some frameworks you can use to perform distributed training on Sagemaker?
Horovod and Parameter Servers
True or False: Sagemaker does not support Spot Training
False
What is Sagemaker Elastic Inference?
It is a feature that accelerates inference at a fraction of the cost of using a GPU
True or False: Sagemaker Elastic Inference works by leeting you select instance types optimized for inference when you choose your instance
False, it allows you to use an accelerator alongside a CPU instance
True or False: Elastic Inference cannot be applied to Sagemaker notebooks
False, it can
What built-in Sagemaker algorithms work with Elastic Inference?
-Image Classification and Object Detection
True or False: Elastic Inference works with both pre-built and custom MXNet, Tensorflow and PyTorch containers
True
Which use cases are good for Serverless Inference?
Use cases where traffic is unpredictable
What is Sagemaker Inference Recommender?
It is a Sagemaker feature that recommends the best instance type and configuration for your model deployments