SageMaker Flashcards
SageMaker
Fully managed service for developers and data scientists to build ML models
Collect and prepare data, Build and train ML models, then Deploy them and monitor prediction performance
Deploy with one click, automatic scaling, no servers to manage
Real-Time Inference
SageMaker model deployment method for real-time endpoints that make one prediction at a time
Serverless Inference
SageMaker model deployment method for workloads that have idle periods between traffic spikes and can tolerate cold starts
Asynchronous Inference
SageMaker model deployment method for requests with large payload sizes up to 1GB, long processing times, and near real-time latency requirements
Batch Transform
SageMaker model deployment method for getting predictions for an entire dataset
Automatic Model Tuning
SageMaker feature that automatically chooses hyperparameter ranges, search
strategy, maximum runtime of a tuning job, and early stop condition
Define the Objective Metric to determine how this feature actually tunes your model
Saves you time and money; Helps you avoid wasting money on suboptimal configurations
SageMaker Studio
SageMaker feature that offers end-to-end ML development from a unified interface
Team collaboration; deploy, tune, and debug ML models; automated workflows
SageMaker Data Wrangler
SageMaker feature for preparing tabular and image data for ML that is imported from S3
Single interface for data selection,
cleansing, exploration, visualization,
and processing
Data preparation, transformation and
feature engineering; SQL supported
Data Quality tool for ensuring ML training data does not negatively impact inference accuracy
ML Features
Inputs to ML models used during training and used for inference
For example, converting a rigid BirthDate value into a simple Age number
Important to have high quality features across your datasets in your company for re-use
SageMaker Feature Store
SageMaker feature that ingests features from a variety of sources, which become available in SageMaker Studio
Ability to define the transformation of data into feature from within the store
Can publish directly from SageMaker Data Wrangler into this feature
SageMaker Clarify
SageMaker feature for evaluating FMs on human-like factors, such as friendliness or humor
Leverage an AWS-managed team or bring your own employees for evaluation
Use built-in datasets or bring your
own dataset; offers built-in metrics and algorithms
Integrated with SageMaker Studio
Model Explainability
SageMaker Clarify feature that offers set of tools to help explain how MLs make predictions
Understand model characteristics as a whole prior to deployment; debug predictions provided by the model after it’s deployed
Helps increase the trust and understanding of the model
Bias Detection
SageMaker Clarify feature that offers ability to detect and explain biases in your
datasets and models
Measure bias using statistical metrics; specify input features and bias will be automatically detected
SageMaker Ground Truth
SageMaker feature for Reinforcement Learning from Human Feedback
RLHF: Model review, customization and evaluation; Align model to human preferences; human feedback included in “reward” function
Human feedback for ML includes creating or evaluating your models, and data generation or annotation like data labels
Reviewers include Amazon Mechanical Turk workers, your employees, or third-party vendors
SageMaker Model Cards
SageMaker feature for ML governance that allows you to document essential model information
Examples include intended uses, risk ratings, and training details