High Level ML Services Flashcards
What is Amazon Comprehend
It is a managed ML Service focused on NLP.
What are some examples of Amazon Comprehend capabilities?
-Sentiment Detection
-Entity Identification
-Text Analysis
-PII detection and redaction
-Key Phrase Extraction
-Language Identification
-Syntactical Analysis
True or False: Amazon Comprehend can be trained on additional data
True
What is Amazon Comprehend Medical and how is it different from regular Comprehend?
It is a comprehend tuned to medical data and documents, being more performant on them and being able to detect Protected Health Information (PHI) (DetectPHI API)
What is Amazon Translate?
It is a managed ML Service focused text translation from one language to another.
True or False: Amazon Translate accepts inputs in CSV and TMX Format
True
What is the main use of AWS Transcribe?
Converting audio to text
How can Transcribe deal with confidential information on the audio being transcribed?
Transcribe can automatically remove Personally Identified Information using Redaction, but any other kind of confidential information will still pass through
Declare True or False for the statements below:
-Transcribe accepts only MP3 and MP4 as inputs, aside from streaming audio
-Transcribe is incapable of differentiating different speakers on the same audio
-Transcribe automatically identifies the language being spoken
-False, it accepts FLAC and WAV as well
-False, it can differentiate speakers as long as you provide the number of speakers in the audio
-True
-Transcribe supports the usage of custom vocabularies to increase transcription quality
What languages does Transcribe Streaming accept?
English, Spanish and French
What is the main use of AWS Polly?
Converting text to audio
True or False: Polly is incapable of customizing the pronunciation of words
False, this customization can be done through Lexicons in the SynthetizeSpeech operation
True or False: Polly pronunciation can be manipulated through the use of SSML on the input text
True
True or False: Speech marks can be used to encode where a sentence begins and ends on Polly
True
What are the main uses of AWS Rekognition?
Locating persons, objects, texts and scenes in images or videos.
Rekognition can use facial analysis and facial search to do people counting and user verification. What are the 2 types of face Rekognition can detect?
Rekognition can detect either a specific face from a catalog of faces or compare faces to famous celebrities.
How could AWS Rekognition be used to help with content moderation?
It can be used to detect inapropriate images or videos. Those can then be removed or flagged for manual review.
True or false: The Confidence Threshold for items to be flagged on moderation cases for Rekognition can be manually configured
True
True or False: When working with video, Rekognition inference data must come from S3, and prioritize higher framework over resolution
False, data must come from Kinesis Video Streams, and should prioritize higher resolution
How would you use Rekognition to detect labels it wasn’t trained on?
Retrain the model using images with custom labels
What is the main use of Amazon Forecast?
ML service that delivers forecasts based on time-series data
True or False: The AutoML Forecast feature allows Forecast to select the best model for your time series data
True
List all of Amazon Forecast’s algorithms
-CNN-QR
-DeepAR+
-Prophet
-NPTS
-Arima
-ETS
Describe each of the following Amazon Forecast algorithms:
-CNN-QR
-DeepAR+
-Prophet
-NPTS
-Arima
-ETS
-CNN-QR: Convolutional Neural Network – Quantile Regression. Best for datasets with hundreds of time series, accepts related historical time series data & metadata
-DeepAR+: RNN. Best for large datasets, accepts related forward-looking time series & metadata
-Prophet: Additive model with non-linear trends and seasonality
-NPTS: Non-Parametric Time Series. Good for sparse data. Has variants for seasonal / climatological forecasts.
-ARIMA: Autoregressive Integrated Moving Average. Commonly used for simple datasets (<100 time series).
-ETS: Exponential Smoothing. Commonly used for simple datasets (<100 time series).