Analytics and Machine Learning Services Flashcards
Data Warehouse
A storage solution that aggregates massive amounts of data from disparate sources. (ex: combine data from multiple systems). They support analytics, not used for transaction processing.
Redshift
A scalable data warehouse solution. Can handle exabyte level data. Improves speed and efficiency. Great for data consolidation, or running a DB that doesn’t require real-time processing, and performing queries based on predefined sets of dimensions.
Analytics
Querying or processing your data.
Athena
A serverless SQL query service for S3. Pay per query.
Glue
Prepare your data for analytics. Extract, Transform, Load (ETL) service that helps you better understand your data.
Kinesis
Analyze data and video streams in real time. Supports video, audio, and application logs.
Elastic MapReduce (EMR)
Process large amounts of data using Hadoop.
Data Pipeline
Helps move data between compute and storage services, either on AWS or on-premises. Can move data at specific intervals, or based on conditions. Sends notification on success or failure.
Rekognition
Automate image and video analysis. Can recognize face and text on videos or images.
Comprehend
A Natural Language Processing (NLP) Service. Uncovers insights and relationships in text. ex: review social media posts for keywords
Polly
Text-to-Speech Service. Mimics natural sounding human speech in multiple languages.
SageMaker
Build, train, and deploy machine learning models quickly. Ex: recommendation engine
Translate
Real time and batch language translations. Good for adding localizations to webapps.
Lex
Build chatbots, like Alexa.
CloudWatch
A collection of services that help you monitor and observe your cloud resources. Monitor application logs, visualize usage data, and set alarms for your meters.