Amazon Analytics Flashcards
Amazon Athena
o Allows you to run SQL queries on S3
o Interactive query service that allows
you to analyze and query data located
in S3
o Completely server-less, nothing to
provision
o No need to setup complex
extract/transform/load processes
o Used to analyze log data stored in S3
o Can be used for the following:
• Query log files in S3
• Generate business reports in S3
• Analyze cost and usage reports
• Run queries on click stream data
Amazon CloudSearch
a service that makes it simple and cost effective to setup, manage, and scale a search solution for your website or application
Amazon Elastic Map Reduce (EMR)
- Used to process large amounts of data
- Helps you analyze and process vast
amounts of data by distributing the
computational work across a cluster of
virtual servers running in the AWS Cloud - The cluster is managed using an open-
source framework called Hadoop. - Amazon EMR lets you focus on crunching
or analyzing your data without having to
worry about time-consuming setup,
management, and tuning of Hadoop clusters
or the compute capacity they rely on.
Amazon Redshift
Cloud data warehousing/online
analytics processing —> process of
collecting, storing, and organizing
data lakes to provide analytics for
better decision making
Invented to help with online analytics
processing away from your production
database
Used for business intelligence (tools like
Cognos, SQL Server, Oracle Hyperion)
Used to pull in very large and complex
datasets
Use different type of architecture both
from a database perspective and
infrastructure layer
Amazon Elasticsearch Service
TBD
Amazon Kinesis
Kinesis easily collects, processes, and analyzes real time streaming data for dashboard application