AWS Analytics Flashcards
1
Q
What is Amazon Redshift?
A
Redshift is a scalable data warehouse solution.
- Data warehousing solution
- Improves speed and efficiency
- Handles exabyte-scale data
2
Q
How to use Amazon Redshift in the real world?
A
- When consolidating multiple data sources for reporting
- When you want to run a database that doesn’t require real-time transaction processing (INSERT,UPDATE and DELETE)
3
Q
What is Amazon Athena?
A
Amazon Athena is query service for Amazon S3
- Query Service
- Analyze S3 data using SQL
- Pay per query
- Considered serverless
4
Q
What is Amazon Glue?
A
Amazon Glue prepares your data for analytics.
- Extract, Transform, Load (ETL) service
- Prepeare load data
- Helps to better understand your data
5
Q
What is Amazon Kinesis?
A
Amazon Kinesis allows you to analyze data and video streams in real time.
- Analyze real-time, streaming data
- Supports video, audio, application logs, website clickstreams, and IoT
NB: Kinesis helps analyze logs in near real time for application monitoring or fraud detection
6
Q
What is Elastic MapReduce (EMR)?
A
EMR helps you process large amounts of data.
- Process big data
- Analyze data using Hadoop
- Works with big data frameworks
7
Q
What is Data Pipeline?
A
Data Pipeline helps you move data between compute and storage services running either on AWS or on-premises.
- Moves data at specific intervals
- Moves data based on conditions
- Sends notifications on success or failure