Database & Analytics Flashcards
1
Q
Amazon Aurora
A
- AWS proprietary DB
- Supports PostgreSQL and MySQL
- AWS cloud optimized
- Storage automatically grows in increments of 10 GB, up to 128 TB
- Costs about 20% more than RDS, but is more efficient
2
Q
Amazon Aurora Serverless
A
- AWS proprietary DB
- Least management overhead
- PostgreSQL and MySQL
- No capacity planning needed
- Pay per second, can be more cost-effective
- Infrequent, intermittent or unpredictable workloads
3
Q
ElastiCache
A
- Managed Redis or Memcached DB
- In-memory DB with high performance and low latency
- Reduces load on the database for read-intensive workloads
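How a cache reduces database load can be sketched with the cache-aside pattern. This is a minimal illustration, not ElastiCache client code: a plain dict stands in for Redis or Memcached, and the names `SlowDatabase`, `CacheAside`, and `ttl_seconds` are hypothetical.

```python
import time


class SlowDatabase:
    """Stand-in for a relational database; counts how often it is queried."""

    def __init__(self, rows):
        self.rows = rows
        self.reads = 0

    def get(self, key):
        self.reads += 1
        return self.rows.get(key)


class CacheAside:
    """Check the cache first; on a miss, read the database and keep the result."""

    def __init__(self, db, ttl_seconds=60.0):
        self.db = db
        self.ttl_seconds = ttl_seconds
        # key -> (value, expiry time); a dict stands in for Redis/Memcached
        self._cache = {}

    def get(self, key):
        now = time.monotonic()
        entry = self._cache.get(key)
        if entry is not None and entry[1] > now:
            return entry[0]  # cache hit: the database is not touched
        value = self.db.get(key)  # cache miss: fall through to the database
        self._cache[key] = (value, now + self.ttl_seconds)
        return value


db = SlowDatabase({"user:1": "Alice"})
cached = CacheAside(db)
results = [cached.get("user:1") for _ in range(100)]
```

In this sketch, 100 reads of the same key reach the database only once; the other 99 are served from memory.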
4
Q
DynamoDB
A
- Fully managed
- Replication across 3 AZ
- NoSQL DB - Not relational
- Distributed serverless database
- Single-digit millisecond latency
- Standard & Infrequent Access (IA) Table Class
5
Q
DynamoDB Accelerator - DAX
A
- Fully Managed in-memory cache for DynamoDB
- Up to 10x performance improvement (microsecond latency for cached reads)
- Only for and integrated with DynamoDB
- NoSQL
6
Q
DynamoDB - Global Tables
A
- Make DynamoDB table accessible with low latency in multiple regions
- Read/write in any region
- Active/Active replication
7
Q
Redshift DB
A
- Based on PostgreSQL
- Not for OLTP (online transaction processing)
- Used for OLAP - online analytical processing (analytics and data warehousing)
- Columnar storage
- Massively Parallel Query Execution (MPP), highly available
- BI integration (Amazon QuickSight or Tableau)
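Why columnar storage suits OLAP can be sketched in a few lines. This is an illustration of the idea only, not how Redshift lays out data on disk; the sample `rows` and the helper `to_columns` are hypothetical.

```python
# Row-oriented layout: each record is stored together (typical for OLTP lookups).
rows = [
    {"order_id": 1, "region": "EU", "amount": 120.0},
    {"order_id": 2, "region": "US", "amount": 80.0},
    {"order_id": 3, "region": "EU", "amount": 50.0},
]


def to_columns(records):
    """Pivot a list of row dicts into one list per column."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


columns = to_columns(rows)

# An analytical query such as SUM(amount) reads a single column
# instead of touching every field of every row.
total_amount = sum(columns["amount"])
```

An aggregate over one field scans only that field's values, which is the access pattern analytics and data warehousing queries tend to have.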
8
Q
Redshift DB Serverless
A
- Not for OLTP; used for OLAP
- Auto provision and scale
- Pay for what you use (storage and compute during analysis)
- Use case: reporting, dashboarding applications, real-time analytics
9
Q
Amazon EMR (Elastic MapReduce)
A
- Create Hadoop Cluster (Big Data)
- Clusters can be made of hundreds of EC2 instances
- Use cases: data processing, machine learning, indexing, big data
10
Q
Amazon Athena
A
- Serverless query service
- Perform analytics against S3 objects
- Serverless SQL
- Pricing: $5 per TB scanned
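The pricing rule can be turned into a quick estimate. This is a rough sketch that uses only the $5 per TB rate from the card; the function name `estimate_scan_cost` is hypothetical, and treating 1 TB as 1024^4 bytes is an assumption.

```python
PRICE_PER_TB_USD = 5.0    # rate quoted on the card
BYTES_PER_TB = 1024 ** 4  # assumption: binary terabyte


def estimate_scan_cost(bytes_scanned: int) -> float:
    """Estimated query cost in USD for the given number of bytes scanned."""
    return bytes_scanned / BYTES_PER_TB * PRICE_PER_TB_USD


# Scanning a full 2 TB dataset versus scanning only 200 GB of it.
full_scan = estimate_scan_cost(2 * BYTES_PER_TB)
partial_scan = estimate_scan_cost(200 * 1024 ** 3)
```

Because cost follows the amount of data scanned, a query that reads less data costs proportionally less.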
11
Q
QuickSight
A
- Serverless
- Machine learning-powered
- Business intelligence (BI) service
- Go-to tool for BI
12
Q
Document DB
A
- NoSQL database compatible with MongoDB
- Fully managed, highly available with replication across 3 AZ
- Storage automatically grows in increments of 10 GB
- Auto-scales to workloads with millions of requests per second
13
Q
Amazon Neptune
A
- Fully managed graph database
- Example dataset: a social network
- Highly available across 3 AZ
- Up to 15 read replicas
- Use case: knowledge graphs (Wikipedia), fraud detection, recommendation engines, social networking
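The social-network use case comes down to traversing relationships. Below is a minimal sketch in plain Python, not a Neptune query: a dict of sets stands in for the graph, and `friends`, `people_within`, and `suggest_friends` are hypothetical names.

```python
from collections import deque

# Adjacency sets: who is directly connected to whom.
friends = {
    "alice": {"bob", "carol"},
    "bob": {"alice", "dave"},
    "carol": {"alice", "dave"},
    "dave": {"bob", "carol", "erin"},
    "erin": {"dave"},
}


def people_within(graph, start, max_hops):
    """Breadth-first search: map each reachable person to their hop distance."""
    distances = {start: 0}
    queue = deque([start])
    while queue:
        person = queue.popleft()
        if distances[person] == max_hops:
            continue  # do not expand beyond the hop limit
        for neighbor in graph.get(person, ()):
            if neighbor not in distances:
                distances[neighbor] = distances[person] + 1
                queue.append(neighbor)
    return distances


def suggest_friends(graph, person):
    """Friends of friends who are not already direct friends."""
    distances = people_within(graph, person, max_hops=2)
    return sorted(name for name, hops in distances.items() if hops == 2)


suggestions = suggest_friends(friends, "alice")
```

A graph database is built to answer this kind of multi-hop question directly, which is why recommendation engines and fraud detection appear among the use cases.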
14
Q
Amazon Timestream
A
- Fully managed, fast, scalable, serverless
- Time series database
- Store and analyze trillions of events per day
- Up to 1,000x faster and 1/10th the cost of relational databases
15
Q
Amazon QLDB (Quantum Ledger Database)
A
- Record financial transactions
- Fully managed, serverless, highly available, replication across 3 AZ
- Review history of all changes made to your application data over time
- Immutable system: no entry can be removed or modified (Cryptographically verifiable)
- Difference from Amazon Managed Blockchain: QLDB has no decentralization component, in accordance with financial regulations
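The "immutable and cryptographically verifiable" property can be illustrated with a hash chain. This is a toy sketch of the general technique, not QLDB's actual journal format or API; the `Ledger` class and its methods are hypothetical.

```python
import hashlib
import json

GENESIS_HASH = "0" * 64  # placeholder hash that precedes the first entry


class Ledger:
    """Append-only journal where each entry's hash covers the previous hash."""

    def __init__(self):
        self._entries = []

    @staticmethod
    def _digest(previous_hash, data):
        payload = json.dumps(data, sort_keys=True) + previous_hash
        return hashlib.sha256(payload.encode("utf-8")).hexdigest()

    def append(self, data):
        previous_hash = self._entries[-1]["hash"] if self._entries else GENESIS_HASH
        entry_hash = self._digest(previous_hash, data)
        self._entries.append(
            {"data": data, "previous_hash": previous_hash, "hash": entry_hash}
        )
        return entry_hash

    def verify(self):
        """Recompute every hash; any edited entry breaks the chain."""
        previous_hash = GENESIS_HASH
        for entry in self._entries:
            if entry["previous_hash"] != previous_hash:
                return False
            if self._digest(previous_hash, entry["data"]) != entry["hash"]:
                return False
            previous_hash = entry["hash"]
        return True


ledger = Ledger()
ledger.append({"account": "A", "amount": 100})
ledger.append({"account": "A", "amount": -30})
intact = ledger.verify()

# Simulate tampering with an earlier entry (reaching into internals on purpose).
ledger._entries[0]["data"]["amount"] = 1_000_000
tamper_detected = not ledger.verify()
```

Because every hash depends on the one before it, changing any past entry is detectable by recomputing the chain.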