Relational Database Service (RDS) Flashcards

1
Q

What are the six different RDS engines?

A
  1. Microsoft SQL Server
  2. Oracle
  3. MySQL
  4. PostgreSQL
  5. MariaDB
  6. Amazon Aurora
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the advantages of RDS?

A
  1. You can provision and have a database up and running in minutes
  2. Multi-AZ
  3. Failover capacity (automatically)
  4. Automated backups
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

When would you use an RDS database?

A

Generally used for online transaction processing (OLTP) workloads, not analyzing large amounts of data (OLAP)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the difference between online transaction processing (OLTP) and online analytical processing (OLAP)?

A

OLTP processes data from transactions in real time (e.g. customer orders, banking transactions, payments, booking systems, etc.). All about data processing and completing a large number of small transactions in real time.

OLAP processes complex queries to analyze historical data (e.g. analyzing net profit figures from the last 3 years and sales forecasting). All about data analysis using large amounts of data, as well as complex queries that take a long time to complete.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

If you are given a scenario which asks which service would be recommended for an OLTP application, what would you recommend?

A

RDS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

If you are given a scenario which asks which service would be recommended for an OLAP application, what would you recommend?

A

Redshift

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Explain how RDS handles multi-AZ.

A

RDS creates an exact copy of your database in another AZ (and continuously replicates the data as you write to the production database).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Which RDS types can be configured as Multi-AZ?

A
  1. Microsoft SQL Server
  2. Oracle
  3. MySQL
  4. PostgreSQL
  5. MariaDB
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which RDS type is always configured as Multi-AZ

A

Aurora

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How does RDS handle unplanned failure or maintenance in an Multi-AZ configuration?

A

The applications are connecting to the database via a connection string (web address of the database, username and password). Amazon handles all the DNS for that web address. If there is an unplanned outage of the primary database, Amazon will detect the failure and point the web address at the standby database in another AZ (automated DNS failover handled by AWS) and that will be promoted to primary.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the main purpose for enabling Multi-AZ in RDS?

A

For disaster recovery, not performance (you cannot connect to the secondary database when the primary is active).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a read replica in RDS?

A

A read-only copy of your primary database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why would you use a read replica in RDS?

A

When you have read-heavy workloads to take the load off your primary database and boost performance. It is not used for disaster recovery.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Where can read replicas in RDS be stored?

A

In the same AZ, in a different AZ or even in a different region.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do you address a read replica in RDS (as opposed to the primary database)?

A

A read replica has its own unique DNS endpoint, separate from the primary database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Can you promote a read replica in RDS into its own independent database?

A

Yes, but this breaks the replication.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

In what scenario would you promote a read replica in RDS to its own database?

A

When you are doing online analytics processing and you are about to do a massive query towards your database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What feature in RDS must be enabled in order to deploy a read replica?

A

Automatic backups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How many read replicas in RDS are supported per database?

A

Up to 5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is Amazon Aurora?

A

It is a MySQL and PostgreSQL compatible relational database engine that combines the speed and availability of commercial databases with the cost-effectiveness of open-source databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How much better performance can you expect from Amazon Aurora than MySQL or PostgreSQL?

A

5x better performance than MySQL and 3x better performance than PostgreSQL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the minimum storage size of Amazon Aurora and the maximum?

A

It starts with 10 GB and scales in 10 GB increments up to 128 TB (with storage auto-scaling).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How is the data replicated in Amazon Aurora?

A

2 copies of your data are contained in each AZ, with a minimum of 3 AZs (at least 6 copies of data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is it mean that Aurora storage is self-healing?

A

Data blocks and disks are continuously scanned for errors and repaired automatically.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

How many copies of data can be lost without affecting database write availability in Amazon Aurora?

A

2 copies of data can be lost out of at least 6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

How many copies of data can be lost without affecting database read availability in Amazon Aurora?

A

3 copies of data can be lost out of at least 6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What are the three types of Amazon Aurora read replicas available?

A
  1. Aurora replicas (up to 15 read replicas)
  2. MySQL replicas (up to 5 read replicas)
  3. PostgreSQL replicas (up to 5 read replicas)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What are the benefits of Aurora read replicas over MySQL and PostgreSQL read replicas?

A
  1. You can have up to 15 read replicas vs only 5
  2. It replicates faster
  3. It has a low impact on performance on the primary database
  4. They can act as a failover target with no data loss
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Does Amazon Aurora support automated backups and snapshots?

A

Yes, automated backups are always enabled and do not impact performance. Snapshots are also available and do not impact performance and may be shared across accounts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is Amazon Aurora Serverless?

A

An on-demand, auto-scaling configuration for MySQL-compatible and PostgreSQL-compatible. Automatically starts up, scales down based on your application’s needs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

If you are given a scenario where you need the performance of Aurora, but you’re going to have spiky workloads, what RDS service would you recommend?

A

Amazon Aurora Serverless

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What are the use cases for Aurora Serverless?

A

Relatively simple, cost-effective option for infrequent, intermittent, or unpredictable workloads.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What is DynamoDB?

A
  1. Fast, flexible NoSQL database for all applications that need consistent, single-digit millisecond latency at any scale.
  2. Fully managed
  3. Supports both document and key-value data models
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Name some scenarios where DynamoDB would be a good fit?

A

Mobile, web, gaming, ad-tech, IoT, etc.

35
Q

What type of storage is used for DynamoDB?

A

SSD

36
Q

What type of read consistency is supported by DynamoDB by default?

A

Eventually consistent reads (gives the best read performance), but you can opt in for strongly consistent reads, if needed.

37
Q

How is availability handled in DynamoDB?

A

It is spread across three geographically distinct data centers.

38
Q

What is DynamoDB Accelerator (DAX)

A

Fully managed, highly available, in-memory cache with 10X performance improvements, and no need for developers to manage caching logic.

39
Q

How does DynamoDB Accelerator work?

A

Your application connects directly to DAX and if DAX determines there isn’t a cache hit, it queries DynamoDB for you.

40
Q

What is the pricing scheme for DynamoDB Accelerator (DAX)?

A

Pay-per-request pricing, with no minimum capacity

41
Q

When would you consider using DynamoDB Accelerator (DAX)?

A

Useful for new product line launches, for example.

42
Q

What security features are available with DynamoDB?

A
  1. Encryption at rest using KMS
  2. Site-to-site VPN
  3. Direct Connect (DX)
  4. IAM policies and roles
  5. Allows Fine-grained access
  6. Integrates with CloudWatch and CloudTrail
  7. Integrates with VPC endpoints
43
Q

What does ACID mean in terms of database transactions?

A

All or nothing

  1. ATOMIC (all changes to the data must be performed successfully or not at all)
  2. Consistent (data must be in a consistent state both before and after the transaction)
  3. Isolated (no other process can change the data while the transaction is running)
  4. Durable (changes from a transaction must persist)
44
Q

Can you use ACID methodology with DynamoDB?

A

Yes, you must use transactions when building applications that require coordinated inserts, deletes or updates to multiple items as part of a single logical business operation (one or more tables within a single AWS account and region)

45
Q

If you are given a scenario where they are talking about having atomicity, consistency, isolation and durability and you want to do this with DynamoDB, what feature should be enabled?

A

DynamoDB transactions

46
Q

List some use cases for DynamoDB transactions.

A
  1. Processing financial transactions
  2. Fulfilling and managing orders
  3. Building multiplayer game engines
  4. Coordinating actions across distributed components and services
47
Q

What are the three options for reads in DynamoDB?

A
  1. Eventual consistency
  2. Strong consistency
  3. Transactional
48
Q

What are the two options for writes in DynamoDB?

A
  1. Standard
  2. Transactional
49
Q

What is the maximum number of items that can be updated in one transaction in DynamoDB?

A

25 items (or 4 MB of data)

50
Q

What is DynamoDB On-Demand Backup and Restore?

A

It does full backups at any time with zero impact on table performance and availability, in the same region as the source table

51
Q

How long are DynamoDB On-Demand Backups stored?

A

They are retained until they are deleted.

52
Q

What is Point-in-Time Recovery in DynamoDB?

A

It protects against accidental writes or deletes and can restore to any point in the last 35 days and its done using incremental backups.

53
Q

Is Point-in-Time Recovery in DynamoDB enabled by default?

A

No

54
Q

What is the latest restorable time using Point-in-Time Recovery in DynamoDB?

A

5 minutes in the past

55
Q

What are DynamoDB Streams?

A

Time ordered sequence of item-level changes (inserts/updates/deletes) in a table

56
Q

How long does data get stored in DynamoDB Streams?

A

24 hours

57
Q

How can you achieve functionality like stored procedures in DynamoDB?

A

Using DynamoDB Streams combined with Lambdas (may not be an exam topic)

58
Q

What are global tables in DynamoDB?

A

They are managed multi-master, multi-region replication (for disaster recovery of high availability), based on DynamoDB Streams.

59
Q

List use case for using global tables in DynamoDB.

A

If you have a globally distributed application across the world

60
Q

What feature in DynamoDB do you need enabled to enable global tables?

A

DynamoDB Streams

61
Q

What is the replication latency for global tables in DynamoDB?

A

1 second

62
Q

What is MongoDB?

A

A document database that allows for scalability and flexibility with your data as well as robust querying and indexing features.

63
Q

What is Amazon DocumentDB?

A

A managed service that allows you to run MongoDB in the AWS cloud. It scales with your workload and safely stores your information.

64
Q

If you are given a scenario where you have an existing MongoDB database and you don’t want to have to refactor to move to AWS cloud, what service would you recommend?

A

Amazon DocumentDB

65
Q

How can you move an existing MongoDB database on-premises to AWS cloud?

A

Use AWS Migration Service to automate your database migration to Amazon DocumentDB

66
Q

What is Cassandra?

A

A distributed database (i.e. it runs on many machines) that uses NoSQL. It’s primarily used for big data solutions. Enterprises, such as Netflix, use Cassandra on their backend.

67
Q

What is Amazon Keyspaces?

A

A fully managed serverless service that allows you to run Cassandra in the AWS cloud.

68
Q

If you are given a scenario where you have an existing Cassandra database and you don’t want to have to refactor to move to AWS cloud, what service would you recommend?

A

Amazon Keyspaces

69
Q

What is a Graph database?

A

Data is stored just like you might sketch ideas on a sketch board. It stores nodes and relationships instead of tables or documents.

70
Q

What is Amazon Neptune?

A

A fully managed graph database service

71
Q

Name some use cases for using Amazon Neptune.

A
  1. Build connections between identities (e.g. social graphs and accelerate updates for ad targeting, personalization and analytics)
  2. Build knowledge graph applications (e.g. add topical data to product catalogs, and help users quickly navigate highly connected datasets)
  3. Detect fraud patterns (in financial and purchase transactions)
  4. Security graphs to improve IT security (proactively detect and investigate IT infrastructure using the layered security approach; visualize all infrastructure to plan, predict and mitigate risk)
72
Q

If you are given a scenario where you have a need for a graph database, what service would you recommend?

A

Amazon Neptune

73
Q

What is a ledger database?

A

It is a NoSQL database that is immutable, transparent, and has cryptographically verifiable transaction log that is owned by one authority. You cannot update a record; instead, an update adds a new record to the database.

74
Q

What is the most common use cases for ledger databases?

A
  1. It is used for cryptocurrencies such as Bitcoin, Ethereum, etc. (for transactions on the blockchain)
  2. Shipping companies use it to track items, boxes, shipping containers, deliveries, etc.
  3. Pharmaceutical companies use it to track creation and distribution of drugs and ensure no counterfeits are produced
75
Q

What is Amazon Quantum Ledger Database (QLDB)?

A

It is a fully managed ledger database that provides a transparent, immutable and cryptographically verifiable transaction log.

76
Q

What are common use cases for Amazon Quantum Ledger Database (QLDB)?

A
  1. Store financial transactions
  2. Reconcile supply chain systems
  3. Maintain a claims history
  4. Centralize digital records
77
Q

If you are given a scenario where you have an immutable database, what service would you recommend?

A

Amazon Quantum Ledger Database (QLDB)

78
Q

What is Time-Series Data?

A

Data points that are logged over a series of time, allowing you to track your data. Examples could include temperature readings from weather stations around the world.

79
Q

What are some examples of Time-Series Data?

A
  1. IoT sensors relay thousands, millions and billions of points of information depending on the setup. One use case is for agriculture.
  2. Analytics (large websites such as Netflix need to analyze incoming and outgoing web traffic
  3. DevOps applications (applications that change in response to users needs may need to be monitored continuously so they can scale correctly)
80
Q

What is Amazon Timestream?

A

A serverless, fully managed database service for time-series data.

81
Q

If you are given a scenario where you need to store a large amount of time-series data for analysis, what service would you recommend?

A

Amazon Timestream

82
Q

If you are given a scenario where it is talking about scaling issues with your database or bad read performance, what service would you recommend?

A

Read Replicas

83
Q

If you are given a scenario where it is talking about disaster recovery with your database, what service would you recommend?

A

Multi-AZ

84
Q

If you are given a scenario where it talks about needing a serverless relational database, what service would you recommend?

A

Aurora Serverless