Databases & Analytics Flashcards

1
Q

What is AWS’s responsibility with databases?

A

AWS offers managed databases. Meaning they handle operations, upgrades, patches, monitoring, alerts, and backups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does RDS stand for?

A

Relational Database Service

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What query language do AWS RDS use?

A

SQL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What types of databases can you create using AWS RDS?

A
  1. Postgres
  2. MySQL
  3. MariaDB
  4. Oracle
  5. Microsoft SQL Server
  6. Aurora (AWS Proprietary database)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Advantages of using RDS versus deploying DB on ECS?

A
  1. Automated provisioning, OS patching
  2. Continuous backups and restore to specific timestamp (Point In Time Restore)
  3. Monitoring dashboards
  4. Read replicas for improved read performance
  5. Multi AZ setup for DR (Disaster Recovery)
  6. Maintenance windows for upgrades
  7. Scaling capability (vertical and horizontal)
  8. Storage backed by EBS (gp2 or io I)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the only disadvantage of using RDS?

A

You can’t SSH into your instance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Amazon Aurora?

A

A proprietary technology from AWS. It is AWS cloud optimized and claims 5x performance improvement over MySQL on RDS, over 3x the performance of Postgres on RDS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Tell me a bit about Aurora storage

A

Aurora storage automatically grows in increments of 10GB, up to 64TB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How does Aurora cost compare to RDS?

A

Aurora costs more than RDS by 20%, but it is more efficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Is RDS in the AWS free tier?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Is Aurora in the AWS free tier?

A

No

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are RDS read replicas?

A

Scale the read workload of your DB. Data is only written to the main DB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How many read replicas can you have on AWS RDS?

A

15

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is RDS Multi-AZ?

A

Replication across AZ. It is the failover in case of issues with main DB. You read and write both to the main DB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How many Multi-AZ replicas can you have?

A

1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is RDS Multi-Region replicas?

A

Read replicas in other regions. Allow for disaster recovery and faster load performance for global reads. Still write to main.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the extra cost associated with RDS Multi-Region replicas?

A

There is a replication cost since it is cross region

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is Amazon elastiCache?

A

Used to get managed Redis or Memcached databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What are caches?

A

In-memory databases with high performance, low latency. Helps reduce load off databases for read intensive workloads

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Which service should you use if you use if you need high performance, low latency, in memory databases

A

Amazon ElastiCache

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Which is faster, RDS or ElastiCache?

A

ElastiCache is faster

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is DynamoDB?

A

Fully managed highly available with replication across 3 AZ. Scales to massive workloads, distributed ‘serverless’ database. Millions of requests per second, trillions of rows, 10s of TB of storage. NoSQL database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How fast is DynamoDB?

A

Fast and consistent performance. Single-digit millisecond latency - low latency retrieval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What type of data goes in DynamoDB?

A

Key/value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is DynamoDB Accelerator (DAX)?

A

Fully managed in-memory chache for DynamoDB.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

How fast is DynamoDB Accelerator (DAX)?

A

10x performance - single digit millisecond latency to microseconds latency when accessing your DynamoDB tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is the difference between ElastiCache and DAX at CCP level?

A

DAX is only used for and is integrated with DynamoDB, while ElastiCache can be used for other databases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is Redshift?

A

Redshift is based on PostgreSQL, but it’s not used for OLTP. Data warehouse service. It is OLAP
Columnar storage of data (instead of row based)
Massive Parallel Query Execution (MPP) helps do computations very quickly making it highly available

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is OLTP?

A

Online Transaction Processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is OLAP?

A

Online Analytical Processing (analytics and data warehousing)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

How often does Redshift load data?

A

Once every hour

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What type of performance does Redshift have over other data warehouses?

A

10x better performance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What scale does Redshift scale to?

A

PBs of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

How do you pay for Redshift?

A

Pay as you go

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Does Redshift have a SQL interface for performing the queries?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Does Redshift integrate with BI tools such as AWS Quicksight and Tableau

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What is Amazon EMR?

A

Helps create Hadoop clusters (Big Data) to analyze and process vast amounts of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What does EMR stand for?

A

Elastic MapReduce

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

Is Amazon EMR a database?

A

No

40
Q

Does Amazon EMR have auto-scaling?

A

Yes

41
Q

Does Amazon EMR have integration with Spot instances?

A

Yes

42
Q

What are the use cases for Amazon EMR?

A
  1. Data processing
  2. Machine learning
  3. Web indexing
  4. Big data
43
Q

What is Athena?

A

Fully server-less database with SQL capabilities

44
Q

What is Athena used for?

A

Used to query data in S3 and then output the results back to S3

45
Q

How is Athena secured?

A

IAM

46
Q

How do you pay for Athena?

A

Pay per query not the database

47
Q

Use cases for Athena?

A

one-time SQL queries, server-less queries on S3, log analytics

48
Q

What is Amazon QuickSight?

A

Server-less machine learning-powered business intelligence service to create interactive dashboards

49
Q

What is the go to tool in AWS for BI?

A

Amazon QuickSight

50
Q

Is AmazonQuickSight fast and scalable?

A

Yes

51
Q

How do you pay for Amazon QuickSight?

A

Per-session pricing

52
Q

Is Amazon QuickSight integrated with storage options?

A

Yes, integrated with RDS, Aurora, Athena, Redshift, S3…

53
Q

What is DocumentDB?

A

AWS’s version of MongoDB

Fully managed by AWS with replication across 3 AZ

54
Q

What type of database is DocumentDB?

A

NoSQL

55
Q

How does storage for DocumentDB scale?

A

Automatically grows in increments of 100GB up to 64 TB

56
Q

How fast is DocumentDB?

A

Can handle millions of requests per second

57
Q

What are the two NoSQL databases for AWS?

A
  1. DynamoDB

2. DocumentDB

58
Q

What is Amazon Neptune?

A

A fully managed graph database

59
Q

What is a graph dataset?

A

Good example is social network

60
Q

How available is Amazon Neptune?

A

Highly available across 3 AZ, up to 15 read replicas

61
Q

What is Amazon Neptune good for?

A

Build and run applications working with highly connected datasets - optimized for these complex and hard queries

62
Q

What is Amazon Neptune latency?

A

Can store billions of relations and query the graph with milliseconds latency

63
Q

What is Amazon QLDB?

A

Fully managed ,server-less, highly available, replication across 3 AZ. Used to review history of all changes made to your application data over time. Immutable system: no entry can be removed or modified and can be cryptographically verified

64
Q

What does QLDB stand for?

A

Quantum Ledger Database

65
Q

What is Amazon QLDB performance compared to common ledger blockchain frameworks?

A

2-3x better

66
Q

Can you use SQL to manipulate data in Amazon QLDB?

A

Yes

67
Q

What is difference between Amazon Managed Blockchain and Amazon QLDB?

A

Amazon QLDB does not have a decentralization component, in accordance with financial regulation rules

68
Q

What is Amazon Managed Blockchain?

A

A managed service to

  1. Join public blockchain networks
  2. Create your own scalable private network
69
Q

What is Amazon Managed Blockchain compatible with?

A
  1. Hyperledger fabric

2. ethereum

70
Q

Is Amazon Managed Blockchain decentralized?

A

Yes

71
Q

What is DMS (Database Migration Service)?

A

Quickly and securely migrate databases to AWS, resilient, self healing.

72
Q

When using DMS do you need to take the source database offline?

A

No, it can remain available during the migration

73
Q

What types of migrations does DMS support?

A
  1. Homogeneous

2. Heterogeneous

74
Q

What is homogeneous migrations?

A

Ex: Oracle to Oracle

75
Q

What is heterogeneous migrations?

A

Ex: Microsoft SQL Server to Aurora

76
Q

What is AWS Glue?

A

A managed extract, transform, and load (ETL) service. Fully server-less

77
Q

What is AWS Glue useful for?

A

Useful to prepare and transform data for analytics

78
Q

Give an example of how AWS Glue works?

A
  1. Extract data from S3 bucket and Amazon RDS
  2. Glue ETL transforms the data
  3. Load the data into Redshift
79
Q

What is Glue Data Catalog?

A

Catalog of datasets

80
Q

What AWS options do you have for Relational databases? (OLTP)

A
  1. RDS

2. Aurora

81
Q

You want to create a decentralized blockchain on AWS. Which AWS service would you use?

  1. DocumentDB
  2. QLDB
  3. Managed Blockchain
  4. QuickSight
A

Managed Blockchain

82
Q

Which AWS database is a data warehouse?

  1. DynamoDB
  2. ElastiCahce
  3. Redshift
  4. RDS
A

Redshift

83
Q

Which AWS database is fully server-less and has SQL capabilities?

  1. RDS
  2. Aurora
  3. DynamoDB
  4. Athena
A

Athena

84
Q

You would like to use a server-less service to prepare data so it can be loaded for analytics. Which service would you use?

  1. Athena
  2. RDS
  3. Glue
  4. ElastiCache
A

Glue

85
Q

Which relational database is a proprietary technology from AWS and is cloud-optimized?

  1. DynamoDB
  2. Oracle
  3. Athena
  4. Aurora
A

Aurora

86
Q

You would like to migrate databases to AWS while still being able to use the database during the migration. What service allows you to do this?

  1. Elastic MapReduce(EMR)
  2. Database Migration Service (DMS)
  3. AWS Storage Gateway
  4. Snowball
A

Database Migration Service (DMS)

87
Q

How can you create Hadoop clusters to analyze and process a vast amount of data?

  1. EMR
  2. Athena
  3. EC2 Instances
  4. Redshift
A

EMR

88
Q

Which in-memory AWS database can you use to reduce the load off databases and has high performance, low latency?

  1. Redshift
  2. ElastiCache
  3. RDS
  4. DynamoDB
A

ElastiCache

89
Q

What is the name of a central repository to store structural and operational metadata for data assets in AWS Glue?

  1. Glue Data Table
  2. Glue Data Dictionary
  3. Glue Data Catalog
A

Glue Data Catalog

90
Q

Which of the following databases is a managed service with SQL capability suited for Online Transaction Processing (OLTP)?

  1. DynamoDB
  2. Redshift
  3. RDS
  4. MySQL on EC2
A

RDS

91
Q

Which AWS service is an immutable ledger database?

  1. QLDB
  2. EMR
  3. Managed Blockchain
  4. Inspector
A

QLDB

92
Q

You would like to set up a NoSQL database that can scale with no downtime and can handle millions of request per second. Which AWS database is best suited for this work?

  1. DynamoDB
  2. RDS
  3. Redshift
  4. Athena
A

DynamoDB

93
Q

Which AWS service can create complex graphs for fraud detection?

  1. Redshift
  2. Neptune
  3. QLDB
  4. Glue
A

Neptune

94
Q

Which AWS server-less service can use machine learning-powered business intelligence to create interactive dashboards such as business analytics?

  1. QuickSight
  2. Aurora
  3. Athena
  4. Managed Blockchain
A

QuickSight

95
Q

A company would like to set up a fully managed MongoDB database. Which AWS database is best-suited for this task?

  1. DocumentDB
  2. ElastiCache
  3. RDS
  4. Neptune
A

DocumentDB

96
Q

Which exclusive DynamoDB feature is an in-memory cache that can improve your performance up to 10x?

  1. ElastiCache
  2. Edge Locations
  3. DynamoDB Accelerator
  4. Snowball Edge
A

DynamoDB Accelerator

97
Q

RDS Multi-AZ deployments’ main purpose is high availability, while RDS Read replicas’ main purpose is scalability?

  1. True
  2. False
A

True