Databases Flashcards

1
Q

RDS runs on VMs

T or F

A

T

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

RDS is serverless

T or F

A

F

it is not serverless

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

aurora serverless is serverless

T or F

A

T

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

read replicas are used for scaling, not DR

T or F

A

T

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

must have auto backups turned on in order to deploy a read replica

T or F

A

T

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

You can have up to ___ read replica copies of any DB

A

5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

you can have read replicas of read replicas

T or F

A

T, but watch out for latency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

read replica facts:

each read replica will have its own DNS end point

you can have read replicas that have multi AZ

you can create read replicas of multi az source databases

read replicas can be promoted to be their own DB. THis breaks replication

you can have a read replica in a second region

A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

2 types of backups for rds

automated backups

database backups

A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

read replica facts

multi az

used to increase performance

must have backups turned on

can be in different regions

can be mysql, postgres, mariadb, oracle, aurora

can be promoted to master, this will break the read replica

A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

multi az tips

used for DR

you can force a failover from one az to another by rebooting the instance.

A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

This DB service is:

stored on ssd storage

spread across 3 geographically distinct data centers

eventual consistent reads (default)

strongly consistent reads

A

what is dynamo DB?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

consistency across all copies of data is usually reached within a second with this type of read. reoeating a read after a short time should return the updated data. (best read performance)

A

eventual consistent reads

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

A ____ consistent read returns a result that reflects all writes that received a successful response prior to the read

A

strongly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

This is a fully managed, highly available, in memory cache for dynamo DB

10x performance improvement

reduces request time from milliseconds to microseconds - even under load

no need for developers to manage cache

compatible with dynamo db api calls

A

dynamo db accelerator (DAX)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

dynamo db transactions notes:

multiple all or nothing operations

financial transactions

fulfilling orders

two underlying reads or writes - prepare/commit

up to 25 items or 4 mb of data

A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

this type of dynamo db capacity provides:

pay per request pricing

balance cost and performance

no minimum capacity

no charge for read/write - only storage and backups

pay more per request than with provisioned capacity

new product launches

A

on-demand capacity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

dynamo db on demand backup and restore notes:

full backups at any time

zero impact on table performance or availability

consistent within seconds and retained until deleted

operates withing same region as source code

A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

dynamo db point in time recovery notes:

protects against accidental ______ or deletes

restore to any point in the last ____ days

_____ backups

not enabled by default

latest restorable: ____ minutes in the past

A

writes

35

incremental

five

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

dynamo db ___ are time ordered sequence of item lvel changes in a table

they are stored for 24 hours

inserts, updates, and deletes

combine with lambda functions for functionality like stored procedures

A

streams

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

dynamo db global tables notes

managed multi master, multi region replication

globally distributed apps

based on dynamo db streams

multi region redundancy for dr or ha

no app rewrites

replication latency under one second

A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

DMS =

A

database migration service

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

dynamo db security

encyption at rest using ___

site to site ___

direct ____

IAM policies and ____

___ grained access

CW and CT

VPC endpoints

A

KMS

vpn

connect

roles

fine

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

____ is a fast and powerful, fully managed, petabyte scale data warehouse service in the cloud. Customers can start small for just .25 per hour with no commitments or upfront costs and scale toa. apetabyte or more for 1,000 per TB per year, less than a tenth of most other data warehousing solutions

A

redshift

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

_____ transaction example:

net profit for EMEA and pacific for the digital radio product. pulls in large number of records

sum of radios sold in EMEA

sum of radios sold in pacific

unit cost of radio in each region

sales price of each radio

salce price - unit cost

A

OLAP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

olap

A

online analytics processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

REdshift can be configured as follows

single node (160GB)

multi node

leader node (manages client connections and receives queries)

computer node (store data and perform queries and computations) up to 128 compute nodes

A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

redshit advanced ____

columnar data stores can be compressed much more than row based data stores because similar data is stored sequentially on disk. redshift employs multiple compression techniques and can often achieve significant compression relative to traditional relational indexes or materialized views, and so uses less space than traditional relational database systems. when loading data into an empty table, redshift automatically samples your data and selects the most appropriate compression scheme.

A

compression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

mpp =

A

massive parallel processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

___ ___ ___

redshift automatically distributes data and query loads across all nodes. redshift makes it easy to add nodes to your data warehouse and enables you to maintain fast query performance as your data warehouse grows.

A

massively parallel processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

redshift backups

enabled by default with a 1 day retention period

max retention period is 35 days

redshift always attempts to maintain at least three copies of your data (the original and replica on the compute nodes and a backup in s3)

redshift can also asynchronously replicate your snapshots to s3 in another region for disaster recovery.

A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

redshift pricing

compute node hours (total number of house ou run across all your compute nodes for the billing period. you are billed for 1 unit per node per hour, so a 3 node data warehouse cluster running persistently for an entire month would incur 2,160 instance hours. you will not be charged for leader node hours; only compute nodes will uncur charges.)

charged for backups

charged for data transfer (only within vpc, not outside it)

A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

redshift security considerations

encrypted in transit using SQL

encrypted at rest using AES-256 encryption

by default redshift takes care of key management

  • manage your own keys through HSM
  • AWS key management service
A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

redshift availability

___ AZ(s)

can restore snapshots to new AZs in event of an outage

A

1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What is aurora?

it is a mysql and postgresql compatible _____ db engine that combines the speed and availability of high end commercial databases with the eimplicity and cost efefctiveness of open source databases.

A

relational

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

aurora provides up to ___ x better performance than mysql and ___x better than postgres dbs at a much lower price point, whilst delivering similar performance and availability

A

5, 3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

THings to know about aurora

  1. start with __gb, scales in__gb increments to ___tb (storage autoscaling)
  2. compute resources can scale up to ___vCPUs and 244GB of RAM
  3. ___ copies of your data is contained in each AZ, with max of ___ AZs. ___ copies of your data.
A

10,10,64

34

2, 3, 6

38
Q

aurora is designed to transparently handle the loss of up to ___ copies of data without affecting db write avialbility and up to ___ copies without affecting read availability

A

2,3

39
Q

t or f

aurora storage is self healing. data blocks and disks are continuously scanned for errors and repaired automatically.

A

t

40
Q

three types of aurora replicas are available:

aurora replicas (how many?)

mysql read replicas (how many?)

postgresQL (how many?)

A

15, 5, 1

41
Q

t or f

backups are always enabled on aurora db instances

A

t

42
Q

t or f

backups impact db performance and must be done during slow traffic periods

A

false, they do not impact business

43
Q

t or f

aurora snapshots impact performance

A

f

they do not impact performance

44
Q

t or f

aurora snapshots cannot be shared with other aws accounts

A

f

they can

45
Q

aurora ____ is an on demand autoscaling capable edition of aurora. an aurora ___ db cluster automatically starts up, shuts down, and scales capacity up or down based on your apps needs.

A

serverless

46
Q

t or f

aurora serverless provides a relatively simple, cost effective option for infrequent, intermittent, or unpredictable workloads

A

t

47
Q

does memcached support simple cache to offload DB

A

yes

48
Q

does memcached support ability to scale horizontally

A

YES

49
Q

DOES MEMCAChed support multithreaded performance

A

yes

50
Q

does memcached support advanced data types

A

no

51
Q

does memcached support ranking/sorting data sets

A

no

52
Q

does memcached support pub/sub capabilities

A

no

53
Q

does memcached support persistence

A

no

54
Q

does memcached support multi AZ

A

no

55
Q

does memcached support backup and restore capabilities?

A

no

56
Q

does redis support simple cache to offload DB

A

yes

57
Q

does redis support ability to scale horizontally

A

yes

58
Q

does redis support multi threaded performance

A

no

59
Q

does redis support advanced data types

A

yes

60
Q

does redis support ranking/sorting data sets

A

yes

61
Q

does redis support ranking/sorting data sets

A

yes

62
Q

does redis support pub/sub capabilities

A

yes

63
Q

does redis support persistence?

A

yes

64
Q

does redis support multi az?

A

yes

65
Q

does redis support backup and restore capabilities?

A

yes

66
Q

use ___ to increase DB and web application performance

A

elasticache

67
Q

___ ___ ____ is a cloud service that makes it easy to migrate relational databses, data warehouses, nosql dbs, and other types of data stores. you cna use ___ __ ___ to migrate your data into the cloud, between on prem instances or between combinations of cloud and on prem setups.

A

database migration service (DMS)

68
Q

SCT = ?

A

schema creation tool

69
Q

t or f

you need SCT even if you are migrating to identical databases

A

f

you do not need sct if dbs are the same.

70
Q

DMS - the source can either be on prem or inside aws iteself or another provider such as azure

t or f

A

t

71
Q

t or f

dms allows you to migrate databses from one source to aws.

A

t

72
Q

t or f

you can do homogenous migrations(same db engines) or heterogenous migrations (different db engines)

DMS

A

t

73
Q

t or f

if you do a heterogenous migration with dms, you will need the aws schema conversion tool

A

t

74
Q

the follwing services have caching capabilities

api gateway

cloudfront

elasticache - memcached and redis

dynamodb accelerator (DAX)

A

yes

75
Q

emr = ?

A

elastic map reduce

76
Q

____ is the industry leading cloud big data platform for processing vast amounts of data using open source tools such as apache spark, apache hive, hbase, flink, hudi, presto. with ____ you can run petabyte scale analysis at less than half the cost of traditional on prem solutions and over 3x faster than standard apache spark

A

emr

77
Q

the central component of EMR is the ______

A

cluster

78
Q

EMR match the nodes:

master, core, task

  1. a node w/ sw components that only runs tasks and does not store data in HDFS. they are optional
  2. a node that manages the cluster. this node tracks the status of tasks and monitors the health of the cluster. every cluster has one.
  3. a node with sw components that runs tasks and stores data in the hadoop distributed file system (HDFS) on your cluster. multinode clusters have at least one.
A

1 = task

2 = master

3 = core

79
Q

emr archives log files to s3 at ___ minute intervals

A

5

80
Q

emr log files are available even after the cluster terminates?

t or f

A

t

81
Q

emr - by default log data is stored on core node.

t or f

A

f

data is stored on master

82
Q

t or f

EMR

you can configure replication to s3 on 5 min intervals for all log data from the master node, however, this can only be configured when creating the cluster for the first time.

A

t

83
Q

mysql default port is ___

A

3306

84
Q

When you add a rule to an RDS DB security group, you must specify a port number or protocol.

A

false

a destination port is needed, but the rds instance port numbers is automatically applied to the rds db sg.

85
Q

If you are using Amazon RDS Provisioned IOPS storage with a Microsoft SQL Server database engine, what is the maximum size RDS volume you can have by default?

A

16tb

86
Q

What happens to the I/O operations of a single-AZ RDS instance during a database snapshot or backup?

A

I/O may be briefly suspended while the backup process initializes (typically under a few seconds), and you may experience a brief period of elevated latency.

87
Q

In RDS, what is the maximum value I can set for my backup retention period?

A

35 days

88
Q
A
89
Q
A
90
Q
A