Udemy course - Sections 10-16 Flashcards
What does Amazon’s Relational Database Service (RDS) use?
EC2 instances, so you must choose an instance family/type
What are relational databases known as?
SQL databases (SQL stands for Structured Query Language)
What kind of database is RDS?
Online transaction processing (OLTP) type
what are 4 benefits of RDS DBs?
1) easy to set up
2) highly available
3) fault tolerant
4) scalable
common use cases of RDS?
online stores and banking systems
what service does encryption use for RDS DBs?
KMS - key management service
How are RDS instances and snapshots stored when encryption is set up?
encrypted at rest
What DB engines does RDS support?
SQL Server, Oracle, MySQL, PostgreSQL, Aurora, MariaDB
How do you scale RDS db?
increase instance size (compute/storage)
what kind of option do you have for heavy workloads on RDS DBs?
read replicas (scale out for reads/queries only)
What AZ option provides disaster recovery for RDS?
Multi-AZ
What is aurora?
AWS db offering in rds family
what 2 db engines is aurora compatible with?
MySQL and PostgreSQL (built for the cloud)
what does amazon aurora feature?
fault-tolerant, self-healing storage system that auto-scales up to 128 TB per database instance
5 features of dynamo db?
1) nosql db
2) key/value store and document store
3) non-relational, key/value type of db
4) serverless
5) push button scaling
what is redshift?
sql based data warehouse used for analytics
what kind of DB is redshift?
relational database used for OLAP (online analytical processing)
what does redshift use as an engine?
EC2 instances, so you have to choose an instance family/type
How many copies of your data does Redshift always keep?
3
what kind of backups does redshift provide?
continuous/incremental
what is EMR?
Managed cluster platform that simplifies running big data frameworks, including Apache Hadoop and Apache Spark
what is EMR used for?
Used for processing data for analytics and business intelligence
What is emr good for?
transforming and moving large amounts of data
what can you do with emr as a data mechanism?
ETL functions
What is ElastiCache?
1) Fully managed implementations of Redis and Memcached
2) ElastiCache is a key/value store
What does ElastiCache offer?
in-memory db offering high performance and low latency
What kinds of DBs can ElastiCache be placed in front of?
RDS/DynamoDB
what is Athena?
serverless query service that queries data in S3 using SQL
How can Athena be connected to other data sources?
through Lambda-based data source connectors
what formats can data be in for athena?
csv, tsv, json, parquet and orc formats
what does athena use to store information and schemas about db’s and tables?
AWS Glue (the Glue Data Catalog)
what is aws glue?
- Fully managed extract, transform and load (ETL) service
- Used for preparing data for analytics
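The three ETL stages can be sketched in a few lines. This is a plain-Python illustration only, assuming made-up order data and hypothetical function names; an actual Glue job runs on Apache Spark rather than code like this.

```python
import csv
import io
import json


def extract(csv_text):
    """Extract: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def transform(rows):
    """Transform: drop rows with no amount and convert strings to numbers."""
    cleaned = []
    for row in rows:
        if not row["amount"]:
            continue  # skip incomplete records
        cleaned.append(
            {"order_id": int(row["order_id"]), "amount": float(row["amount"])}
        )
    return cleaned


def load(rows):
    """Load: serialize rows as JSON Lines, ready to write to a target store."""
    return "\n".join(json.dumps(row) for row in rows)


# Hypothetical sample input; the second row is missing its amount.
raw = "order_id,amount\n1,9.50\n2,\n3,20\n"
print(load(transform(extract(raw))))
```

The incomplete second row is filtered out during the transform step, so only orders 1 and 3 are loaded.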
what environment does glue run on?
fully managed/scaled out apache spark
what does glue work with?
Works with data lakes (e.g. data on S3), data warehouses (including Redshift), and data stores (including RDS or EC2 databases)
What is a Kinesis data stream?
- Producers send data, which is stored in shards for 24 hours by default (retention can be extended, up to 365 days)
- Consumers process the data and save it to another service
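Each record a producer sends carries a partition key, and Kinesis hashes that key with MD5 to decide which shard stores the record. The following is a minimal sketch of that routing, assuming the 128-bit hash space is split evenly across shards (real shards have explicit hash key ranges that need not be even); the function names are hypothetical, not part of any AWS SDK.

```python
import hashlib

HASH_SPACE = 2 ** 128  # MD5 produces a 128-bit value


def shard_for(partition_key, shard_count):
    """Map a partition key to a shard index, assuming evenly split hash ranges."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    hash_value = int.from_bytes(digest, "big")
    return hash_value * shard_count // HASH_SPACE


def put_records(records, shard_count):
    """Group (partition_key, data) pairs into per-shard lists, keeping order."""
    shards = [[] for _ in range(shard_count)]
    for key, data in records:
        shards[shard_for(key, shard_count)].append(data)
    return shards
```

Records that share a partition key always land in the same shard, so a consumer reading that shard sees them in the order they were sent.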
2 things to know about kinesis data firehose?
- No shards, completely automated and elastically scalable
- Saves data directly to another service such as S3, Splunk, Redshift, or Elasticsearch
What is Kinesis Data Analytics?
Provides real-time SQL processing for streaming data
What is AWS Data Pipeline?
- Processes and moves data between different AWS compute and storage services
- Saves results to services including S3, RDS, DynamoDB, and EMR
what is quicksight?
* Business intelligence (BI) service
* Create and publish interactive BI dashboards for Machine Learning-powered insights
what is neptune?
Fully managed graph database service
3 things to know about DocumentDB?
- Fully managed document database service (non-relational)
- Supports MongoDB workloads
- Queries and indexes JSON data
2 things to know about QLDB?
* Fully managed ledger database for immutable change history
* Provides cryptographically verifiable transaction logging
1 thing to know about managed blockchain?
Fully managed service for joining public and private networks using Hyperledger Fabric and Ethereum
Which AWS database service offers seamless horizontal scaling?
DynamoDB
How can fault tolerance be added to an Amazon RDS database?
Using Multi-AZ
How can an organization enable microsecond latency for a DynamoDB database?
Use DynamoDB Accelerator (DAX)
Which AWS database service is a relational data warehouse?
Redshift
Why might an organization decide to move an on-premises database to Amazon RDS?
reduce operational overhead
How do you increase the capacity of an Amazon RDS database?
scaling vertically by changing instance types
Amazon DynamoDB is good for which use case?
unstructured data, flexible schema
A company needs to run ETL jobs on data stored in S3. Which service should they use?
AWS Glue