Storage and Databases Flashcards
Block-level storage
Behaves like a physical hard drive
Updates only the pieces (blocks) of a file that change
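A minimal sketch of the "updates only the pieces that change" idea, using a toy in-memory model rather than any AWS API. The names BLOCK_SIZE, split_blocks, and changed_blocks are hypothetical, and the 4-byte block size is chosen only to keep the example small.

```python
BLOCK_SIZE = 4  # bytes per block; unrealistically small, for illustration only

def split_blocks(data, size=BLOCK_SIZE):
    """Split data into fixed-size blocks (the last block may be shorter)."""
    return [data[i:i + size] for i in range(0, len(data), size)]

def changed_blocks(old, new, size=BLOCK_SIZE):
    """Return the indexes of the blocks that differ between two versions."""
    old_b, new_b = split_blocks(old, size), split_blocks(new, size)
    length = max(len(old_b), len(new_b))
    return [
        i for i in range(length)
        if i >= len(old_b) or i >= len(new_b) or old_b[i] != new_b[i]
    ]

# Only the middle block changed, so only block 1 would need rewriting.
to_rewrite = changed_blocks(b"aaaabbbbcccc", b"aaaaBBBBcccc")
```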
Instance Stores
Provides temporary block-level storage for an EC2 instance
Disk storage that is physically attached to the host computer for an instance, and has the same lifespan as the instance
When the instance is terminated, you lose any data in the instance store
Use for temporary data only
Amazon Elastic Block Store (EBS)
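Provides block-level storage volumes that you can use with EC2 instances
Data on an EBS volume persists independently of the instance, so it remains available when you stop or terminate the EC2 instance (unless the volume is set to delete on termination)
To create a volume, define its configuration (such as size and type) and provision it
Then attach it to an EC2 instance
Back up the data using EBS snapshots: incremental backups in which the first snapshot copies all data and later snapshots record only the blocks that changed (deltas)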
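The incremental-snapshot idea can be sketched with a toy model in which a volume is a dict of block index to bytes. This is an illustration under stated assumptions, not how EBS is implemented or called: take_snapshot and restore are hypothetical helpers, and deleted blocks are not handled.

```python
def take_snapshot(volume, previous_state):
    """Return only the blocks that changed since the previous state (the delta)."""
    return {
        idx: data for idx, data in volume.items()
        if previous_state.get(idx) != data
    }

def restore(snapshots):
    """Rebuild a volume by applying deltas from oldest to newest."""
    volume = {}
    for delta in snapshots:
        volume.update(delta)
    return volume

volume = {0: b"boot", 1: b"logs", 2: b"data"}
snap1 = take_snapshot(volume, {})               # first snapshot: every block
volume[1] = b"LOGS"                             # one block changes
snap2 = take_snapshot(volume, restore([snap1])) # second snapshot: the delta only
```

In this sketch snap2 holds a single block, yet replaying snap1 and then snap2 reproduces the full current volume.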
Amazon Simple Storage Service (S3)
Data store: store and retrieve an unlimited amount of data
Store data as objects
Store objects in buckets
Max object size of 5 TB
Version objects
Create multiple buckets
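A toy in-memory stand-in for a versioned bucket, to make the object, bucket, and versioning terms concrete. ToyBucket and its put/get methods are hypothetical and are not the S3 API; real version IDs are opaque strings, while this sketch uses simple counters.

```python
class ToyBucket:
    """In-memory stand-in for a versioned bucket (hypothetical, not the S3 API)."""

    def __init__(self, name):
        self.name = name
        self._versions = {}  # object key -> list of bodies, oldest first

    def put(self, key, body):
        """Store a new version of the object and return its version number."""
        self._versions.setdefault(key, []).append(body)
        return len(self._versions[key])

    def get(self, key, version=None):
        """Return the latest version, or a specific one if a number is given."""
        history = self._versions[key]
        return history[-1] if version is None else history[version - 1]

bucket = ToyBucket("notes")
v1 = bucket.put("report.txt", b"draft")
v2 = bucket.put("report.txt", b"final")  # overwriting keeps the old version
```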
S3 Standard Storage class
Frequently accessed data
Stores data in a minimum of 3 Availability Zones
High availability
Standard Infrequent Access Storage class
Infrequently accessed data
Lower storage price, higher retrieval price
One Zone-Infrequent Access Storage class
Stores data in a single availability zone
Lower storage price than standard
Use if you want to save costs and can easily reproduce data in event of failure
S3 Intelligent Tiering
Ideal for data with unknown or changing access patterns
Small monthly monitoring and automation fee per object
S3 Glacier Instant Retrieval
For archived data that requires immediate access
S3 Glacier Flexible Retrieval
Low cost storage for data archiving
Retrieve objects within a few minutes to hours
S3 Glacier Deep Archive
Lowest-cost object storage
Ideal for archiving
Retrieve objects within 12 hours (standard retrieval; bulk retrieval can take up to 48 hours)
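The rules of thumb on the storage class cards above can be collapsed into one lookup. This is a study aid under simplifying assumptions, not an official selection tool: suggest_storage_class and its argument values are hypothetical, and real decisions also weigh pricing, minimum storage durations, and retrieval fees.

```python
def suggest_storage_class(access, retrieval="immediate", reproducible=False):
    """Suggest an S3 storage class from the rules of thumb on these cards.

    access: "frequent", "infrequent", "unknown", or "archive"
    retrieval: "immediate", "hours", or "half-day" (used only for archives)
    reproducible: True if the data is easy to recreate after a failure
    """
    if access == "unknown":
        return "S3 Intelligent-Tiering"
    if access == "frequent":
        return "S3 Standard"
    if access == "infrequent":
        # One Zone-IA trades resilience for a lower price.
        return "S3 One Zone-IA" if reproducible else "S3 Standard-IA"
    if access == "archive":
        return {
            "immediate": "S3 Glacier Instant Retrieval",
            "hours": "S3 Glacier Flexible Retrieval",
            "half-day": "S3 Glacier Deep Archive",
        }[retrieval]
    raise ValueError(f"unknown access pattern: {access!r}")
```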
S3 Outposts
Creates S3 buckets on AWS Outposts
Makes it easier to retrieve, store, and access data on AWS Outposts
Amazon Elastic File System (EFS)
File storage: multiple clients can access data that is stored in shared file folders
Storage server uses block storage with a local file system to organize files
Clients access through file paths
Ideal for use cases where a large number of services and resources need to access the same data at the same time
Multiple instances can access data in EFS at the same time
Reading and writing simultaneously
Linux file system
Regional resource
Scales up and down automatically
Unlike EBS, EFS stores data in and across multiple availability zones
Amazon Relational Database Service (RDS)
Managed service that automates tasks like hardware provisioning, database setup, patching, and backups
Integrates with other services to fulfill business and operational needs, such as using Lambda to query the db from a serverless application
Supported database engines include:
MySQL
PostgreSQL
Oracle
Microsoft SQL Server
Lift-and-shift migration
Migrate an existing db as-is to run on Amazon EC2
RDS supports all major db engines
Added benefits:
Automated patching
Backups
Redundancy
Failover
Disaster recovery
Amazon Aurora
Enterprise-class relational database
Compatible with:
MySQL
PostgreSQL
About 1/10th the cost of commercial dbs
Data replication
Up to 15 read replicas
Continuous backup to S3
Point in time recovery
Great if your workloads require high availability
Amazon DynamoDB
Nonrelational database: create tables to store and query data
NoSQL databases use structures other than rows and columns to organize data
Key-value pairs: data is organized into items (keys), and items have attributes (values)
Serverless database: no need to provision, patch, or manage servers, no installing, maintaining or operating software
Non-relational, NoSQL db
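A toy picture of the key-value model: each item is found by its key and carries its own set of attributes. The table dict and the put_item and get_item helpers here are hypothetical in-memory stand-ins, not DynamoDB client calls.

```python
table = {}  # primary key -> item (a dict of attributes)

def put_item(key, **attributes):
    """Store an item under its key, replacing any existing item."""
    table[key] = dict(attributes)

def get_item(key):
    """Look an item up by key; returns None if it does not exist."""
    return table.get(key)

put_item("user#1", name="Ana", email="ana@example.com")
# Items in the same table do not need to share the same attributes.
put_item("user#2", name="Ben", favorite_color="green")
```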
Amazon Redshift
Data warehousing service
Big data: historical analytics as opposed to operational analysis
Suited to historical data that is no longer changing
AWS Database Migration Service (DMS)
Helps to migrate dbs securely
Data is moved between a source and target db
Source db remains fully operational during migration
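Homogeneous databases (source and target use the same engine)
Microsoft SQL Server
Oracle
Heterogeneous databases (source and target use different engines)
Convert the schema using the AWS Schema Conversion Tool
Then use DMS to migrate the data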
Other uses:
Dev and test db migrations
DB consolidation: combining several dbs into a single db
Continuous db replication (disaster recovery, geo separation)
Amazon DocumentDB
Content management
Catalogs
Supports MongoDB workloads
Amazon Neptune
Graph DB service
Build and run applications that work with highly connected datasets, like:
Recommendation engines
Fraud detection
Knowledge graphs
Social networking
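To show why highly connected data suits a graph model, here is a small friends-of-friends recommendation over an adjacency dict. It is plain Python on made-up data, not a Neptune query; the friends graph and the recommend function are hypothetical.

```python
# Undirected friendship graph: person -> set of direct friends (made-up data)
friends = {
    "ana": {"ben", "cai"},
    "ben": {"ana", "dee"},
    "cai": {"ana", "dee", "eli"},
    "dee": {"ben", "cai"},
    "eli": {"cai"},
}

def recommend(person):
    """Rank friends-of-friends by the number of mutual friends."""
    scores = {}
    for friend in friends[person]:
        for candidate in friends[friend]:
            if candidate != person and candidate not in friends[person]:
                scores[candidate] = scores.get(candidate, 0) + 1
    # Most mutual friends first; ties broken alphabetically for stable output
    return sorted(scores, key=lambda name: (-scores[name], name))

suggestions = recommend("ana")  # dee shares two friends with ana, eli shares one
```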
Amazon Managed Blockchain
Create and manage blockchain networks with open-source frameworks
Distributed ledger system, lets multiple parties run transactions and share data without a central authority
Amazon Quantum Ledger Database (QLDB)
Immutable system of record
Complete history of all changes made to data
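One way to picture an immutable record with a complete change history is an append-only log in which each entry's hash covers the previous entry. The sketch below is an illustration of that general technique and makes no claim about QLDB's internals or API; ToyLedger is a hypothetical class.

```python
import hashlib
import json

class ToyLedger:
    """Append-only, hash-chained log (illustrative only)."""

    def __init__(self):
        self._entries = []

    def append(self, change):
        """Record a change, chaining its hash to the previous entry."""
        prev_hash = self._entries[-1]["hash"] if self._entries else ""
        payload = json.dumps(change, sort_keys=True)
        digest = hashlib.sha256((prev_hash + payload).encode()).hexdigest()
        self._entries.append({"change": change, "prev": prev_hash, "hash": digest})

    def history(self):
        """Return every change ever recorded, oldest first."""
        return [entry["change"] for entry in self._entries]

    def verify(self):
        """Recompute the chain; any edited entry breaks it."""
        prev_hash = ""
        for entry in self._entries:
            payload = json.dumps(entry["change"], sort_keys=True)
            expected = hashlib.sha256((prev_hash + payload).encode()).hexdigest()
            if entry["prev"] != prev_hash or entry["hash"] != expected:
                return False
            prev_hash = entry["hash"]
        return True

ledger = ToyLedger()
ledger.append({"car": "ABC123", "owner": "ana"})
ledger.append({"car": "ABC123", "owner": "ben"})
```

Because each hash depends on everything before it, changing an old entry after the fact makes verify() return False.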
Amazon ElastiCache
Adds caching layers on top of dbs to improve read times of common requests
Supports Redis and Memcached
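A caching layer in front of a database is often used in a cache-aside style: check the cache first and fall back to the database only on a miss. The sketch below uses a dict in place of Redis or Memcached and a counter in place of a real database; query_database and get are hypothetical names.

```python
db_calls = 0   # counts how often the "database" is actually hit
cache = {}     # stand-in for a Redis or Memcached cache

def query_database(key):
    """Pretend to run a slow database read."""
    global db_calls
    db_calls += 1
    return f"row-for-{key}"

def get(key):
    """Cache-aside read: serve from the cache, fill it on a miss."""
    if key in cache:
        return cache[key]
    value = query_database(key)
    cache[key] = value
    return value

first = get("product:42")   # miss: reads the database and fills the cache
second = get("product:42")  # hit: served from the cache
```

A real cache would also need an expiry or invalidation policy so that stale entries are refreshed when the underlying data changes.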
Amazon DynamoDB Accelerator (DAX)
In-memory cache for DynamoDB
Improves response times from single-digit milliseconds to microseconds