Module 5 - Storage and Databases Flashcards
Instance stores
Provides temporary block-level storage for Amazon EC2 instance.
disk storage physically attached to host computer for EC2 instance - has same lifespan as instance
instance terminated - you lose any data in instance store
Amazon Elastic Block Store (Amazon EBS)
service that provides block-level storage volumes that you can use with Amazon EC2 instances.
o Stop/terminate Amazon EC2 instance, all data on attached EBS volume remains available.
o Create EBS volume – define configuration e.g. volume size and type and provision it.
o After you create EBS volume, can attach to Amazon EC2 instance.
o EBS volumes for data that needs to persist – important to back up data.
o Can take incremental backups of EBS volumes by creating Amazon EBS snapshots.
Amazon EBS Snapshots
o First backup taken of a volume copies all data.
o Subsequent backups – only blocks of data that have changed since most recent snapshot saved.
Incremental backups different from full backups
o All data in a storage volume copies each time backup occurs.
o Full backup includes data that has not changed since most recent backup.
Object storage
- Each object consists of data, metadata, and a key.
- Data might be image, video, text document, or any other type of file.
- Metadata contains information about what data is, how it is used, object size, and so on.
- Object’s key is its unique identifier.
- When you modify a file in block storage, only the pieces that are changed are updated.
o When a file in object storage is modified, the entire object is updated.
Amazon Simple Storage Service (Amazon S3)
- Service that provides object-level storage.
- Amazon S3 stores data as objects in buckets.
- Can upload any type of file to Amazon S3 e.g. images, videos, text files etc.
- Offers unlimited storage space.
- Maximum file size for object in Amazon S3 is 5 TB.
- Upload file to Amazon S3 – can set permissions to control visibility and access to it.
- Amazon S3 versioning feature to track changes to objects over time.
Amazon S3 Storage Classes
S3 Standard
S3 Standard-Infrequent Access (S3 Standard-IA)
S3 One Zone-Infrequent Access (S3 One Zone-IA)
S3 Intelligent-Tiering
S3 Glacier
S3 Glacier Deep Archive
S3 Standard
- Designed for frequently accessed data.
- Stores data in a minimum of three Availability Zones.
- Provides high availability for objects.
- Good choice for wide range of use cases – websites, content distribution, data analytics.
- Higher cost than other storage classes intended for infrequently accessed data and archival storage.
S3 Standard-Infrequent Access (S3 Standard-IA)
- Ideal for infrequently accessed data but requires high availability when needed.
- Similar to S3 Standard but has lower storage price and higher retrieval price.
- S3 Standard and S3 Standard-IA store data in minimum of three Availability Zones.
- S3 Standard-IA provides same level of availability as S3 Standard
lower storage price and higher retrieval price.
S3 One Zone-Infrequent Access (S3 One Zone-IA)
- Stores data in single Availability Zone.
- Has lower storage price than S3 Standard-IA.
• Good storage class to consider if following conditions apply: o You want to save costs on storage.
o You can easily reproduce your data in the event of an Availability Zone failure.
S3 Intelligent-Tiering
- Ideal for data with unknown or changing access patterns.
- Requires small monthly monitoring and automation fee per object.
- Amazon S3 monitors objects’ access patterns.
- Haven’t accessed object for 30 consecutive days – Amazon S3 automatically moves it to infrequent access tier: S3 Standard-IA.
- If you access object in infrequent access tier – Amazon S3 automatically moves it to frequent access tier: S3 Standard.
S3 Glacier
- Low-cost storage designed for data archiving.
- Able to retrieve objects within few minutes to hours.
- E.g. store archived customer records, older photos and video files.
S3 Glacier Deep Archive
• Lowest-cost object storage class ideal for archiving.
• Able to retrieve objects within 12 hours.
o Deciding between Amazon S3 Glacier and Amazon S3 Glacier Deep Archive – consider how quickly you need to retrieve archived objects.
File Storage
- Multiple clients e.g. users, applications, servers etc. can access data stored in shared file folders.
- Storage server uses block storage with local file system to organise files.
- Clients access data through file paths.
- File storage ideal for use cases in which large number of services and resources need to access same data at same time.
Amazon Relational Database Service (Amazon RDS)
• Service that enables you to run relational databases in AWS Cloud.
• Managed service that automates tasks e.g. hardware provisioning, database setup, patching, backups.
o Spend less time completing administrative tasks and more time using data to innovate your applications.
• Can integrate Amazon RDS with other services to fulfil business and operational needs.
o E.g. AWS Lambda to query database from serverless application.