Module 5 - Storage and Databases Flashcards
What are Instance Stores?
Block-level storage volumes behave like physical hard drives - provide temporary block-level storage for an Amazon EC2 instance
- a disk storage that is physically attached to the host computer for an EC2 instance (when instance is terminated, you lose any data in the instance store)
- AWS recommends instance stores for use cases that involve temporary data which you do not need in the LT
What is Amazon EBS?
Amazon Elastic Block Store - a service that provides block-level storage volumes that you can use with Amazon EC2 instances
- if you stop or terminate an Amazon EC2 instance, all the data on the attached EBS volume remains available (main difference from normal instance stores) - ideal for data that requires retention
What are Amazon EBS Snapshots?
Allows you to take incremental backups of EBS volumes
What is Object Storage?
A technology that manages data as objects - each object consists of data (image, video, text doc), metadata (means supporting data), and a key (unique identifier)
What is Amazon S3? What are five traits?
A storage device.
- Store and retrieve an unlimited amount of data
- Store data as objects - store objects in buckets
- Upload a maximum object size of 5 TB
- Version objects
- Create multiple buckets
What are the six Amazon S3 storage classes?
- S3 standard
- designed for frequently accessed data
- 99.999999999% durability (probability of being intact after 1 year)
- data is stored in at least three Availability Zones (facilities)
- static website hosting (meaning a collection of each HTML file and each file is equivalent to a physical page of the actual site) - S3 Standard-IA
- used for data that is accessed less frequently, but requires rapid access when needed
- similar to S3 Standard, but has lower storage price and higher retrieval price
- data is stored in at least three Availability Zones
- perfect to store backups, disaster recovery files, or objects requiring LT storage - S3 One Zone-IA
- stores data in a single availability zone
- ideal: i) you want to save costs on storage, and ii) you can easily reproduce your data in the event of an Availability Zone failure - S3 Intelligent-Tiering
- ideal for data with unknown or changing patterns
- requires a small monthly monitoring and automation fee per object - S3 Glacier
- “free data” (e.g., save data for auditing purposes)
- low-cost storage designed for data archiving
- can lock data in vaults
- WORM - write once/read many
- three options of retrieval (minutes to hours) - S3 Glacier Deep Archive
- lowest-cost object storage class ideal for archiving
- able to retrieve objects within 12 hours
What is Amazon EBS? (what are the four characteristics)
Elastic Block Store
- Sizes up to 16 TiB (Tebibyte)
- Survive termination of their EC2 instance
- Solid state by default
- HDD options
What is Amazon EFS?
Amazon Elastic File System - a scalable file system used with AWS Cloud services and on-premises resources
- unique feature: as you add and remove files, Amazon EFS grows and shrinks automatically → it can scale on demand to petabytes without disrupting applications
What is the difference between Amazon EBS and Amazon EFS?
- EBS stores data in a single Availability Zone while EFS stores data across multiple Availability Zones
Define relational databases.
- data is stored in a way that relates it to other pieces of data
- uses SQL (Structured Query Language) to store and query data → allows data to be stored in an easily understandable, consistent, and scalable way
What is Amazon RDS?
Amazon Relational Database Service - a service that enables you to run relational databases in the AWS cloud
- automates tasks such as hardware provisioning, database setup, patching, and backups
- can integrate with other services (e.g., AWS Lambda)
- provides security options: encryption at rest (protecting data while stored) and encryption in transit (protecting data while it is being sent and received)
What are the six types of Amazon RDS database engines?
- Amazon Aurora
- PostgreSQL
- MySQL
- MariaDB
- Oracle Database
- Microsoft SQL Server
Define nonrelational databases.
Sometimes referred to as “NoSQL databases” because they use structures other than rows and columns to organise data
- basically the opposite of relational databases
What is Amazon DynamoDB?
- A key-value database service which delivers single-digit millisecond performance at any scale
- Massive throughput capabilities
- Granular API access
- Serverless
- Automatic scaling → as your database shrinks or grows, DynamoDB automatically scales to adjust for changes in capacity
- ideal for use cases that require high performance while scaling
What is Amazon Redshift?
A data warehousing service that you can use for big data analytics - offers ability to collect data from many sources and helps you to understand relationships and trends across your data