AWS-Module 5: Storage & Databases Flashcards
What is Amazon Elastic Block Store (EBS)?
An EBS is a virtual hard Drive that you can attach to your EC2 instance
With ______ you define the size, type and configurations of the volume you need. Provision the volume, and then attach it to your EC2 instance. From there, you can configure your application to write to the volume and you’re good to go. If you stop and then start the EC2 instance, the data in the volume remains
EBS
EBS allows you to take incremental backups of your data called ________
snapshots
It’s very important that you take regular ________ of your EBS volumes. This way, if a drive ever becomes corrupted, you haven’t lost your data. And you can restore that data from a ___________
snapshots
____________ are different from full backups, in which all the data in a storage volume copies each time a backup occurs. The full backup includes data that has not changed since the most recent backup.
Incremental backups
What is an instance store?
An instance store is disk storage that is physically attached to the host computer for an EC2 instance, and therefore has the same lifespan as the instance. When the instance is terminated, you lose any data in the instance store.
EC2 instances are virtual servers. If you start an instance from a stopped state, the instance might start on another host, where the previously used _______ volume does not exist. Therefore, AWS recommends _________ for use cases that involve temporary data that you do not need in the long term.
instance stores
What is Amazon Simple Storage Service (S3)?
it’s a data store that allows you to store and retrieve a virtually unlimited amount of data at any scale
Amazon S3 stores data as objects in __________.
buckets
The maximum file size for an object in Amazon S3 is ________.
5 TB
Data is stored as objects, but instead of storing them in a file directory, you store them in what we call __________
buckets
What are the different Amazon S3 storage classes?
- S3 Standard
- S3 Standard-infrequent access (S3 Standard- IA)
- S3 One zone -Infrequent Access (S3 One Zone-IA)
- S3 Intelligent-Tiering
- S3 Glacier Instant retrieval
- S3 Glacier Flexible retrieval
- S3 Glacier deep archive
- S3 Outposts
What is S3 Standard best for?
- Designed for frequently accessed data
- Stores data in a minimum of three availability zones
What is S3 Standard- Infrequent Access ( S3 Standard- IA) best for?
- Ideal for infrequently accessed data
- Similar to Amazon S3 standard but has a lower storage price and higher retrieval price
What is S3 One Zone-Infrequent Access (S3 One-Zone - IA) best for?
- Stores data in a single availability zone
- Has a lower storage price than Amazon S3 Standard - IA
What is S3 Intelligent- Tiering best for?
- Ideal for data with unknown or changing access patterns
- Requires a small monthly monitoring and automation fee per object
What is S3 Glacier Instant Retrieval best for?
- Works well for archived data that required immediate access
- Can retrieve objects within a few milliseconds
What is S3 Glacier flexible retrieval best for?
- Low cost storage designed for data archiving
- Able to retrieve objects within a few minutes to hours
What is S3 Glacier Deep Archive best for?
- Lowest cost object storage class ideal for archiving
- Able to retrieve objects within 12 hours
What is S3 Outposts best for?
- Creates S3 buckets to Amazon S3 Outposts
- Makes it easier to retrieve, store, and access data on AWS outposts
____________ and comes with 11 nines of durability. That means an object stored in ____________ has a 99.999999999 percent probability that it will remain intact after a period of 1 year.
S3 Standard
____________ is static website hosting, where a static website is a collection of HTML files and each file is akin to a physical page of the actual site. You can do this by simply uploading all your HTML, static web assets, and so forth into a bucket and then checking a box to host it as a static website. You can then enter the bucket’s URL and bam! Instant website.
Amazon S3
What is S3 Standard-Infrequent Access or S3 Standard-IA?
It’s used for data that is accessed less frequently but requires rapid access when needed. This means it’s a perfect place to store backups, disaster recovery files, or any object that requires long-term storage.
_______________ is used when you we need to retain data for several years for auditing purposes. And we don’t need it to be retrieved very rapidly
S3 Glacier Flexible Retrieval
_____________- are policies you can create that can move data automatically between tiers
Lifecycle policies
In ____________ each object consists of data, metadata, and a key
object storage
The data might be an image, video, text document, or any other type of file. Metadata contains information about what the data is, how it is used, the object size, and so on. An object’s key is its unique identifier.
when you modify a file in ___________, only the pieces that are changed are updated. When a file in __________ is modified, the entire object is updated.
block storage, object storage
What are the two factors you should consider when selecting a Amazon S3 storage class?
- How often you plan to retrieve your data
- How available you need your data to be
What is Amazon elastic file system (EFS)
Its a managed file system
With _______, you can keep existing file systems in place but let AWS do all the heavy lifting of the scaling and the replication. ________ allows you to have multiple instances accessing the data in ______ at the same time. It scales up and down as needed without you needing to do anything to make that scaling happen
EFS
________ is a scalable file system used with AWS Cloud services and on-premises resources. As you add and remove files, Amazon _____ grows and shrinks automatically. It can scale on demand to petabytes without disrupting applications
EFS
_________ volumes attach to EC2 instances and are an Availability Zone-level resource. In order to attach EC2 to _______, you need to be in the same AZ. You can save files on it. You can also run a database on top of it. Or store applications on it. It’s a hard drive. If you provision a two terabyte _____ volume and fill it up, it doesn’t automatically scale to give you more storage
Amazon EBS
In __________, multiple clients (such as users, applications, servers, and so on) can access data that is stored in shared file folders. In this approach, a storage server uses block storage with a local file system to organize files. Clients access data through file paths.
file storage
Compared to block storage and object storage, __________ is ideal for use cases in which a large number of services and resources need to access the same data at the same time.
file storage
What is relational database management system (RDBMS)
means you can store data in a way such that it relates to other pieces of data.
What does Lift-and-Shift mean?
is when you migrate your on-premise database to run on Amazon EC2 (to run your database in the cloud)
What is Amazon Relational Database Service (RDS)?
is a service that enables you to run relational databases in the AWS Cloud
In a ____________, data is stored in a way that relates it to other pieces of data. (ex. the coffee shop’s inventory management system. Each record in the database would include data for a single item, such as product name, size, price, and so on.)
relational database
Relational databases use ____________ to store and query data. This approach allows data to be stored in an easily understandable, consistent, and scalable way
structured query language (SQL)
What is Amazon Aurora?
helps to reduce your database costs by reducing unnecessary input/output (I/O) operations, while ensuring that your database resources remain reliable and available.
Consider Amazon Aurora if your workloads require high availability. It replicates six copies of your data across three Availability Zones and continuously backs up your data to Amazon S3.
What is Amazon DynamoDB?
It’s a serverless database, meaning you don’t need to manage the underlying instances or infrastructure powering it.
With __________, you create tables. A ___________ table, is just a place where you can store and query data. Data is organized into items, and items have attributes. Attributes are just different features of your data. If you have one item in your table, or 2 million items in your table, ___________ manages the underlying storage for you
DynamoDB
___________ stores this data redundantly across availability zones and mirrors the data across multiple drives under the hood for you and has a millisecond response time
DynamoDB
Relational databases, like a standard MySQL Database, require that you have a ___________, in place. That might consist of one, or many tables that might relate to each other. You then use SQL to query the data.
well defined schema
DynamoDB is a __________ database. ___________ databases tend to have simple flexible schemas, not complex rigid schemas, laying out multiple tables that all relate to each other
non-relational
In a ___________, you create tables. A table is a place where you can store and query data.
nonrelational database
__________ are sometimes referred to as “NoSQL databases” because they use structures other than rows and columns to organize data. One type of structural approach for ________________ is key-value pairs. With key-value pairs, data is organized into items (keys), and items have attributes (values)
Nonrelational databases
Amazon DynamoDB is a _________ service. It delivers single-digit millisecond performance at any scale
key-value database
DynamoDB is ________, which means that you do not have to provision, patch, or manage servers.
serverless
With _____, you can add and remove attributes from items in the table, at any time. Not every item in the table has to have the same attributes. This is great for datasets that have some variation from item to item. Because of this flexibility, you cannot run complex SQL queries on it. Instead, you would write queries based on a small subset of attributes that are designated as keys.
DynamoDB
What is Amazon Redshift?
is a data warehousing service that you can use for big data analytics. It offers the ability to collect data from many sources and helps you to understand relationships and trends across your data.
Once data becomes too complex to handle with traditional relational databases, you’ve entered the world of ____________. __________ are engineered specifically for this kind of big data, where you are looking at historical analytics as opposed to operational analysis.
Data warehouses
Data warehouses are best as long as your business question is ___________, then a data warehouse is the right solution for that line of business intelligence.
looking backwards at all
Amazon Redshift is a __________ as a service
data warehousing
What is Amazon database migration service (DMS)?
enables you to migrate relational databases, nonrelational databases, and other types of data stores
With _________, you move data between a source database and a target database. The source and target databases can be of the same type or different types. During the migration, your source database remains operational, reducing downtime for any applications that rely on the database
AWS DMS
_____________ is when you have several databases and want to consolidate them into one central database
Database consolidation