AWS Storage Flashcards by Anna George

Object Storage

Ex. Amazon S3
manage data as individual objects, rather than as blocks and sectors (block-based) or a file hierarchy (file-based)
accessed using a REST API (URL w/ HTTP methods like GET, PUT)

each object includes the data itself, metadata (data about the data), and globally unique identifier

due to its flat file structure, object storage has virtually unlimited scalability and allows the retention of massive amounts of unstructured data

How well did you know this?

Not at all

Perfectly

Block Storage

Ex. Amazon Elastic Block Store (EBS)

data is stored and managed in blocks within sectors and tracks
controlled by a server-based operating system
volumes appear as local disks to the operating systems and can be partitioned and formatted
you can use block storage devices as a boot volume

Use Cases: structured information
ie - file systems, databases, transactional logs, SQL databases and virtual machines (VMs)

How well did you know this?

Not at all

Perfectly

File Storage

Ex. Amazon Elastic File System
file based storage system manage data in a file hierarchy
file system is mounted via the network to a client computer where it then becomes accessible for reading and writing data
protocols used for accessing file systems include NFS or CIFS/SMB

How well did you know this?

Not at all

Perfectly

Amazon Simple Storage Service (S3)

Object-based storage system; an object is just a file, which is uploaded into buckets
object storage built to store and retrieve any amount of data from anywhere - web sites and mobile apps, corporate applications, and data from IoT sensors or devices
you can store any type of file in S3
S3 is designed to deliver 99% durability

Use Cases: backup & storage, application hosting, media hosting, software delivery, static website

How well did you know this?

Not at all

Perfectly

Files & Buckets

Buckets:
files are stored in buckets
buckets are root level folders; you can’t create buckets within buckets

S3 is a universal namespace so bucket names must be unique globally

you create your buckets within a region, so you’re data stays within regions
it is a best practice to create buckets in regions that are physically closest to your users to reduce latency

files can be anywhere from 0 bytes to 5 TB

How well did you know this?

Not at all

Perfectly

Objects (consist of….)

Consist of:

key (name of the object)
value (data made up of a sequence of bytes)
version ID (used for versioning)
metadata (data about the data that is stored)

How well did you know this?

Not at all

Perfectly

Amazon S3 Storage Class:

S3 Standard

S3 Storage Class (6 total)

durable, immediately available, frequently accessed

How well did you know this?

Not at all

Perfectly

Amazon S3 Storage Class:

S3 Intelligent-Tiering

S3 Storage Class (6 total)

automatically moves data to the most cost effective tier

How well did you know this?

Not at all

Perfectly

Amazon S3 Storage Class:

S3 Standard-IA

S3 Storage Class (6 total)

means infrequently accessed; durable, immediately available, infrequently accessed data

How well did you know this?

Not at all

Perfectly

Amazon S3 Storage Class:

S3 One-Zone-IA

S3 Storage Class (6 total)

lower cost for infrequently accessed data with less resilience

How well did you know this?

Not at all

Perfectly

Amazon S3 Storage Class:

S3 Glacier

S3 Storage Class (6 total)

archived data, retrieval times in minutes or hours

How well did you know this?

Not at all

Perfectly

Amazon S3 Storage Class:

S3 Glacier Deep Archive

S3 Storage Class (6 total) 
lowest cost storage class for long term retention

How well did you know this?

Not at all

Perfectly

Amazon S3 Additional Features:

Transfer Acceleration

Transfer Acceleration - speed up data uploads using CloudFront in reverse

How well did you know this?

Not at all

Perfectly

S3 Versioning

maintains multiple variants of an object in the same bucket
can be used to preserve, retrieve, and restore every version of every object in an S3 bucket

can be enabled at any time
once enabled can be suspended
Versioning should be enabled in source and destination buckets

How well did you know this?

Not at all

Perfectly

S3 Replication

You can replicate objects between different AWS Regions or within the same AWS region - Cross-Region Replication (CRR) and Same-Region Replication (SRR)

Why use replication?
meet compliance requirements for storing data at greater distances (CRR)
minimize latency for users who are closer to another AWS Region (CRR)
backup copy of your data in another AWS Region (CRR)
copy the objects to another S3 storage class
aggregate logs into a single bucket

How
enable the AWS Region in account
Enable versioning on source and destination buckets
Ensure S3 has permissions to both buckets
Configure replication

How well did you know this?

Not at all

Perfectly

S3 Replication:

Cross-Region Replication (CRR)

Study These Flashcards

used to copy objects across Amazon S3 buckets in different AWS Regions

S3 Replication:

Same-Region Replication (SRR)

Study These Flashcards

used to copy objects across Amazon S3 buckets in the same AWS Region; newer than CRR

Access S3 Bucket from EC2 with IAM Role

Study These Flashcards

Very common use case

2 ways to access S3 bucket with EC2 instance, or 2 ways to authenticate

copy into EC2, but you can have sensitive info now in your EC2 instance
create an IAM role with policy (which says this role is allowed to access that bucket) - this is utilizing temporary credentials. and prevents sensitive info from being leaked

Amazon Elastic Block Store (EBS)

Study These Flashcards

Provides persistent block storage volumes for use with Amazon EC2 instances in the AWS Cloud
Each Amazon EBS volume is automatically replicated within its Availability Zone to protect you from component failure, offering high availability and durability
EBS volume data persists independently of the life of the instance
EBS volumes do not need to be attached to an instance

EBS and Instances:

You can attach multiple EBS volumes to an instance
you cannot attach an EBS volume to multiple instances (use Elastic File Storage instead)
EBS volumes must be in the same AZ as the instances they are attached to

termination protection

is turned off by default and must be manually enabled (keeps the volume/data when the instance is terminated)
root EBS volumes are deleted on termination by default
extra non-boot volumes are not deleted on termination by default
the behaviour can be changed by altering the “DeleteOnTermination” attribute

EBS Volume Types:

Solid State Drives (SSD)

Study These Flashcards

EBS Volume Type
newer generation, higher performance; default volume type

EBS Provisioned IOPS SSD (IOI): highest performance SSD volume designed for latency-sensitive transactional workloads. Used for I/O-Intensive NoSQL and relational databases; most expensive

EBS General Purpose SSD (gp2): General Purpose SSD volume that balances price performance for a wide variety of transactional workloads. Used for boot volumes, low-latency interactive apps, dev & test

EBS Volume Types:

Hard Disk Drives (HDD)

Study These Flashcards

EBS Volume Type
older generation, cheaper

Throughput Optimized HDD (st1): Lowest cost HDD volume designed for frequently accessed, throughput intensive workloads. Used for big data, data warehouses, log processing

Cold HDD (sc1): Lowest cost HDD volume designed for less frequently accessed workloads. Used for colder data requiring fewer scans per day

Instance Store Volumes

Study These Flashcards

High performance local disks that are physically attached to the host computer on which an EC2 instance runs
instance stores are ephemeral which means the data is lost when powered off (non-persistent)
Ideal for temporary storage of information that changes frequently (buffers, caches, scratch data)
instance store volume root devices are created from AMI templates stored on S3
instance store volumes cannot be detached / reattached

Amazon Elastic File System (EFS)

Study These Flashcards

Newer file storage, not a block or object storage storage
a network-based file system that you connect to; you connect and then mount it to a file system.
Fully managed services that makes it easy to set up and scale file storage in the Amazon Cloud
provides a file system interface and uses the NFSv4.1 protocol

Use Cases: big data and analytics, media processing workflows, content management, web serving, home directories, etc.

Locations: data is stored across multiple AZ’s w/in a region

read after write consistency
pay for what you use (no pre-provisioning required)
can scale up to petabytes
EFS is elastic and grows and shrinks as you add and remove data
can concurrently connect 1 to 1000’s of EC2 instances, from multiple AZ’s
by default you can create up to 10 file systems per account

on premises access can be enabled via Direct Connect or AWS VPN

AWS Storage Gateway

Study These Flashcards

hybrid cloud storage service that gives you on-premises access to virtually unlimited cloud storage
seamlessly connects on-premises applications to cloud storage, caching data locally for low-latency access

Gateway Types (3):

tape gateway - virtual tape backup system
file gateway - accessed using SMB, NFS, or S3 API
volume gateway - block device accessed using iSCSI

Amazon S3 Additional Features: | Requester Pays

Requester Pays - the requester rather than the bucket owner pays for requests and data transfer

Amazon S3 Additional Features: | Tags

Tags - assign tags to objects to use in costing, billing, security, etc.

Amazon S3 Additional Features: | Events

Events - trigger notifications to SNS, SQS, or Lambda when certain events happen in your bucket

Amazon S3 Additional Features: | Static Web Hosting

Static Web Hosting - simple and massively scalable static website hosting

Amazon S3 Additional Features: | BitTorrent

BitTorrent - use the BitTorrent protocol to retrieve any publicly available object by automatically generating a .torrent file

Pricing

``` storage requests storage management pricing data transfer pricing transfer acceleration ```

EBS Snapshots

Take a backup of your volume Snapshots capture a point-in-time state of an instance stored on S3 does not provide granular backup (not a replacement for backup software) if you make periodic snapshots of a volume, the snapshots are incremental, which means that only the blocks on the device that have changed after your last snapshot are saved in the new snapshot Even though snapshots are saved incrementally, the snapshot deletion process is designed so that you need to retain only the most recent snapshot in order to restore the volume snapshots can only be accessed through the EC2 APIs EBS volumes are AZ specific but snapshots are regional specific with snapshots, you could transfer your data to a different AZ

Instance Store Volume vs EBS Snapshot

Instance Store Volume - physically attached to the host | EBS - attached over a network

AWS Storage Flashcards

(32 cards)