AWS Storage Flashcards

1
Q

Object Storage

A

Ex. Amazon S3
manage data as individual objects, rather than as blocks and sectors (block-based) or a file hierarchy (file-based)
accessed using a REST API (URL w/ HTTP methods like GET, PUT)

each object includes the data itself, metadata (data about the data), and globally unique identifier

due to its flat file structure, object storage has virtually unlimited scalability and allows the retention of massive amounts of unstructured data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Block Storage

A

Ex. Amazon Elastic Block Store (EBS)

data is stored and managed in blocks within sectors and tracks
controlled by a server-based operating system
volumes appear as local disks to the operating systems and can be partitioned and formatted
you can use block storage devices as a boot volume

Use Cases: structured information
ie - file systems, databases, transactional logs, SQL databases and virtual machines (VMs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

File Storage

A

Ex. Amazon Elastic File System
file based storage system manage data in a file hierarchy
file system is mounted via the network to a client computer where it then becomes accessible for reading and writing data
protocols used for accessing file systems include NFS or CIFS/SMB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Amazon Simple Storage Service (S3)

A

Object-based storage system; an object is just a file, which is uploaded into buckets
object storage built to store and retrieve any amount of data from anywhere - web sites and mobile apps, corporate applications, and data from IoT sensors or devices
you can store any type of file in S3
S3 is designed to deliver 99% durability

Use Cases: backup & storage, application hosting, media hosting, software delivery, static website

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Files & Buckets

A

Buckets:
files are stored in buckets
buckets are root level folders; you can’t create buckets within buckets

S3 is a universal namespace so bucket names must be unique globally

you create your buckets within a region, so you’re data stays within regions
it is a best practice to create buckets in regions that are physically closest to your users to reduce latency

files can be anywhere from 0 bytes to 5 TB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Objects (consist of….)

A

Consist of:

  • key (name of the object)
  • value (data made up of a sequence of bytes)
  • version ID (used for versioning)
  • metadata (data about the data that is stored)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Amazon S3 Storage Class:

S3 Standard

A

S3 Storage Class (6 total)

durable, immediately available, frequently accessed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Amazon S3 Storage Class:

S3 Intelligent-Tiering

A

S3 Storage Class (6 total)

automatically moves data to the most cost effective tier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Amazon S3 Storage Class:

S3 Standard-IA

A

S3 Storage Class (6 total)

means infrequently accessed; durable, immediately available, infrequently accessed data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Amazon S3 Storage Class:

S3 One-Zone-IA

A

S3 Storage Class (6 total)

lower cost for infrequently accessed data with less resilience

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Amazon S3 Storage Class:

S3 Glacier

A

S3 Storage Class (6 total)

archived data, retrieval times in minutes or hours

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Amazon S3 Storage Class:

S3 Glacier Deep Archive

A
S3 Storage Class (6 total) 
lowest cost storage class for long term retention
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Amazon S3 Additional Features:

Transfer Acceleration

A

Transfer Acceleration - speed up data uploads using CloudFront in reverse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

S3 Versioning

A

maintains multiple variants of an object in the same bucket
can be used to preserve, retrieve, and restore every version of every object in an S3 bucket

can be enabled at any time
once enabled can be suspended
Versioning should be enabled in source and destination buckets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

S3 Replication

A

You can replicate objects between different AWS Regions or within the same AWS region - Cross-Region Replication (CRR) and Same-Region Replication (SRR)

Why use replication?
meet compliance requirements for storing data at greater distances (CRR)
minimize latency for users who are closer to another AWS Region (CRR)
backup copy of your data in another AWS Region (CRR)
copy the objects to another S3 storage class
aggregate logs into a single bucket

How
enable the AWS Region in account
Enable versioning on source and destination buckets
Ensure S3 has permissions to both buckets
Configure replication

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

S3 Replication:

Cross-Region Replication (CRR)

A

used to copy objects across Amazon S3 buckets in different AWS Regions

17
Q

S3 Replication:

Same-Region Replication (SRR)

A

used to copy objects across Amazon S3 buckets in the same AWS Region; newer than CRR

18
Q

Access S3 Bucket from EC2 with IAM Role

A

Very common use case

2 ways to access S3 bucket with EC2 instance, or 2 ways to authenticate

  • copy into EC2, but you can have sensitive info now in your EC2 instance
  • create an IAM role with policy (which says this role is allowed to access that bucket) - this is utilizing temporary credentials. and prevents sensitive info from being leaked
19
Q

Amazon Elastic Block Store (EBS)

A

Provides persistent block storage volumes for use with Amazon EC2 instances in the AWS Cloud
Each Amazon EBS volume is automatically replicated within its Availability Zone to protect you from component failure, offering high availability and durability
EBS volume data persists independently of the life of the instance
EBS volumes do not need to be attached to an instance

EBS and Instances:

  • You can attach multiple EBS volumes to an instance
  • you cannot attach an EBS volume to multiple instances (use Elastic File Storage instead)
  • EBS volumes must be in the same AZ as the instances they are attached to

termination protection

  • is turned off by default and must be manually enabled (keeps the volume/data when the instance is terminated)
  • root EBS volumes are deleted on termination by default
  • extra non-boot volumes are not deleted on termination by default
  • the behaviour can be changed by altering the “DeleteOnTermination” attribute
20
Q

EBS Volume Types:

Solid State Drives (SSD)

A

EBS Volume Type
newer generation, higher performance; default volume type

EBS Provisioned IOPS SSD (IOI): highest performance SSD volume designed for latency-sensitive transactional workloads. Used for I/O-Intensive NoSQL and relational databases; most expensive

EBS General Purpose SSD (gp2): General Purpose SSD volume that balances price performance for a wide variety of transactional workloads. Used for boot volumes, low-latency interactive apps, dev & test

21
Q

EBS Volume Types:

Hard Disk Drives (HDD)

A

EBS Volume Type
older generation, cheaper

Throughput Optimized HDD (st1): Lowest cost HDD volume designed for frequently accessed, throughput intensive workloads. Used for big data, data warehouses, log processing

Cold HDD (sc1): Lowest cost HDD volume designed for less frequently accessed workloads. Used for colder data requiring fewer scans per day

22
Q

Instance Store Volumes

A

High performance local disks that are physically attached to the host computer on which an EC2 instance runs
instance stores are ephemeral which means the data is lost when powered off (non-persistent)
Ideal for temporary storage of information that changes frequently (buffers, caches, scratch data)
instance store volume root devices are created from AMI templates stored on S3
instance store volumes cannot be detached / reattached

23
Q

Amazon Elastic File System (EFS)

A

Newer file storage, not a block or object storage storage
a network-based file system that you connect to; you connect and then mount it to a file system.
Fully managed services that makes it easy to set up and scale file storage in the Amazon Cloud
provides a file system interface and uses the NFSv4.1 protocol

Use Cases: big data and analytics, media processing workflows, content management, web serving, home directories, etc.

Locations: data is stored across multiple AZ’s w/in a region

read after write consistency
pay for what you use (no pre-provisioning required)
can scale up to petabytes
EFS is elastic and grows and shrinks as you add and remove data
can concurrently connect 1 to 1000’s of EC2 instances, from multiple AZ’s
by default you can create up to 10 file systems per account

on premises access can be enabled via Direct Connect or AWS VPN

24
Q

AWS Storage Gateway

A

hybrid cloud storage service that gives you on-premises access to virtually unlimited cloud storage
seamlessly connects on-premises applications to cloud storage, caching data locally for low-latency access

Gateway Types (3):

  1. tape gateway - virtual tape backup system
  2. file gateway - accessed using SMB, NFS, or S3 API
  3. volume gateway - block device accessed using iSCSI
25
Q

Amazon S3 Additional Features:

Requester Pays

A

Requester Pays - the requester rather than the bucket owner pays for requests and data transfer

26
Q

Amazon S3 Additional Features:

Tags

A

Tags - assign tags to objects to use in costing, billing, security, etc.

27
Q

Amazon S3 Additional Features:

Events

A

Events - trigger notifications to SNS, SQS, or Lambda when certain events happen in your bucket

28
Q

Amazon S3 Additional Features:

Static Web Hosting

A

Static Web Hosting - simple and massively scalable static website hosting

29
Q

Amazon S3 Additional Features:

BitTorrent

A

BitTorrent - use the BitTorrent protocol to retrieve any publicly available object by automatically generating a .torrent file

30
Q

Pricing

A
storage
requests
storage management pricing
data transfer pricing
transfer acceleration
31
Q

EBS Snapshots

A

Take a backup of your volume
Snapshots capture a point-in-time state of an instance
stored on S3

does not provide granular backup (not a replacement for backup software)
if you make periodic snapshots of a volume, the snapshots are incremental, which means that only the blocks on the device that have changed after your last snapshot are saved in the new snapshot
Even though snapshots are saved incrementally, the snapshot deletion process is designed so that you need to retain only the most recent snapshot in order to restore the volume

snapshots can only be accessed through the EC2 APIs
EBS volumes are AZ specific but snapshots are regional specific
with snapshots, you could transfer your data to a different AZ

32
Q

Instance Store Volume vs EBS Snapshot

A

Instance Store Volume - physically attached to the host

EBS - attached over a network