Storage Flashcards

1
Q

EFS Performance Modes

A

Max I/O

General Purpose

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are AWS services that allow you to share file system across multiple EC2 instances?

A

Amazon EFS
Amanzon FSx for Windows
Amazon FSx for Lustre

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How long does it take to access data in

a. AWS Glacier
b. Glacier Deep Archive?

A

A. A few minutes to hours

B. Less than 12 hours

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Amazon S3 One Zone-Infrequent Access (S3 One Zone-IA)

Name typical use cases

A

S3 One Zone-IA stores data in a single AZ and costs 20% less than S3 Standard-IA

good choice for storing secondary backup copies of on-premises data or easily re-creatable data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Amazon S3 Intelligent-Tiering (S3 Intelligent-Tiering)

A

use when data access patterns change, without performance impact or operational overhead
S3 Intelligent-Tiering monitors access patterns and then moves objects that have not been accessed in 30 consecutive days to the Infrequent Access tier.

Once you have activated one or both of the Archive Access tiers, S3 Intelligent-Tiering will move objects that haven’t been accessed for 90 consecutive days to the Archive Access tier and then after 180 consecutive days of no access to the Deep Archive Access tier.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

List S3 Access Lifecycles in order of hottest (most accessible) to coldest (rarely accessed) and/or cost.

A
  1. Standard
  2. Standard Intelligent Tiering
  3. Standard Infrequently Accessed (IA)
  4. Standard IA - One Zone
  5. AWS Glacier
  6. AWS Deep Glacier
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

AWS Storage Gateway - Tape Gateway

A

AWS Storage Gateway - Tape Gateway allows moving tape backups to the cloud.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

AWS Storage Gateway - Volume Gateway

A

You can configure the AWS Storage Gateway service as a Volume Gateway to present cloud-based iSCSI block storage volumes to your on-premises applications.

Volume Gateway stores and manages on-premises data in Amazon S3 on your behalf and operates in either cache mode or stored mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

AWS Storage Gateway - Cached Volume Gateway mode

A

primary data is stored in Amazon S3, while retaining your frequently accessed data locally in the cache for low latency access

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

AWS Storage Gateway - File Gateway

A

AWS Storage Gateway’s file interface, or file gateway, offers you a seamless way to connect to the cloud in order to store application data files and backup images as durable objects on Amazon S3 cloud storage. File gateway offers SMB or NFS-based access to data in Amazon S3 with local caching

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

AWS Storage Gateway

A

The service provides three different types of gateways – Tape Gateway, File Gateway, and Volume Gateway – that seamlessly connect on-premises applications to cloud storage, caching data locally for low-latency access.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Supported S3 lifecycle transitions

A

Waterfall approach with the following sequence Std, IA, Intelligence Tiering, One Zone IA, Glacier, and Deep Glacier

The S3 Standard storage class to any other storage class.

Any storage class to the S3 Glacier or S3 Glacier Deep Archive storage classes.

The S3 Standard-IA storage class to the S3 Intelligent-Tiering or S3 One Zone-IA storage classes.

The S3 Intelligent-Tiering storage class to the S3 One Zone-IA storage class.

The S3 Glacier storage class to the S3 Glacier Deep Archive storage class.

RRS can’t be transitioned from any storage class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

S3 Glacier Deep Archive

A
Use for archiving data that rarely needs to be accessed. 
Minimum storage duration period of 180 days and a default retrieval time of 12 hours. If you have deleted, overwritten, or transitioned to a different storage class an object before the 180-day minimum, you are charged for 180 days.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

S3 Glacier

A

Use for archives where portions of the data might need to be retrieved in minutes. Data stored in the S3 Glacier storage class has a minimum storage duration period of 90 days and can be accessed in as little as 1-5 minutes using expedited retrieval. If you have deleted, overwritten, or transitioned to a different storage class an object before the 90-day minimum, you are charged for 90 days.

• Expedited retrievals are typically made available within 1 – 5 minutes. • Standard retrievals typically complete within 3 – 5 hours. • Bulk retrievals typically complete within 5 – 12 hours.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

S3 IA, S3 One Zone IA, Typical Use Cases

A

S3 Standard-IA and S3 One Zone-IA storage classes are designed for long-lived and infrequently accessed data. (IA stands for infrequent access.) S3 Standard-IA and S3 One Zone-IA objects are available for millisecond access (similar to the S3 Standard storage class). Amazon S3 charges a retrieval fee for these objects, so they are most suitable for infrequently accessed data.

S3 Standard-IA — Use for your primary or only copy of data that can’t be re-created.

S3 One Zone-IA — Use if you can re-create the data if the Availability Zone fails, and for object replicas when setting S3 Cross-Region Replication (CRR).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

S3 Reduced Redundancy

What happens if data is lost?

What is the annual expected loss percentage/decimal?

A

The Reduced Redundancy Storage (RRS) storage class is designed for noncritical, reproducible data that can be stored with less redundancy than the S3 Standard storage class.

For durability, RRS objects have an average annual expected loss of 0.01 percent of objects. If an RRS object is lost, when requests are made to that object, Amazon S3 returns a 405 error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

S3 Standard

A

The default storage class. If you don’t specify the storage class when you upload an object, Amazon S3 assigns the S3 Standard storage class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

S3 Intelligent-Tiering storage class

A

designed to optimize costs by automatically moving data to the most cost-effective access tier, without performance impact or operational overhead. It works by storing objects in two access tiers: one tier that is optimized for frequent access and another lower-cost tier that is optimized for infrequent access. The minimum storage duration charge is 30 days

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

AWS Snowmobile

A

Exabyte scale data transfer - tracker trailer
Transfer 100 PB/s

Enhanced security features - GPS tracking, alarm, 24/7 video surveillance, encryption etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

AWS Snowball & AWS Snowball Edge Optimized

A

Pedabyte scale transport appliance which is shipped to you to attach to your local network and transfer files directly to it.

Device will be shipped back to Amazon and data transferred to S3 into the network. Elink shipping label will automatically update and can be track via SNS or console.

Edge Optimized - ideal for transfer scenarios that require additional compute in remote, disconnected or harsh environments

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

AWS S3 Acceleration - what is it and when do you use it

A

Allows fast and easy data transfer into S3 by using CloudFront’s edge locations, where the data is routed to S3 over optimized network paths

Use it when:

  1. Have customers all over the world uploading to central bucket
  2. Transfer gigs or TBs of data across continents
  3. Underutilize the available bandwidth when uploading to S3 over the internet
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

AWS Transfer for SFTP

A

Fully managed highly available SFTP, service that enables applications to transfer files over SFTP directly to S3

You create server, set up user accounts and associate server with one or more S3 buckets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

AWS Data Sync

How is the service billed?

A

Online data transfer service that simplifies, automates, and accelerates copying large amounts of data between on-premises storage systems and AWS Storage services, as well as between AWS Storage services.

DataSync can copy data between Network File System (NFS), Server Message Block (SMB) file servers, Hadoop Distributed File Systems (HDFS), self-managed object storage, AWS Snowcone, Amazon Simple Storage Service (Amazon S3) buckets, Amazon Elastic File System (Amazon EFS) file systems, and Amazon FSx for Windows File Server file systems.

Only pay for the data you copy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

AWS S3 CORS

How is it configured?

A

Allows configuration of your bucket to allow cross origin requests by defining:

  1. Origins that you allow to access bucket
  2. The HTTP methods that will support each origin
  3. Other operations specific info (e.g. allowed heards, max age, etc.)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
S3 versioning
Allows you to recover objects from accidental deletion or overwrite
26
S3 Object Lock
S3 Object Lock for data retention or protection | Use retention period for locking an object for fixed time or a legal hold for a lock until explicitly removed
27
S3 Block Public Access - What does this restrict
Enabled by default 1. Block new public ACLs and uploading public objects 2. Remove public access granted through public ACLs 3. Block new public bucket policies 4. Block public and cross-account access to buckets that have public policies
28
AWS S3 Default Permissions
Only resource owner, an AWS account that created it, by default can access S3 resources - buckets, objects, sub-resources
29
Amazon S3
Object-level storage - change in part of the file requires the whole file to be re-uploaded Object size limit: 5TB Stored redundantly across multiple facilities Supports event notifications that can be sent to you or trigger other processes (e.g. Lamda)
30
What does EBS optimization, optimize?
Network Traffic optimization
31
List Amazon EBS SSD Volumes in order of performance
io2 Block Express io1 and io2 gp3 and gp2
32
What EBS volume supports multi attach
io2 and io1
33
What is the requirement for EBS volumes with respect to the instance(s) they are attached?
Must be in the same AZ
34
Termination Protection (EBS) Is this given by default? What property governs if it is enabled or disabled?
keeps the volume/data when the instance is terminated turned off by default Modify 'DeleteOnTermination'
35
What type of EBS Volumes can't be a boot volume?
Throughput Optimized - HDD st1
36
List types of EBS and type of volumes for each EBS
SSD 1. General Purpose SSD - gp2, gp3 2. Provisioned IOPS SSD - io1, io2, io2 Block Express Throughput Optimized HDD - st1 Cold HDD - sc1 Previous Generation - standard
37
How is EFS billed?
Pay only for resources used
38
SSE-S3
When you use Server-Side Encryption with Amazon S3-Managed Keys (SSE-S3), each object is encrypted with a unique key. As an additional safeguard, it encrypts the key itself with a root key that it regularly rotates. Amazon S3 server-side encryption uses one of the strongest block ciphers available
39
SSE-KMS
Server-Side Encryption with AWS KMS keys (SSE-KMS) is similar to SSE-S3, but with some additional benefits and charges for using this service. There are separate permissions for the use of a KMS key that provides added protection against unauthorized access of your objects in Amazon S3. SSE-KMS also provides you with an audit trail that shows when your KMS key was used and by whom. Additionally, you can create and manage customer managed keys or use AWS managed keys that are unique to you, your service, and your Region
40
SSE-C
Server Side Encryption with Customer Provided keys, you manage keys, S3 manages encryption
41
What are the 4 mutually exclusive options for SSE in S3
SSE-S3 SSE-KMS SSE-C Client side encryption
42
When should you use snowmobile vs snowball?
Use snowball for less than 10PB or distributed in multiple locations - will need multiple devices for more than 80TB and snowmobile for more than 10PB
43
Amazon FSx Windows
for Windows File Server provides fully managed, highly reliable file storage that is accessible over the industry-standard Service Message Block (SMB) protocol. It is built on Windows Server, delivering a wide range of administrative features such as user quotas, end-user file restore, and Microsoft Active Directory (AD) integration. Amazon FSx supports the use of Microsoft’s Distributed File System (DFS) to organize shares into a single folder structure up to hundreds of PB in size.
44
Amazon FSx for Lustre Typical use case
Amazon FSx for Lustre provides a high-performance file system optimized for fast processing of workloads such as machine learning, high-performance computing (HPC), video processing, financial modeling, and electronic design automation (EDA). FSx for Lustre is compatible with the most popular Linux-based AMIs FSx for Lustre file systems can also be linked to Amazon S3 buckets, allowing you to access and process data concurrently from both a high-performance file system and from the S3 API.
45
Amazon EFS
Amazon Elastic File System (Amazon EFS) automatically grows and shrinks as you add and remove files with no need for management or provisioning provides a simple, scalable, fully managed elastic NFS file system for use with AWS Cloud services and on-premises resources
46
What can mount EFS file systems?
EC2 ECS, EKS, Fargate Lambda On Prem Servers
47
Scope of EFS
Regional service
48
Amazon S3 Transfer Acceleration
enables fast, easy, and secure transfers of files over long distances between your client and an S3 bucket using Amazon CloudFront’s globally distributed edge locations
49
EC2 Instance Store Ideal use case
provides temporary block-level storage for your instance located on disks that are physically attached to the host computer ideal for temporary storage of information that changes frequently
50
Instance Store Limitations
specify instance store volumes for an instance only when you launch the EC2 can't detach an instance store volume from one instance and attach it to a different instance data in an instance store persists only during the lifetime of its associated instance
51
What events render data in the instance store lost forever
The underlying disk drive fails The instance stops The instance hibernates The instance terminates
52
Amazon FSx Deployment options OS Support
cost effective high-performance file systems in the cloud fully managed service, it handles hardware provisioning, patching, and backups Single-AZ or Multi-AZ deployment options based on your high availability requirements offer connectivity to Linux, Windows, and macOS users and applications
53
AWS Storage Gateway - Stored Volume Gateway mode
Primary data is stored locally and your entire dataset is available for low latency access on premises while also asynchronously getting backed up to Amazon S3
54
io2
EBS Provisioned IOPS SSD (io2) new generation of the Provisioned IOPS SSD volumes - designed to be better than io1 but at the same cost
55
io2 Block Express
EBS Provisioned IOPS SSD (io2 Block Express) offers the highest performance block storage in the cloud higher throughput, IOPS, and capacity than io2 volumes, along with sub-millisecond latency purpose-built to meet the performance and latency requirements of the most demanding applications
56
io1
EBS Provisioned IOPS SSD (io1) backed by solid-state drives (SSDs) and is a high performance EBS storage option designed for critical, I/O intensive database and application workloads, as well as throughput-intensive database and data warehouse workloads
57
gp3
EBS General Purpose SSD (gp3) ideal for a wide variety of applications that require high performance at low cost, including virtual desktops, medium sized single instance databases, low-latency interactive apps, dev & test, boot volumes
58
gp2
EBS General Purpose SSD (gp2) default EBS volume type for Amazon EC2 instances backed by solid-state drives (SSDs) and are suitable for a broad range of transactional workloads, including dev/test environments, low-latency interactive applications, and boot volumes
59
st1
Throughput Optimized HDD backed by hard disk drives (HDDs) and is ideal for frequently accessed, throughput-intensive workloads with large datasets and large I/O sizes, such as MapReduce, Kafka, log processing, data warehouse, and ETL workloads
60
sc1
Cold HDD (sc1) backed by hard disk drives (HDDs) and provides the lowest cost per GB of all EBS volume types deal for less frequently accessed workloads with large, cold datasets