Module 5 - Storage & Transfer Flashcards

Question

What are the costs for S3? What are the factors that change pricing?

Answer 1

STARRL • Storage pricing • Request and data retrieval pricing (only transfer OUT to other regions or the internet; only PUT, COPY, POST, LIST, GET requests) • Data transfer and Amazon S3 Transfer Acceleration pricing • Data management and analytics pricing • S3 Replication pricing • Processing with S3 Object Lambda

Answer 2

Use when: • Need to write once, and read many times. • Have a large number of users. • Have growing data sets. • Have spiky access to data (in this case, use S3 Standard and S3 Intelligent-Tiering.)

Answer 3

You could use EBS, but only up to 16 attachments. You could use S3 but because it is object-store you don't have the high-performance and read/write capacity of file storage systems so it's not ideal. If you need high throughput changes to files of different sizes, EFS is best.

Answer 4

Elastic File System. It's a managed Network File System, meaning that lots of instances in different AZs can connect to it. No need to provision for capacity, it scales automatically. Pay per use. Highly durable, scalable, and expensive. When a client makes a request, EFS routes to the "mount target" in the AZ closest to the client. Only for Linux-based AMIs (POSIX). Great for CMS, WordPress, content sharing, web serving

Answer 5

FSx for Windows and FSx for Lustre (Linux) are managed services that handle 3rd party file systems for you. Integrates with S3. Links long-term storage with high-performance file systems.

Answer 6

For all: Transfer data and ship back to AWS, stores data in your bucket. (For uploading OR downloading) If it takes longer than a week over the network, use snow! 100TB over 1gbps = 12 days * Snow Cone - small, portable data storage device 8Tb * SnowBall Edge - 80TB device (storage or compute optimized). ❗️Has computing options, can pre-process data. * Snowmobile - shipping container on a semi. 100 PB.

Answer 7

Deploy a software agent on-prem through a virtual instance. Transfer data over WAN to AWS using TLS. It's a service so there is nothing to maintain.

Answer 8

A set of hybrid cloud storage services that provide on-premises access to virtually unlimited cloud storage. An on-prem File Gateway connects to the Storage Gateway service in the cloud. Looks like the File Gateway caches data for lower latency. Or use an Appliance Gateway on-prem. Supports: files, volumes, tapes. NFS or SMB for files, iSCSI for volumes, iSCSI VTL for tapes.

Answer 9

Storage Gateway service.

Answer 10

Amazon EFS.

Answer 11

iSCI SMB NFS

Answer 12

0 bytes --> 5TB

Answer 13

It uses a universal namespace. They must have unique names, like a domain name. ⛔️ No Caps, no underscore, no IPs 3-63 chars must start with lowercase or number

Answer 14

You can optionally turn on "per request" logging; files are saved in a different bucket.

Answer 15

ACL: Legacy feature but still in use, simple way of granting access. Bucket policy: defines more complex rule access. JSON.

Answer 16

In transit: SSL/TLS In transit: client-side encryption. You encrypt before uploading. You can use a library like S3 Encryption Client. At rest: SSE. There are 3 options here.

Answer 17

Cross-Region Replication: You enable this and choose another region. Any object uploaded will automatically be replicated. ❗️You MUST have versioning turned on for both source and destination buckets. SRR = Same Region Replication. Same thing, just in the same region. Both can also replicate to another AWS account. Asynchronous. When you enable, only NEW objects will be replicated. (To do all, use S3 BATCH replication) You can't "chain" replication from bucket #1 to #2 then #2 to #3.

Answer 18

You can't on an object. Once enabled it cannot be disabled. You can only suspend versioning on the bucket.

Answer 19

Generate a Presigned URL. It grants temporary access to an object (up or download) that expires in a number of seconds. (default 1 hour = 3600 secs)

Answer 20

Enable MFA delete, where a user has to provide an MFA code before deleting or changing the versioning state of a bucket. ❗️Versioning must be turned on. Must use CLI to enable it. Only the bucket owner logged in as root can delete. Header in request: x-amz-mfa

Answer 21

aws s3 cp s3://bucketName/folder/file.fileType destination

Answer 22

Performance: • EFS Scale Mode (auto scales to Petabyte scale) • Performance Mode: Max I/O or general • Throughput Mode: Bursting or provisioned Storage: • Standard (frequently accessed) • EFS-IA (infrequently accessed) cost per retrieval but cheaper to store Multi-AZ or 1-Zone

Answer 23

``` When the IAM user-based policy allows it OR the resource-based policy (bucket, ACL) allows it AND there is no explicit deny. ```

Answer 24

S3 can host static websites; index.html. You must make access public.

Answer 25

Cross-Origin Resource Sharing. Trying to get resources from a different origin. Web browser-based security that only allows you to get resources from a different origin if the second origin allows it. Second origin sends headers telling the first origin what they are allowed to do.

Answer 26

If website A needs to get a resource from website B, then B needs to enable CORS in the response headers, otherwise, the request will be blocked by the browser.

Answer 27

All operations are strongly consistent. All changes are immediately available.

Answer 28

NOT in a VPC, it lives in the public space.

Answer 29

Browser - to the public endpoint Programmatically via REST API EC2 can connect from within a VPC through the Internet Gateway and public internet. ❗️EC2 can connect from within a VPC through a PRIVATE connection with the S3 GATEWAY ENDPOINT.

Answer 30

Standard - N/A Intelligent-Tiering, Standard IA, 1Zone IA - 30 days Glacier - 90 days Deep Archive - 180 days

Answer 31

When using the API or CLI, enforces MFA when accessing AWS resources (not just S3). In a bucket policy, the condition will look like this: "Condition":{"Null": {"was:MultiFactorAuthAge": true}} This denies any API operation that does not use MFA.

Answer 32

New objects: If unencrypted then it will encrypt. If encrypted, then nothing happens. (It won't re-encrypt.) Existing objects: Nothing. Only NEW objects will be encrypted.

Answer 33

Sends notifications when events happen in buckets to: SNS, SQS, Lambda & EventBridge.

Answer 34

A way to use simple SQL to access objects (or objects within objects like a zip file) on S3. You can filter out the data you don't need on the server, which costs less in transfer and client-side CPU. ❗️Query based on the bucket's name and object's KEY Need Lambda?

Answer 35

Configure cross-account access using IAM roles.

Answer 36

Bucket policies. Default encryption works like your backup in case you forgot to encrypt.

Answer 37

Logs for ALL requests to S3, then you can use these for analysis. The logs are saved in a different bucket.

Answer 38

S3 Analytics - Storage Class Analysis (not for 1Zone or Glacier). • Daily report

Answer 39

Byte-Range Fetch. Divides up the file and fetches specific ranges in parallel. Also good if you only need one part of the file.

Answer 40

A setting where the requester who downloads from S3 pays the networking cost for the download (not storage). The requester must be authenticated in AWS.

Answer 41

You can't do it directly. | Snowball to S3, then lifecycle policy to Glacier.

Answer 42

Lustre = "Linux cluster" ❗️ML, HPC (video, financial modeling, etc) Seamless integration with S3 Can be used on-prem with VPN or DirectConnect

Answer 43

``` Scratch file system • temp storage • not replicated • super speedy • FOR - short-term processing, save $ ``` Persistent file system • long-term storage • replicated in same AZ (failures replaced in minutes) • FOR - long-term processing, sensitive data

Answer 44

File - NFS (Network File System), SMB • S3, IA • recently used data is cached in the gateway • ❗️integrated with Active Directory for user authentication Volume - iSCSI • cached volume gateway • stored volume gateway - all data is on-prem with scheduled backups to S3. • S3 to EBS snapshots Tape iSCSI VTL • Virtual Tape Library (VTL) backed by S3/Glacier Connects to the cloud: EBS, S3, Glacier

Answer 45

You can use Storage Gateway Hardware Appliance. Will have all you need to run the gateway for you. Good for daily NFS backups where you don't have virtualization available.

Answer 46

Native access to FSx for Windows file server. Lives on-prem, connects to FSx in the cloud. * ❗️Has a cache for frequently accessed data * windows native (Active Directory, SMB, NTFS, etc.)

Answer 47

AWS Transfer Services You can store user credentials or integrate with 3rd party (LDAP, Cognito, AD) 3 Flavors: FTP (only within VPC) SFTP FTPS Service assumes IAM role to access S3/EFS

Answer 48

Expedited retrievals allow you to quickly access your data when occasional urgent requests for a subset of archives are required. Provisioned capacity ensures that your retrieval capacity for expedited retrievals is available when you need it.

Answer 49

File Gateway presents a file-based interface to Amazon S3, which appears as a network file share. It enables you to store and retrieve Amazon S3 objects through standard file storage protocols. File Gateway allows your existing file-based applications or devices to use secure and durable cloud storage without needing to be modified. With File Gateway, your configured S3 buckets will be available as Network File System (NFS) mount points or Server Message Block (SMB) file shares.

Module 5 - Storage & Transfer Flashcards

(76 cards)