Module 4: Adding a storage layer with S3 Flashcards

1
Q

As a cloud engineer working with S3 :

A

Consider access pattern and use cases to choose the correct configuration options, while:

=> Optimising Cost
=> Supporting performance
=> Compliance

And as always, security best practices to protect the resources

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Type of storage

A

Block storage
Hierarchical storage (file storage)
Object storage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Block storage

A

data stored in fixed block. The application ships the block and store them where is the most efficient. Blocks can be stored accross servers and on different OS.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

File storage

A

File storage creates a shared file system. The data is stored in a hierarchical structure. Similar to One drive for example

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Object storage

A

Object storage stores files as object based on attributes and meta data.
An object is data, metadata and a key.
he object key is the unique identifier of the object. When you update an object the entire object is updated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Difference object storage vs block storage

A

In object storage, the entire object must be updated when there is a change to the data. While in Block storage, only part of the data can be changed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Amazon Simple Storage Services (S3)

A

Object storage. Stores massive amounts of unstructured data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Type of storage and what is it stored in with S3

A

Object storage, stored in buckets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Max size of a single object?

A

5TB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

indentifier in S3

A

Unique URL for each object (universal namespace)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Component of objects

A

key, version ID, value, metadata and subresources

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does immutable means

A

It’s the charateristic of an object. You can’t change part of it you have to change the whole object outside of S3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are buckets for?

A

Container of objects. They organize the Amazon S3 namespace and identify the account in charge of the objects stored in it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Bucket Geography

A

Regional. Objects stored in a bucket never leave the region unless they are explicitely transfered to another region

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is a prefix in a bucket ?

A

similar to a path name, when querying for a prefix it will return the files with a similar path name /photos/2022 for example

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

S3 Benefits

A

Durability
Availability
High performance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How Durable is S3?

A

S3 standard storage has 11 nines (99.999999999% of durability) meaning that every year there is a 0.000000001 percent chance of losing an object.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Why is S3 so durable?

A

S3 redundantly stores objects on multiple devices accross multiple facilities in the designated region. It detects and repair failures by comparing files stored in different places. It verifies the integrity of the data by using check sums

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How available is S3?

A

S3 provides 4 nines of availability 99.99%. Meaning the ability to access the data quickly when you want it. Out of 10000 request one would not succeed. It is also scalable (unlimited storage) and gives the ability to encrypt data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Why is S3 high performing?

A

thousands of transactions per second. Scales to high requests.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

S3 use cases:

A

1) Host web content: Use high availability and high performance to address fluctuating and potentially high traffic to the data
2) Static site: Simple storage of html files, videos images…
3) financial (or other) analysis: Stor data that other services can use for analysis
4) Disaster recovery

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

S3 example of media hosting

A

S3 caching video content through cloud front to make data available more quickly to a user streaming it, whil another user downloads it directly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Do you need to provision storage for s3?

A

No it scales at need

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Static vs dynamic website

A

on static website the content is statiy and might include client side script. On dynamic website it relies on server side scripts such as PHP, JSP or ASP. S3 does not support server-side scripting. Other AWS services do.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Static website with S3

A

You can host everything on S3 for a static website no need to have a server, nor a virtual machine

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

S3 for analysis

A

Load the raw data in a bucket. Use ETL tool to transform it. (provision an EC2 server for that and use Splot fleet or EMR cluster). Return the transformed data to a new bucket. terminate the instance used for ETL. Perform your analysis on the objects stred into the second bucket. (Athena or Quicksight given as example for analysis)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

S3 for disaster recovery

A

Store everything in one s3 bucket. Replicate in another one in another region. Additionnally you can move long term data to S3 Glacier.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is cross region replication

A

Duplicate data in another Bucket, in another region

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What permission do I need to store something in S3

A

Write permissions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Objects encrypted by default in S3 (true/False)

A

True and false. Encrypted with server side encryption at upload and decrypted at download but must be enabled

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

4 ways to upload on S3

A

AWS management console
AWS Command Line Interface (AWS CLI)
AWS Softwae Devlopment Kit (SDK)
Amazon S3 Rest API

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Uploading an object through the console

A

Use a wizard (UX) based approach to move data in or out of S including drag and drop option. the limit for the management console is 160 GB.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

File size limitation to upload through the management console?

A

160GB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

For size larger than 160GB?

A

Use CLI, SDK, or Rest API

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

CLI and S3?

A

Use command line interface to prompt for an upload or download through a script

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

SDK and S3

A

Programmatically code the access to S3 in your applications

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

API and S3

A

Use put request to upload and get request to download. API access can be embeded into application codes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

You want to upload a big file to S3 or a file for which you know there is a chance of failure in the upload. What can you use?

A

Multipart upload

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What is multipart upload?

A

The object is separated into mulitple part, reassembled and then stored into a bucket

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Advantages of multipart upload?

A

Improved throughput: Uploaded in parallel means quicker storage
Recover quickly from network issues
Pause and resume upload.
Begin an upload as the object is still being built

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

S3 transfer acceleration

A

Bucket level feature that optimizes transfer speeds . Uses Cloudfront and edge location to optimize the network path

42
Q

Why use trnasfer accelerations?

A

Your customers upload to a centralized bucket?
You transfer gigabytes or terabytes regularly accross continents?
You can’t use all your bandwith when uploading?

43
Q

About acceleration: The further from the S3 bucket

A

The better the acceleration

44
Q

AWS transfer Family

A

Fully managed trasnfer service

45
Q

For what services is AWS transfer Family available?

A

Amazon S3 Storage
Amazon Elastic File System (EFS)
Network File System (NFS)

46
Q

Protocoles supported by the Transfer Family

A

Secure Shell (SSH)
File Transfer Protocol
Secure File Transfer Protocol
Applicability Statement 2 (AS2)

47
Q

Transfer Family Benefits

A

Managed service that scales
you don’t need to modify your app or run file transfer protocol infrastructure.
everythinng is managed and included into the AWS family
Only pay for what you use

48
Q

Use case of transfer family for S3

A

Data lakes for upload from third parties
Subscription based data distribution with customers
Internal transfers within org

49
Q

Use case of transfer family for Elastic File System (EFS)

A

Data distribution
Supply Chain
Content Management
Web Serving application

50
Q

Type of S3 storage

A

General Purpose
Intelligent Tiering
Infrequent access
Archive

51
Q

S3 General purpose

A

Suitable for frequent access due to high availability and low latency. Durability accross at least 3 AZ

52
Q

S3 Intelligent tiering

A

Automatically adjust the storage type of the objects, depending on access frequency to move it to the most cost effective tier.

53
Q

S3 Infrequent access

A

Standard infrequent access:
Similar to Stabdard but run on another cost model. There is a standard 30 days storage fee and the cost is higher to retreive the data.
One Zone Infrequent access:
low cost opiton, availability and resiliency not so needed. Good choice for secondary back up that you can recreate, or back up from another region.

54
Q

S3 Archive

A

Glacier instant retrieval: Rarely access data needing still to be accessed rapidly

Glacier Flexible retrieval: Needs the possiblity to access large dataset 1-2 times a year. some latency in accessing the data

Glacier deep archive: Long term retention for rarely accessed data. Good for customer needing to keep older data for compliance

S3 on outpost: S3 infrstructure for data that needs to be stored close to the customer. Kind of renting the hardware and having it on permise. So not quite cloud. (If I get it right)

55
Q

Storage duration charge for Infrequent Access ?

A

30 days

56
Q

Storage duration charge for Glacier ?

A

90 days or 180 days for deep archive

57
Q

Number of AZ with S3

A

> = 3 except S One Zone IA where it’s one.

58
Q

S3 and retrieval charges

A

Retrieval charges per GB retrieved apply except for standard and intelligent tiering

59
Q

What is an S3 lifecycle configuration ?

A

It’s a policy determining the transition of an object from one storage class to another. E.g: No access over the last 30 days => Infrequent access.
No access and object last access more than x month ago => deletion

Lifecycle transition or expiration have associated costs

60
Q

Type of lifecycle operations ?

A

Transition
Or
Expiration

At objetct or Bucket level

61
Q

Advantage of lifecycle on S3

A

The cycling reduces the cost as you pay less for data the further it loses in relevance for you.

62
Q

Lifecycle use case

A
  1. Delete automatically logs after 30 days
  2. Documents are stored in standard for 60 days, in infrequently accessed for 1 year, in Glacier for 7 years, then deleted
63
Q

S3 Versioning use case

A

Protect for accidental overwrite and delets
Enables recovery

64
Q

At what level is versioning enabled

A

At the bucket level

65
Q

How does versioning works

A

Each object has a Version ID and new publication of the object increment the version id by 1. The previous object is not overwritten. When deleted Amazon simply adds a “deleted” marker. But the object remains.

66
Q

Is versioning enabled by default ?

A

No

67
Q

What mechanism allows for object retrieval in Versioning?

A

The version ID

68
Q

Can I recover the object if versioning is Suspended?

A

No

69
Q

Can I recover a deleted object with versioning ?

A

Yes

70
Q

What is the cost of versioning

A

None except for storage cost

71
Q

What issue may you face trying to get an object if the most recent version of it has a delete marker ?

A

It will not succeed and return a 404 not Found error.
If you use a GET request specifying the version then you can access the object

72
Q

How to permanantly delete an object when versioning is active ?

A

You must be the owner of the bucket and specify the version of the object you want to delete.

73
Q

What is the meaning of CORS

A

Cross Origin Resource Sharing

74
Q

What is Cross Origin Resource Sharing?

A

It’s an XML document in which are written:

The origins: Resources enabled to access your document
The Operations (HTTP methods) that will support each origin
Additional operation specific information

75
Q

What is Cross Origin Resource Sharing used for

A

It’s a way for client web application to access storage of another application

76
Q

Example of CORS

A

You have a web font that you use for a website. You want another one to access this resources you create a CORS allowing your second website to access the ressource of the first websote

77
Q

What is strong consistency

A

A mechanism ensuring that object put on the bucket are consistent with what has been effectively transfered. Read-after-write. Allows to not have to make the checks yourself and provision the infrastructure to do it.

It simplifies the migration of on permises workloads.

It is by default

78
Q

Outside of the by default capacity of S3 for strong consistency another Amazon Service allows for consistency control. What’s the name of the service ?

A

S3Guard

79
Q

S3 default security configuration

A

Objects are private and protected by default
Encryption is configured by default
Default encryption: S3 managed keys (SSE-S3)

80
Q

When sharing S3 access

A

Manage and control the access.
Use least priviledge principle

81
Q

Are new S objects encrypted in transit?

A

NO but they are encrypted at rest

82
Q

Default S3 encrpytion

A

S3 Managed Keys (SSE-S3)

83
Q

Can I use another encryption than SSE-S3 ?

A

Yes use AWS KMS (Key management Services) for:
Server side encryption (SSE-KMS)
OR
Dual layer server side encryption (DSSE-KMS)
OR
Customer provided key (SSE-C)

84
Q

Can I protect data in transit ?

A

Yes but yo need client side encryption for that. It happens before being transfered to S3

85
Q

Tools for protecting Buckets and object

A

Block public access option
IAM policies
ACL (Access Control Lists)
S3 Access Point
Preassigned URL (Time limited URL=
AWS Trusted advisor (provides bucket permission Check)

86
Q

ACL vs IAM

A

ACL predates IAM. Prefer IAM or be extra mindful or your ACL setup.

87
Q

Region choice for storage

A

Data privacy laws and compliance
Proximity of users
Service availability in the region
Cost effectinveness

88
Q

What is S3 inventory for?

A

Help manage storage
Use it to audit and report.

You can set up weekly reports and exports in different file formats (CSV, ORC….)

89
Q

Can I query S Inventory through a DBS?

A

Yes with Athena, or redshift for example but also other tools…

90
Q

Default pricing of S3

A

Pay for what you use:

Storage:
Per GB of objects stored per month.
Different pricing for region and storage class

Operation:
PUT; COPY; POST; LIST and lifecylce transition

91
Q

S3 has no charge for transfer

A

1)Out to the internet for up to 100GB a month
2) In from the internet
3) Between S3 Buckets
4) From an S3 bucket to any AWS service with the sae Region
5) Out to Cloud fromt

92
Q

Additional cost for intelligent tiering

A

Monthly monitoring and automation charge for each object

93
Q

S3 cost depends on

A

Object size, storage duration, storage class

94
Q

What are Ingest charges

A

Cost associated to request with PUT, COPY, POST or LIST request. Plus Lifecycle operations

95
Q

Encryption fees in S3.

A

No Fees for standard SSE-S3 or SSE-C Pay for encryption when using AWS KMS.
DSSE-KMS includes further charges for the second encryption layer

96
Q

Free tier in S3

A

Gb of storage. 20000 GET;2000 PUT,COPY, POST or LIST; 100 GB of data Transfer each month

97
Q

Well architected best practices,
Security Pillar for S3

A

Enfore Encryption at Rest
Enfore Access Control

98
Q

Well architected best practices,
Performance Efficiencyr for S3

A

Learn About and understand availavble cloud servicies and features
Factor cost into architectual decisions

99
Q

Well architected best practices,
Cost Optimization for S3

A

Perform cost analysis for different usage over time

100
Q

Reliability

A

Select the appropriate location and multi-lication devlopment if appropriate