S3 Flashcards

1
Q

What is S3?

A
  • Simple Storage Solution
  • Buckets store objects
  • objects contains key : value pairs
  • No object can live outside of a bucket
  • Flat File System
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the storage capacity of S3?

A

unlimited storage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the object size range in S3 (smallest possible to largest possible)?

A

0 bytes to 5TB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the durability of S3?

A

Data is stored across 3 AZs to ensure 11-9’s of durability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Describe the naming structure of a bucket

A
  • Bucket names are global and must be unique across a partition (a grouping of regions).
  • Names must be between 3-63 characters long.
  • Names can only contain upper or lowercase, numbers, dots(.) or hyphens(-).
  • Names must begin and end with a letter or a number.
  • Names cannot be formatted like an IP address.
  • Names can not begin with xn–
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Describe the data consistency model for S3.

A
  • Read after write consistent for New PUTS (posts)
  • Eventually consistent for overwrite puts.
  • Eventually consistent for DELETES
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What operations can you perform on an object?

A
  • PUT,
  • GET,
  • DELETE (RM),
  • LIST
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Versioning?

A
  • Objects are given a version ID
  • When new objects are uploaded the old objects are kept.
  • You can access any object version.
  • When you delete an object that is versioned then the previous version is restored.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Are buckets versioned by default?

A
  • No. Versioning must be enabled, and can be enabled at anytime.
  • Once turned on it can only be suspended.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

T/F - All new buckets are private by default

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are bucket policies?

A

A resource based policy JSON documents that control access Grants other AWS accounts or IAM users access permissions for the bucket and objects in it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Access Control Lists

A
  • Legacy permissions control. Still used though.
  • Grants access to objects and buckets with simple actions.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Cross Region Replication (CRR)?

A
  • Allows files to be replicated across regions for greater durability.
  • Versioning must be enabled.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Cross Region Replication - What gets replicated?

A
  • Any new objects added after CRR is enabled.
  • Object Metadata Tags Encryption (only SSE-KMS & SSE S3)(if source file is encrypted)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Cross Region Replication - What is NOT replicated?

A
  • Objects that existed in source bucket before CRR was enabled.
  • Objects encrypted with SSE-C
  • Source objects that the bucket owner does not have read permissions on.
  • Updates to bucket level sub resources (i.e changes to lifecycle configuration)
  • Objects in the source bucket that are there as a result of replication from another bucket.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do delete operations work on files that are Cross Region Replication?

A
  • For delete WITHOUT version ID, S3 will add a delete marker which CRR DOES replicate.
  • For delete with WITH a version ID (source) the source object is deleted but the destination object in NOT deleted.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is transfer acceleration?

A
  • provides faster and secure uploads from anywhere in the world.
  • Data is uploaded to an Edge location, then that data is transported to your S3 bucket via AWS backbone network.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is a presigned URL?

A
  • A URL generated via the AWS CLI and SDK. Provides temporary access to write or download object data.
  • Users given a pre-signed URL inherit the permissions of the person who generated the URL for GET / PUT.
  • Pre-signed Urls are commonly used to access private objects.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Name the (6) S3 storage classes

A
  1. Standard
  2. Intelligent Tiering
  3. Standard Infrequent Access (IA)
  4. One Zone IA
  5. Glacier
  6. Glacier Deep Archive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

S3 Standard

A
  • Fast.
  • 11-9’s of durability 99.99% availability
  • replicated across at least 3 AZs
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

S3 Intelligent Tiering

A
  • Uses machine learning to analyze your object usage and determines the appropriate storage class.
  • Data is moved to the most cost effective class w/o any performance impact or added overhead.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Standard IA

A
  • Cheaper then standard (50%).
  • reduced availability.
  • Good if file is accessed only once a month or less.
  • Additional retrieval fee applied.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

One Zone IA

A
  • Objects only exist in 1 AZ -> Data could get destroyed
  • Availability = 99.95%
  • Cheaper then Standard IA (20%)
  • Retrieval fee applied
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Glacier

A
  • Long term cold storage
  • Retrieval can take minutes to hours
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Deep Glacier

A
  • Lowest cost storage
  • Retrieval time = 12 - 48 hours
26
Q

Life Cycle Management in S3

A
  • Automates the process of moving objects to different storage classes or deleting objects all together.
  • Can be used together with versioning.
  • Can be applied to both current and previous versions.
27
Q

How is S3 Encryption in Transit achieved?

A

Traffic between your local host and S3 is achieved via SSL/TLS

28
Q

What are the 2 types of default encryption that can be applied to an S3 bucket?

A
  • SSE-AES - S3 handles the key and used AES-256 algorithm.
  • SSE-KMS - Envelope encryption, AWS KMS and you manage the keys.
29
Q

What are other types of encryption that can be used for S3 but not offered by default?

A
  • SSE-C : Server Side. Customer provided key
  • Client-Side Encryption: you encrypt your own files before uploading to S3 and you manage the keys.
30
Q

Buckets are PUBLIC or PRIVATE by default?

A

private

31
Q

What is the difference between SSE-C and Client Side encryption for s3 buckets?

A
  • SSE-C is server-side encryption that uses data keys that are fully managed by the customer outside of AWS.
  • Client Side Encryption may use a client library like Amazon S3 Encryption client (therefore managing keys within AWS)
32
Q

What is the difference between SSE-S3 & SSE-KMS S3 encryption services?

A
  • SSE-KMS - keys are managed within AWS KMS Service which allows more user control and provides an audit trail.
  • SSE-S3 is fully managed as part of S3 (no user control or audit trail).
33
Q

Which s3 encryption method requires HTTPS?

A

SSE-C

34
Q

What is S3 MFA - delete?

A
  • forces user to generate a code on a device (usually a mobile phone or hardware) before doing important operations on S3
  • Versioning must be enabled
35
Q

Who can enable MFA-delete on an S3 bucket?

A

Root account

36
Q

From which source can S3 MFA-deleted be enabled?

A

CLI only

37
Q

What will you need S3 MFA delete for?

What will you NOT need it for?

A

You will need MFA to

  • permanently delete an object version
  • suspend versioning on the bucket

You will NOT need MFA for

  • enabling versioning
  • listing deleted versions
38
Q

What are S3 Access Logs?

A
  • For audit purposes you may want to log all access to S3 buckets
  • Any request made to S3, from any account, authorized or denied, will be logged into another S3 bucket
  • That data can be analyzed using data analysis tools, or Amazon Athena
39
Q

Where shoud access logs be held in relation to the bucket being monitored? (Same bucket or different bucket)

A

In a different bucket! Otherwise, it will create a logging loop and bucket size will grow exponentially == HUGE BILL

40
Q

Can Cross Region Replication (CRR) and Same Region Replication (SRR) happen across different accounts?

A

Yes, as long as the correct IAM permissions are granted

41
Q

Does Cross Region Replication and Same Region Replication happen sychronously or asychronously?

A

asychronously

42
Q

What are some use cases for Cross Region Replication?

A
  • Compliance
  • Lower latency access
  • Replication across accounts
43
Q

What are some use cases for Same Region Replication?

A
  • Log aggregation
  • Live replication between production and test accounts
44
Q

What is the default length of time that a pre-signed URL is good for?

A
  • Valid for a default of 3600 seconds (1-hour).
  • Can change this by modifying the –expires-in [TIME_BY_SECONDS] argument.
45
Q

What are some use cases to use a pre-signed URL?

A
  • Allow only logged-in users to download a premium video on your s3 bucket.
  • Allow an ever changing list of users to download files by generating URLs dynamically
  • Allow temporay ability to a user to upload a file to your bucket (in a specific location).
46
Q

What are the 3 retrieval options for Amazon Glacier and how long will it take to retrieve and object from each type?

A
  • Expedited (1 to 5 minutes) - extra $
  • Standard (3 -5 hours)
  • Bulk (5-12 hours)
47
Q

What is the minimum storage time for Glacier objects?

A

90 days

48
Q

What are the 2 retrieval options for Glacier Deep Archive and their associated times?

A
  • Standard (12 hours)
  • Bulk (48 hours)
49
Q

What is the minimum storage duration for Glacier Deep Archive?

A

180 days

50
Q

Which S3 storage classes charge a retrieval fee?

A
  • S3 Standard-IA
  • S3 1-Zone IA
  • S3 Glacier
  • S3 Glacier Deep Archive
51
Q

Is there a minimum storage duration for S3 Standard objects?

A

No, but there are for all other tiers

30 Day Min Storage

  • S3 Intelligent Tiering
  • S3 Standard-IA
  • S3 One Zone IA

90 Day Min

  • Glacier

180 Day Min

  • Glacier Deep Archive
52
Q

How many requests per second, per prefix can be achieved for an applications S3 bucket?

A

requests per second / per bucket prefix

  • 3,500 PUT / COPY / POST / DELETE
  • 5,500 GET / HEAD
53
Q

What are some S3 - KMS Limitations?

A
  • KMS as different quotas / second depending on the region, and these quoatas can not be increased.
  • If quotas are exceeded then the S3 request(s) is throttled
54
Q

When is multipart-upload recommended in S3?

When are they required?

A
  • Recommended for files > 100MB
  • Must be used for files > 5GB
55
Q

How can data be filtered server-side S3? Why would you do this?

A
  • Select & Glacier Select
  • Can retrieve less data using SQL by performing server side filtering
  • Can filter by rows and columns
  • Can not aggregate data
  • Do this for less network transfer = less CPU cost client side
56
Q

What are the 3 event targets for S3?

A
  • SNS
  • SQS
  • Lamba Function
57
Q

What is Athena?

A
  • Serverless service to perform analytics against S3 files
  • Uses SQL language to query the files
  • Has JDBC / ODBC driver
  • Charged per query and amount of data scanned
58
Q

What data formats does Athena support?

A
  • CSV
  • JSON
  • ORC
  • Avro
  • Parquet
59
Q

What are some use cases for Athena?

A
  • Business Intelligence
  • Analytics
  • Reporting
  • VPC Flow Logs
  • ELB Logs
  • CloudTrails
60
Q

How can you analyze data on S3? (or ELB Logs/VPC Flow Logs / CloudTrail etc)

A

Athena

61
Q

How could you guarantee that an object isn’t deleted from S3 or Glacier?

A
  • S3 Object ock or
  • Glacier Vault Lock
  • Both polices adopt a WORM models (Write Once Read Many)
  • Blocks an object deletion for a specific amount of time
  • Helpful to lock against future edits
  • Helpful for compliance and data retention