S3 101 Flashcards

1
Q

S3

A

Simple Storage Service

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is S3 for?

A

Its’ AWS primary storage service. Provides a secure, durable and highly scalable object storage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are S3’s file size limits?

A

Files can go from 0 bytes to 5 TB’s

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Does S3 have storage limitations?

A

S3 has unlimited storage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is meant by “S3 is a universal namespace”?

A

S3 names must be unique globally

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Where are objects?

A

An object is a file and any metadata that describes it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the format of object name?

A

A web address

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What do you get in return upon a successful load of an object to S3?

A

You receive an HTTP 200 code

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a Bucket?

A

It’s a root level folder you create in S3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the structural requirements when creating a Bucket?

A

You must specify the Region where the bucket to exist. Any objects loaded to the S3 bucket will be physically located in the data center in that Region

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a good practice when creating a bucket?

A

Chose a Region that is physically closest to you to reduce transfer latency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does an object consists of?

A
  1. Key (Name of the object)
  2. Value (data)
  3. Version ID (for versioning)
  4. Metadata
  5. Subresources
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does a subresource consist of?

A
  1. Access Control List (ACL)

2. Torrent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is Access Control List (ACL)?

A

Permissions of the object at Bucket or object level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the data consistency model in S3?

A
  1. Read after write consistent for PUTS of new objects

2. Eventual consistency for overwrite PUTS and DELETES

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are S3 Guarantees?

A
  1. 99.99% S3 platform availability
  2. 99.9 availability
  3. 99.999999999% durability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are S3 features?

A
  1. Tiered Storage
  2. Lifecycle Management
  3. Versioning
  4. Encryption
  5. MFA Delete
  6. Secure data using ACL’s and Bucket policies.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

S3 Storage Classes

A
  1. Standard
  2. Infrequent access (IA)
  3. One Zone IA
  4. Intelligent Tiering
  5. Glacier
  6. Archive Glacier
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

S3 Standard

A
  1. Most expensive
  2. 99.99 availability 3. 99.999999999 durability
  3. Redundant storage across multiple facilities and devices
  4. Able to sustain the loss of facilities concurrently
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

S3 Infrequent Access

A
  1. For data less frequently accessed but requires rapid access
  2. Lower fees than standard
  3. Charged a retrieval fee
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

S3 One Zone Infrequent Access

A
  1. Lower cost than IA

2. No redundant data storage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Which is more resilient S3 Standard or S3 One Zone IA?

A

Standard S3 because data is spread across multiple availability zones.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

S3 Intelligent Tiering

A

Automatically moves data to most cost effective access tier w/o performance impact or operational overhead

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Glacier

A
  1. For data archiving
  2. Can store any amount of data
  3. Retrieval times can go from minutes to hours
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Glacier Deep Archive

A
  1. Lowest cost storage

2. Retrieval times of up to 12 hours

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is object durability?

A

Is the percent over a one year time period that an object stored in S3 will not be lost

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is object availability ?

A

Is the percent over a one year period that an object stored in S3 will be accessible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

S3 Charges

A

S3 Charges for:

  1. Storage
  2. Requests
  3. Storage Management
  4. Data Transfers
  5. Transfer acceleration
  6. Cross Region Replication
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is Transfer acceleration?

A
  1. Enables fast, easy and secure transfer of objects over long distances between end users and an S3 bucket
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is Cross Region Replication?

A

Replicates objects in buckets across regions (for high availability and disaster recovery)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

How does Transfer Acceleration works?

A

It takes advantage of CloudFront global distribution and edge locations. As data arrives to edge locations data is routed to S3 over an optimized network path

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

By Default S3 Bucket and files within are…

A

not accessible to anyone unless they are set to public

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What is a potential risk with S3 One Zone IA?

A

If zone fails you can lose data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

When should you use S3 One Zone IA?

A

If there is not need to worry about redundancy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Generally, which one is a better option, Standard or Intelligent Tier?

A

Intelligent Tier may be a better option S3 standard because it can S3 standard and infrequent access (IA)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

By default newly created buckets are….

A

Private

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What tools can be leveraged to set up access controls to buckets?

A
  1. Bucket polices - for bucket level controls

2. Access Control List - for individual object level controls

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What are S3 access logs?

A

Logs of all requests made to S3 bucket

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What is S3 In Transit Encryption?

A

Protects data as it travels to and from Amazon S3 achieved by SSL/TLS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

What is S3 At Rest Encryption?

A

Protects data stored on server side (disk) on Amazon data centers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

Type of S3 Encryption

A
  1. In Transit

2. At Rest (On server side)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

How is S3 Server side encryption managed?

A
  1. S3 Managed Keys - Amazon manages encrypt/decrypt keys (SSE-S3)
  2. AWS Kay Management Service - User and Amazon manage keys together (SSE-KMS)
  3. Server Side Encryption with Customer Provided Keys - User gives Amazon own keys that are managed by user (SSE-C)
  4. Client Side Encryption
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

Versioning

A
  1. Stores all version of an object (writes & deletes)
  2. Once enable, it cannot be disabled (only suspended)
  3. Integrates with Lifecycle rules
  4. MFA delete capability can be used to provide additional layer of security
  5. Great back up tool
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

Are versions public permission carried over?

A

No. Each version has to be made individually public

45
Q

Can an individual version be permanently deleted?

A

Yes

46
Q

What is an object lifecycle?

A

Is a set of rules that automate the migration of objects storage classes to a different storage class based on specified time intervals

47
Q

What is an S3 object lock?

A

Used to store objects using a write model, read many (WORM) model. It helps prevent objects from being deleted or modified for a fixed amount of time/indefinetely

48
Q

Object Lock use cases

A
  1. If you have an object and you don’t want anyone to delete it or modify it
  2. To meet regulatory requirements that required WORM storage
  3. To add extra layer of protection against object changes and deletion
49
Q

What are the object lock modes?

A
  1. Governance Mode

2. Compliance Mode

50
Q

Governance Mode

A
  1. Users can’t overwrite or delete an object version or alter its lock settings unless they have special permissions.
  2. Can grant some users permissions to alter the retention setting or delete an object if necessary.
51
Q

Compliance Mode

A
  1. Ensure object version can’t be overwritten or deleted for the duration of the retention period including the root user in the AWS account
  2. Retention mode cannot be changed or shortened
52
Q

Retention Periods

A

Protects an object version for a fixed amount of time

53
Q

How do Retention Periods work?

A

S3 stores a timestamp in the object’s version metadata indicating when the retention period expires. When the retention period expires the object’s version can overwritten or deleted unless you place a legal hold on the object

54
Q

What is a legal hold?

A

Similar to Retention period but it has no associated retention period and remains in effect until removed.

55
Q

Who can place or remove a legal hold?

A

Any user who has S3:PutOnObjectsLegalHold permission can place or remove a legal hold

56
Q

Glacier Vault Lock

A

Way of locking your objects inside of glacier similar to S3 object lock

57
Q

How does Vault Lock works?

A

Allows you to deploy and enforce compliance control to individuals with vault lock policy. Once locked the policy cannot be changed.

58
Q

Can object locks be applied to individual objects only?

A

No, Object lock can also be applied across the bucket

59
Q

What is a legal hold?

A

Similar to retention period. It prevents an object version from being overwritten or deleted but it has no retention period associated with it and remains in effect until removed.

60
Q

How can you enforce regulatory and compliance controls in S3 and Glacier?

A

By applying object locks and glacier vault policies, retention periods and legal holds on object versions

61
Q

How can you design applications to maximize performance in S3?

A

By using:

  1. Prefixes
  2. S3 Limits on KMS
  3. Multi Part uploads
  4. Multi Part downloads
62
Q

Where are Prefixes in the physical location of the object?

A

It’s the middle part between the bucket name and object name

63
Q

How do multi part uploads maximize performance in S3?

A

Allows to split into chunks and load files in parallel to S3

64
Q

What are the guidelines and requirements for multi part uploads to S3?

A

Recommended for files over 100 megs and required for files over 5GB’s

65
Q

How does multi part download maximize performance in S3?

A

Parallel download by specified byte ranges.
If there is a failure in the download, it’s only for the specified byte range.
It can be use to download partial amounts of the file

66
Q

True or False. The more prefixes the better performance?

A

True. Performance is improve by spreading prefixes

67
Q

How do you maximize performance using Prefixes?

A

S3 has extremely low read/write latency. You can also achieve a high number of request per second per prefix (about 3,500) so you can get better performance by spreading your reads across different prefixes

68
Q

How does KMS impacts S3 read/write performance?

A

There are Region specific quota limits on the number of S3 requests per second

69
Q

What is S3 Select?

A

It’s a feature that enables application to retrieve only a subset of data by using simple SQL and without having to download, decompress or process entire file.

70
Q

What is Glacier Select?

A

Similarly to S3 select it allows you to run SQL queries directly from the Glacier

71
Q

Glacier Select use case

A

Highly regulated companies write data to Glacier to satisfy compliance needs (e.g., Health Care and Financial Services)

72
Q

AWS Organizations

A

It’s an account management service that enables you to consolidate multiple AWS accounts into an organization that you create and centrally manage

73
Q

Consolidated Bills

A

It takes the aggregate of all of your AWS accounts into one bill. Makes it easier to track , manage and allocate resources and take advantage of volume pricing discounts.

74
Q

How do I enable access to buckets and objects across AWS accounts?

A
  1. Bucket Policies & IAM
  2. Bucket ACL’s & IAM
  3. Cross-Account & IAM Roles
75
Q

Where do you apply Bucket policies & IAM?

A

It’s applied across the entire bucket and by programmatic access only

76
Q

Where do you apply Bucket polices and ACL’s?

A

It’s applied to individual objects in the bucket and by programmatic access only

77
Q

Where do you apply Cross-Account IAM Roles

A

it’s applied at bucket and object levels either by programmatic or console access

78
Q

Transfer Acceleration

A

Utilizes CloudFront edge network to accelerate uploads to S3

79
Q

How does Transfer Acceleration works?

A

Instead of loading directly to S3 bucket, you can use a distinct URL to upload directly to an edge location which will then transfer file to S3

80
Q

S3 AWS Datasync

A

Is a service that allows you to move large amounts of data from on-prem data systems such NFS (Network File Systems) into AWS Data storage services

81
Q

What data transfer safety features are supported by Datasync?

A
  1. Automatically encrypts and accelerates data transfers over WAN
  2. Performs automatic data integrity check in-transit and at-rest
82
Q

CloudFront

A

Is a content delivery network (CDN) system of distributed servers (Network) that delivers web content to a user based o the geo location of the user. origin of the web page and the content delivery server

83
Q

What is an edge location?

A

Location where the web content will be coached

84
Q

What is the origin?

A

Origin of all the files the CDN will distribute (e.g., S3, EC2)

85
Q

What is the distribution?

A

Name of the CDN or collection of edge locations

86
Q

How does CloudFront work?

A

Delivers web contents using global network of edge locations by routing request to the nearest edge location so that contents is delivered with the best possible performance

87
Q

How many types of CloudFront distribution?

A
  1. Web (for websites)

2. RTMP (for media streaming)

88
Q

Are edge locations Read only?

A

No. You can write to them too.

89
Q

What CloudFront features are available to restrict content access?

A
  1. Signed URL’s

2. Signed Cookies

90
Q

When do you Signed URL’s?

A

When you want to secure contents to individual files so only authorize individuals have access to them

91
Q

When do you use Signed Cookies?

A

When you want to secure contents to multiple files so only authorize individuals have access to them

92
Q

How do you create Signed URL’s or Cookies?

A

By attaching a policy that includes:

  1. URL expiration
  2. IP Range
  3. Trusted signers
93
Q

What are Trusted Signers?

A

AWS accounts that can create signed URL’s

94
Q

Snowball

A

It is a peta-byte data transport solution that uses secure data appliances to transfer large amounts of data into and out of AWS

95
Q

Snowball advantages

A
  1. Avoid high network costs
  2. Long transfer times
  3. Sercurity
  4. One fifth the cost of high speed internet
96
Q

Snowball options

A

Comes in 50 TB or 80 TB

97
Q

Snowball Edge

A

100 TB data transfer service to move large amounts of data in and out of AWS

98
Q

Snowball Edge advantages

A
  1. Has compute and storage capabilities

2. Can run Lambda functions

99
Q

Storage Gateway

A

It is a hybrid cloud storage service that give you on-premise access to cloud storage

100
Q

What are Storage Gateway types?

A
  1. File Gateway
  2. Tape Gateway
    3 Volume Gateway
101
Q

What is File Gateway for?

A

For flat files stored directly in S3

102
Q

What is Tape Gateway for?

A

Tape Gateway allows you to replace and store physical tapes with virtual tapes in Amazon S3, Amazon S3 Glacier, and Amazon S3 Glacier Deep Archive,

103
Q

Types of Volume Gateways

A
  1. Stored Volumes

2. Cache Volumes

104
Q

What is Stored Volumes?

A

Stores entire Dataset on site that is asynchronously back up in S3

105
Q

What is cached Volumes?

A

Entird Dataset is stored S3 and the most frequently accessed data is cached on-site

106
Q

Athena

A

It is an interactive query service that enables you to analyze and query data located in S3 using standard SQL

107
Q

Athena benefits

A
  1. Allows to treat S3 as a DB
  2. Serverless
  3. Works directly with data stored in S3
108
Q

Macie

A

Security service which uses machine learning and Natural Language Processing to discover, classify and protect sensitive data stored in S3

109
Q

Macie benefits

A
  1. Work directly with data stored in S3

2. Can analyse Cloudtrail logs