S3 101 Flashcards

1
Q

What does S3 stand for?

A

Simple Storage Service

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is S3 used for?

A
  • S3 provides developers + IT teams w/ secure, durable, highly-scalable object storage.
  • retrieve and store any amount of data from anywhere on the web
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What type of storage does S3 use?

A

S3 uses Object-based storage – i.e. allows you to upload files

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What size limitations are there for individual S3 objects? What about aggregate limitations?

A
  • S3 files can be from 0 Bytes to 5 TB.
  • There is no aggregate limitation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are S3 buckets? What are they used for?

A

S3 buckets store files. (Think of them like a file folder)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What type of namespace does S3 use?

A

S3 uses a universal namespace. That is, names must be globally unique.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When you successfully upload a file to S3, what will you receive back?

A

an HTTP 200 code

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the components of an S3 object? What do each of these components represent?

A

An S3 object consists of the following:

  • Key (The name of the object)
  • Value (the data, made up of a sequence of bytes)
  • Version ID (Important for versioning/version control)
  • Metadata (data about data you are storing)
  • Subresources (Access Control Lists, Torrent)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How does S3 keep data consistent?

A
  • Read after Write consistency for PUTS of new Objects
  • Eventual Consistency for overwrite PUTS and DELETES (takes some time to propagate)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

For what % availability was the S3 platform built?

A

99.99%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What % availability does Amazon guarantee for S3 Standard?

A

99.9%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What % durability does Amazon guarantee for S3 Standard information?

A

99.999999999% durability (11 9’s)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the key features of S3?

A

​(V MELTS)

  • Versioning
  • MFA Delete
  • Encryption
  • Lifecycle Management
  • Tiered Storage
  • Secure Data using Access Control Lists and Bucket Policies
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the key features of S3 Standard?

A
  • 99.99% Avail
  • 11 9’s Durability
  • Stored redundantly across multiple devices in multiple facilities,
  • designed to sustain the loss of 2 facilities concurrently
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does the “IA” stand for in S3-IA?

A

Infrequently Accessed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What type of data is best stored in S3-IA?

A

S3-IA is best for data that is not accessed frequently, but requires rapid access when needed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the pricing structure of S3-IA? Specifically, how does it differ from that of S3 Standard?

A
  • S3-IA has a lower base storage fee than S3 Standard.
  • However, S3-IA charges a retrieval fee.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are the key differences between S3-IA and S3 One Zone - IA?

A

Compared to S3-IA, S3 One Zone- IA has lower cost but less durability.

  • S3 One Zone-IA is a lower-cost option for IA data
  • S3 One Zone-IA does not give the multiple Availability Zone resilience of S3 standard and S3 IA.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is S3 - Intelligent Tiering?

A

S3 Intelligent tiering uses ML and is designed to optimize costs by automatically moving data to the most cost-effective access tier, without performance impact or operational overhead. (Basically, it’s the autopilot mode for S3 tiering)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is S3 Glacier primarily used for?

A

S3 Glacier is mostly used for data archival at low-cost

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How long does it take to retrieve something from S3 Glacier?

A

Retrieval times from S3 Glacier are configurable and range from minutes to hours

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is S3 Glacier Deep Archive?

A

S3 Glacier Deep Archive is S3’s lowest-cost storage class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How long does it take to retrieve something from S3 Glacier Deep Archive?

A

S3 Glacier Deep Archive is for cases where a retrieval time of 12 hours is acceptable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What are the areas on which you are charged for using S3?

A
  • Storage (amount you are storing)
  • Requests
  • Storage Management Pricing (Tier)
  • Data Transfer
  • Transfer Acceleration
  • Cross - Region Replication
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is Transfer Acceleration?

A
  • Used for fast, easy, secure transfers over long distances between end user and an S3 bucket
  • Uses CloudFront’s globally distributed edge locations: as data arrives at an edge location, data is routed to S3 over an optimized network path
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is the format of the DNS name created for an S3 bucket in a specific region?

A

“http://s3.aws-region.amazonaws.com/bucketName

OR

“http://bucketname.s3.aws-region.amazonaws.com”

(<a>https://docs.aws.amazon.com/AmazonS3/latest/dev/UsingBucket.html#access-bucket-intro</a>)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What would you use to install an operating system on S3?

A

S3 is NOT suitable to install an operating system on.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

How can I help, at a bucket-configuration level, to protect against someone going in and deleting data from S3?

A

Turn on MFA Delete

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

How does the pricing model work for S3 Intelligent Tiering

A

Very similar to S3 Standard EXCEPT

  • you have access to IA which is less expensive
  • There is a monitoring / automation cost per thousand objects per month
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What are the default access control permissions for newly created buckets?

A

By default, all newly created buckets are PRIVATE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

How can I set up access control to buckets?

A
  • Bucket Policies
  • Access Control Lists (object-specific)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

How can I set up my S3 bucket to log all requests made to it?

A

S3 buckets can be configured to create access logs, which log all requests made to the S3 bucket.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What are AWS Organizations?

A

AWS Organizations is an account management service that enables you to consolidate multiple AWS accounts into an organization that you create and centrally manage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is Consolidated Billing? How would I set it up? What are three major advantages to it?

A

Consolidated Billing is an advantage of AWS Organizations. The three major advantages are:

  • One Bill per AWS account
  • Very easy to track changes & allocate costs
  • Volume Pricing Discount – the more you use, the lower your rate
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What are two forms of best practices for a root account with AWS organizations?

A
  • Always enable multi-factor authentication on a root account
  • Always use a strong and complex password on a root account
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

How can I enable/disable AWS services for either the organizational level or on individual accounts?

A

Enable/Disable AWS services using Service Control Policies (SCPs) either on Organizational Units or on individual accounts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What is the difference between Bucket Policies and Bucket ACLs as it relates to sharing S3 buckets across accounts?

A
  • Bucket Policies apply across the entire bucket
  • Bucket ACLs apply to individual objects
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

Which method of sharing S3 buckets across accounts is the only one that provides both Programatic and Console access?

A

Cross-account IAM Roles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What are the 3 ways to share S3 buckets across accounts? What level of access does each provide?

A
  • ​Bucket Policies & IAM – (applies across entire bucket) Programatic Access Only

  • Bucket ACLs & IAM (applies to individual objects) Programatic Access Only
  • Cross-account IAM Roles.Programmatic AND Console Access
40
Q

Does Cross-Region Replication require bucket versioning?

A

Yes. Cross-region replication requires bucket versioning on both the source and destination buckets.

41
Q

When performing cross-region replication, what permissions – at the time of creation – are different between the source bucket and the destination bucket?

A

by default, there are NO differences between the source and replicated buckets

42
Q

When performing cross-region replication, what files – at the time of creation – are different between the source bucket and the destination bucket?

A

When using cross-region replication, files in an existing bucket are NOT replicated automatically.

43
Q

When performing cross-region replication, what discrepancies will there be between the source and replication buckets?

A
  • All file (versions) made before CRR was turned on are not automatically copied at creation
  • Delete markers, deleted versions, and deletes of delete markers are NOT replicated
44
Q

At a high level, how does S3 Transfer Acceleration work?

A

Instead of uploading directly to a bucket, the user utilizes a distinct (given) URL to upload to an edge location, which then transfers through Amazon Backbone and directly uploads to an S3 bucket

45
Q

What is AWS DataSync used for?

A

AWS DataSync is used primarily for moving/copying large amounts of data from on-premises to AWS

46
Q

With what types of file systems is AWS DataSync compatible?

A

DataSync is used with NFS- and SMB-compatible file systems

47
Q

How can you start DataSync replication?

A

Install the DataSync agent to start the replication​

48
Q

How often is data replication performed by AWS DataSync?

A

Replication in DataSync can be done hourly, daily, or weekly.

49
Q

How can AWS DataSync be used to replicate EFS to EFS?

A

Install the DataSync Agent on an EC2 instance connected to EFS

50
Q

What does CDN stand for?

A

Content Delivery Network

51
Q

What is a Content Delivery Network?

A

A Content Delivery Network is a system of distributed servers that deliver web content to a user based on the geographic locations of the user, the origin of the webpage, and a content delivery space.

52
Q

What is an Edge Location?

A

A location where content is cached. (This is separate from an AWS Region/AZ)

53
Q

In the context of CloudFront and Content Delivery Networks, what is an origin? What are some examples of origins?

A

The origin of all the files that the CDN will distribute.

This can be an S3 bucket, an EC2 instance, an Elastic Load Balancer, or Route53

54
Q

In the context of CloudFront, what is a Distribution?

A

the name given to the CDN, which consists of a collection of edge locations

55
Q

Are edge locations read-only?

A

No. You can write to an edge location too!

56
Q

What is Amazon CloudFront?

A

CloudFront can be used to deliver your entire website, including dynamic, static, streaming, and interactive content, using a global network of edge locations.

57
Q

Why is Amazon CloudFront good for performance?

A

Geographic Cacheing. Requests for your content are automatically routed to the nearest edge location, so content is delivered with best performance possible.

58
Q

What does RTMP stand for?

A

Real-Time Messaging Protocol

59
Q

What are the 2 types of distributions used for CloudFront?

A
  • Web Distribution - for websites
  • RTMP - for media streaming
60
Q

Can you clear cached objects in an edge location?

A

Yes, but you will be charged. (Invalidating the Cache)

61
Q

What is Time To Live?

A

How long objects are cached in an edge location. This is a configurable amount.

62
Q

What S3 functionalities would you want to use for restricting content access?

A

CloudFront Signed URLs and Cookies and S3 Signed URLs

63
Q

What is the key difference between a CloudFront Signed URL and a CloudFront Signed Cookie?

A
  • A signed URL is for individual files (1 file = 1 URL)
  • A signed cookie is for multiple files (1 cookie = multiple URLs)
64
Q

What can be included in the policy attached to a signed URL or signed cookie?

A
  • URL expiration (how long it is validd)
  • IP ranges
  • Trusted Signers (which AWS accounts can create signed URLs)
65
Q

What does OAI stand for?

A

Origin Access Identity

66
Q

Describe the process by which you get a CloudFront Signed URL

A
  1. Client Authenticates and Authorizes to log in to the application
  2. Application Uses CloudFront SDK to generate signed URL
  3. Application Returns Signed URL to client
  4. Client logs into Cloudfront using signed URL
67
Q

Can you use S3 Signed Cookies if your origin is in EC2?

A

No. If your origin is EC2, use CloudFront

68
Q

What is Amazon Snowball used for?

A

BIG data Transfers into and out of AWS, including importing to and exporting from S3

69
Q

What is the idea behind Amazon Storage Gateway?

A

Connecting an on-premises software app with cloud-based storage for smart storage (mixing cloud storage and on-premises storage).

70
Q

What does NFS stand for?

A

Network File System

71
Q

How does Amazon File Gateway work?

A
  • Form of Amazon Storage Gateway
  • Files are stored as S3 buckets
  • Files are accessed through an NFS mount point
  • Once objects are transferred to S3, they get managed as native S3 objects, and bucket policies apply directly to them.
72
Q

What are Amazon Stored Volumes? How do they work?

A
  • Store primary data locally, while asynchronously backing it up to AWS
  • On-premises applications get low latency access to their entire datasets. AND you get durable, off-site backups
  • Data is backed up to S3 in the form of EBS snapshots
73
Q

What are Amazon Cached Volumes? How do they work?

A
  • Lets you use S3 as your primary data storage while retaining frequently accessed data in your storage gateway.
  • Minimize need to scale on-premises storage
  • Data is backed up to S3 in the form of EBS snapshots
74
Q

What are the two types of Amazon Volume Gateways? In what key ways do they differ?

A
  • Storage Volumes let you keep all of your data on premises (thus on-premise data is the primary storage)
  • Cached Volumes let you keep your frequently accessed data on-premises (thus S3 is your primary data storage)
75
Q

How can you bring a system that runs on tapes into Amazon S3?

A

Use amazon’s Virtual Tape Library (VTL)

76
Q

What are the three types of Amazon Storage Gateways?

A
  • File Gateway
  • Volume Gateway
  • Gateway Virtual Tape Library
77
Q

What is Amazon Athena? What is it commonly used for?

A
  • Athena is an interactive query service that allows you to query data located in S3 using SQL
  • Serverless
  • Commonly used to analyse log data stored in S3
78
Q

What does PII stand for?

A

Personally Identifiable Information

79
Q

What is Amazon Macie?

A
  • Macie is a security service which uses ML and NLP to discover, classify, and protect sensitive data used in S3
  • Can be used to analyze CloudTrail logs for suspicious API activity
  • Includes Dashboards, Alerts, Monitoring
  • Great for PCI-DSS complicance and preventing Identity Theft
80
Q

What does KMS stand for?

A

Key Management Service

81
Q

What is the availaility of S3-OneZone-IA?

A

99.50%

82
Q

How many S3 buckets can I have per account by default?

A

100

83
Q

What is the general use case for S3 Transfer Acceleration?

A

Accelerating uploads to S3

84
Q

Where can S3 access logs be stored?

A

S3 Access logs sent to another bucket or even another bucket in another account.

85
Q

What does TTL stand for?

A

Time To Live

86
Q

When using Storage Volumes, how is data backed up to S3?

A

asynchronously, as EBS snapshots

87
Q

How can I restore a file if I went to “Actions -> Delete” on it in S3?

A

Delete the delete marker

88
Q

By default, are items automatically encrypted when they are stored in S3?

A

No, Default encryption is NOT enabled by default

89
Q

By default, is Transfer Acceleration enabled for a newly created S3 bucket?

A

No, by default, transfer acceleration is suspended in newly-created S3 buckets

90
Q

When creating a new S3 bucket, what bucket policies does it have by default?

A

None.

By default, bucket policy does not exist for newly created S3 buckets

91
Q

By default, is versioning enabled for newly created S3 buckets?

A

No

92
Q

What are the S3 bucket policies?

A
  • Versioning
  • Server Access Logging
  • Static Website Hosting
  • Object-Level Logging
  • Tags
  • Transfer Acceleration
  • Events
  • Requester Pays

(<a>https://docs.aws.amazon.com/AmazonS3/latest/user-guide/view-bucket-properties.html</a>)

93
Q

When uploading objects, what prefix must all user-defined metadata have?

A

**x-amz-meta-**

94
Q

In the console, after enabling logging on a source bucket, what permission do you need to give the destination bucket to ensure that the logs can be written there?

A

You don’t have to do anything.

When you enable logging on a bucket, the console both enables logging on the source bucket and adds a grant in the target bucket’s access control list (ACL) granting write permission to the Log Delivery group.

(<a>https://docs.aws.amazon.com/AmazonS3/latest/dev/enable-logging-console.html</a>)

95
Q
A