S3 Flashcards

1
Q

S3

A

Simple Storage Service

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is S3

A

provides developers and IT Teams with secure, durable, highly-scalable object storage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Object based storage

A

where you can store files, pictures, pdfs etc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Block Based Storage

A

EC2 - where you install operating system, databases or applications

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

File size that can be stored in S3

A

0 - 5 TB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

S3 storage

A

unlimited

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

S3 Files are stored in

A

Buckets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

S3 naming

A

universal namespace - it has be unique globally;

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Sample of an S3 name

A

https://s3-eu-west-1.amazonaws.com/acloudguru

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

When you upload a file to S3, this will be returned when the upload is successful

A

HTTP 200

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Data consistency model for S3

A
  1. Read after write consistency for PUTS of new objects

2. Eventual Consistency for overwrite PUTS and DELETES (can take sometime to propagate)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Read after Write consistency

A

Uploading a file and able to read/access it right away (millisecond after)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Eventual consistency

A

if we’re updating/deleting a file, we might get 2 files -old and new but eventually you’ll be able to get the new file (after a min) - delay is due to the fact that S3 is spread across multiple AZs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

S3 is object based; objects consists of the following

A
  1. Key
  2. Value
  3. Version ID
  4. Metadata
  5. Subresources
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Key

A

name of the object

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Value

A

simply the data and is made up of sequence of bytes; it’s the data inside the files (hello cloud gurus)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Metadata

A

tags (owned by sales marketing etc)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Access Control List

A

putting individual permissions on a file

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Version ID

A

important for versioning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

S3 durability for all storage classes

A

99.99999999 (11x9s)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

S3 Tiered Storage

A

storage classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

LIfecycle Management

A

archiving files; moving files from one storage tier to another based on how old the file is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Versioning

A

multiple versions of the file

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Securing your data in S3 using:

A
  1. Access Control Lists

2. Bucket Policies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Access Control Lists

A

goes down to the individual file level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Bucket Policies

A

locking down the bucket itself at the bucket level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

S3 has the following features

A
  1. Tiered Storage Available
  2. Lifecycle Management
  3. Versioning
  4. Encryption
  5. MFA Delete
  6. Securing data using ACL and BP
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

S3 Storage Classes

A
  1. S3 Standard
  2. S3 Intelligent Tiering
  3. S3 Standard-IA
  4. S3 One Zone-IA
  5. S3 Glacier
  6. S3 Glacier Deep Archive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q
S3 Standard Availability 
Availability SLA
Availability Zones
Min capacity charge/object
Min storage duration charge
Retrieval Fee
First byte latency
A

99.99%
99.9%
>=3
NA
NA
NA
milliseconds

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

S3 Standard is stored

A

redundantly across multiple devices in multiple facilities and is designed to sustain the loss of 2 facilities concurrently

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q
S3 Standard - IA Availability 
Availability SLA
Availability Zones
Min capacity charge/object
Min storage duration charge
Retrieval Fee
First byte latency
A
99.9%
99%
>=3
128kb
30 days
Per GB retrieved
milliseconds
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

S3 - IA is for

Use case

A
  1. data that is accessed less frequently but requires rapid access when needed
  2. lower fee than S3 but you are charged a retrieval fee

ideally suited for long-term file storage, older sync and share storage, and other aging data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q
S3 One Zone - IA Availability 
Availability SLA
Availability Zones
Min capacity charge/object
Min storage duration charge
Retrieval Fee
First byte latency
A
99.5%
99%
1
128kb
30 days
Per GB retrieved
milliseconds
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

S3 One Zone - IA is for

Use case

A
  1. were you want a lower cost option for infrequently accessed data but do not require the multiple AZ data resilience

backup copies, disaster recovery copies or other easily re-creatable data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q
S3 Intelligent Tiering Availability 
Availability SLA
Availability Zones
Min capacity charge/object
Min storage duration charge
Retrieval Fee
First byte latency
A
99.9%
99%
>=3
NA
30 days
NA
milliseconds
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

S3 - Intelligent Tiering

Use case

A

Designed to optimize costs by automatically moving data to the most cost-effective access tier, without performance impact or operational overhead

Unknown access patterns;an also be used to store new data sets where, shortly after upload, access is frequent, but decreases as the data set ages. Then you can move the data set to S3 One Zone-IA or archive it to S3 Glacier.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q
S3 Glacier Availability 
Availability SLA
Availability Zones
Min capacity charge/object
Min storage duration charge
Retrieval Fee
First byte latency
A

99.99%
99.9%
>=3
40 KB
90 days
per GB retrieved
select minutes or hours

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

S3 Glacier
Retrieval times

Use case

A
  1. secure, durable and low-cost storage class for data archiving
  2. retrieval times configurable from minutes to hours

media asset workflows, healthcare information archiving, regulatory and compliance archiving, scientific data storage, digital preservation, magnetic tape replacement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q
S3 Glacier Deep Archive Availability 
Availability SLA
Availability Zones
Min capacity charge/object
Min storage duration charge
Retrieval Fee
First byte latency
A

99.99%
99.9%
>=3
40 KB
180 days
per GB retrieved
select hours

40
Q

S3 Glacier Deep Archive

Use case

A

Amazon S3’s lowest cost storage class where a retrieval time of 12 hours is acceptable

can also be used for backup and disaster recovery use cases, and is a cost-effective and easy-to-manage alternative to magnetic tape systems, whether they are on-premises libraries or off-premises services.

41
Q

S3 charges

A
  1. Storage
  2. Requests
  3. Storage Management Pricing
  4. Data Transfer Pricing
  5. Transfer Acceleration
42
Q

S3 Transfer Acceleration

A

enables fast, easy and secure transfers of files over long distances between end users and an s3 bucket

43
Q

Transfer acceleration takes advantage of

A

Amazon’s CloudFront’s globally distributed edge locations. As the data arrives at an edge location - data is routed to Amazon S3 over an optimized network path.

44
Q

Transfer acceleration process

A

○ The users upload their files to the edge location instead of directly uploading it to the S3 bucket.
○ Edge location - is a small Data Center that is near the user
○ Once uploaded, it is then sent over amazon’s backbone network
Instead of the user uploading it using their internet access directly to the S3 bucket, they are uploading it to the edge location then amazon have a much better communication between their edge locations and data centers which will be a lot faster.

45
Q

S3 naming bucket rules

A
  • Bucket names must be unique across all existing bucket names in Amazon S3.
  • Bucket names must comply with DNS naming conventions.
  • Bucket names must be at least 3 and no more than 63 characters long.
  • Bucket names must not contain uppercase characters or underscores.
  • Bucket names must start with a lowercase letter or number.
  • Bucket names must be a series of one or more labels. Adjacent labels are separated by a single period (.). Bucket names can contain lowercase letters, numbers, and hyphens. Each label must start and end with a lowercase letter or a number.
  • Bucket names must not be formatted as an IP address (for example, 192.168.5.4).
46
Q

MFA Delete

A

You can turn it on for security

47
Q

S3 Encryption

A
  1. Client Side Encryption

2. Server Side Encryption

48
Q

3 Server Side Encryptions (Encryption at Rest)

A
  1. Server Side encryption with Amazon S3 Managed Keys (SSE-S3)
  2. Server Side encryption with KMS (SSE-KMS)
  3. Server side encryption with Customer Provided Keys (SSE-C)
49
Q

S3 Managed Keys - SSE - S3

A
  1. Amazon manages the keys for you automatically so you don’t have to worry about the keys at all
  2. 256-bit Advanced Encryption Standard (AES-256)
50
Q

SSE - KMS

A

You and amazon manage the keys together

51
Q

SSE-C

A

Where you give amazon your own keys that you manage and you can encrypt your S3 objects doing that

52
Q

S3 Versioning, once enabled, can it still be disabled?

A

you can only suspend it, if you want to delete it, you have to delete the bucket itself

53
Q

How to restore a deleted file in S3?

A

Remove the delete marker in the versioning

54
Q

Versioning integrates with

A

Lifecycle rules

55
Q

Cross Region replication requires

A

versioning to be turned on for the bucket

56
Q

what will happen to the existing files when you just turn on Cross Region replication

A

It will not copy existing files, it has to be manually moved

57
Q

When new files are created after the Cross Region replication has been turned on

A

they will automatically be replicated in the target bucket

58
Q

What won’t be replicated in the Target bucket?

A
  1. If you delete files from the source bucket, they won’t be replicated in the target bucket
  2. If you delete individual versions, it won’t get replicated either
59
Q

CDN

A

Content delivery network is a system of distributed servers that delivers webpages and other web content to a user based on geographic locations of the user, the origin of the webpage, and a content delivery server

60
Q

Edge location

A

location where the contents are cached

61
Q

Origin

A

origin of all the files that the CDN will distribute. This can be an S3 bucket, EC2 instance, an Elastic Load Balancer or Route 53.

62
Q

Distribution

A

name given the CDN which consists of a collection of Edge Locations

63
Q

Web Distribution

A

typically used for websites

64
Q

RTMP

A

Used for media streaming

65
Q

TTL

A

Time to Live; objects are cached for the life of the TTL

66
Q

What will happen if you invalidate cached objects?

A

you will be charged

67
Q

Snowball

A

petabye scale data transport solution that uses secure appliances to transfer large amounts of data into and out of AWS

68
Q

Snowball sizes

A

50 TB or 80 TB

69
Q

AWS Snowball Edge

A

is a 100TB data transfer service with on board storage and compute capabilities

70
Q

AWS Snowmobile

A

Exabyte scale data transfer service used to move extremely large amounts of data to AWS

71
Q

Snowball can ? to S3

A

Import to S3 and Export from S3

72
Q

AWS Storage Gateway

A

> service that connects an on premise software appliance with cloud based storage to provide seamless and secure integration between an organization’s on-premise IT environment and AWS’s storage infrastructure
a virtual of physical device that’s going to replicate your data into AWS (used to be virtual device but they’ve released a hardware appliance now so you can actually have a physical storage gateway)

73
Q

How can the AWS Storage Gateway’s software used?

A

available for download as a virtual machine (VM) image that you install on your datacenter

74
Q

Storage Gateway supports :

A
  1. VMWare ESXi

2. Microsoft Hyper V

75
Q

3 different types of storage gateway

A
  1. File Gateway (NFS)
  2. Volume Gateway (iSCSI)
  3. Tape Gateway (VTL)
76
Q

File Gateway

A

way of storing files in S3

77
Q

File Gateway: Files are stored as objects in your S3 buckets, access through:

A

a network file system (NFS) mount point

78
Q

Volume Gateway

A

> presents your applications with disk volumes using the iSCSI block protocol.
Data written to these volumes can be asynchronously backed up as a point in time snapshots of your volumes, and stored in the cloud as Amazon EBS snapshots.

79
Q

Snapshots

A

are incremental backups that capture only changed blocks. All snapshot storage is also compressed to minimize your storage charges

80
Q

2 types of Volume Gateway

A
  1. Stored Volumes - data written to your stored volumes is stored on your on prem storage hardware then it is asynchronously backed up to S3 in the form of EBS snapshots. (1GB - 16 TB in size for stored volumes)
  2. Cached Volumes - uses S3 as the primary data storage while retaining frequently accessed data locally in your storage gateway. (1GB- 32 TB)
81
Q

Tape Gateway

A

durable cost effective solution to archive your data in the AWS cloud

82
Q

Tape Gateway is supported by

A

NetBackup, Backup Exec, Veeam

83
Q

Are there differences between how Amazon EC2 and Amazon S3 work with Availability Zone-specific resources?

A

Yes. Amazon EC2 provides you the ability to pick the AZ to place resources, such as compute instances, within a region. When you use S3 One Zone-IA, S3 One Zone-IA assigns an AWS Availability Zone in the region according to available capacity.

84
Q

Q: Can I have a bucket that has different objects in different storage classes and Availability Zones?

A

Yes, you can have a bucket that has different objects stored in S3 Standard, S3 Standard-IA and S3 One Zone-IA.

85
Q

S3 Restore Speed Upgrade

A

override of an in progress restore to a faster restore tier if access to the data becomes urgent

86
Q

How much data can I retrieve from Amazon S3 Glacier for free?

A

You can retrieve 10GB of your Amazon S3 Glacier data per month for free with theAWS free tier.

87
Q

There are three ways to restore data from Amazon S3 Glacier –

A

Expedited, Standard, and Bulk Retrieval

88
Q

What is “Query in Place” functionality?

A

Amazon S3 allows customers to run sophisticated queries against data stored without the need to move data into a separate analytics platform.

89
Q

What is S3 Select?

A

S3 Select is an Amazon S3 feature that makes it easy to retrieve specific data from the contents of an object using simple SQL expressions without having to retrieve the entire object.

90
Q

What is Amazon Athena?

A

Amazon Athena is an interactive query service that makes it easy toanalyze data in Amazon S3 using standard SQL queries. Athena is serverless, so there is no infrastructure to setup or manage, and you can start analyzing data immediately.

91
Q

What is Amazon Redshift Spectrum?

A

Amazon Redshift Spectrum is a feature of Amazon Redshift that enables you torun queries against exabytes of unstructured data in Amazon S3with no loading or ETL required. When you issue a query, it goes to the Amazon Redshift SQL endpoint, which generates and optimizes a query plan. Amazon Redshift determines what data is local and what is in Amazon S3, generates a plan to minimize the amount of Amazon S3 data that needs to be read, requests Redshift Spectrum workers out of a shared resource pool to read and process data from Amazon S3.

92
Q

What are Amazon S3 Event Notifications?

A

Amazon S3 event notifications can be sent in response to actions in Amazon S3 like PUTs, POSTs, COPYs, or DELETEs. Notification messages can be sent through eitherAmazon SNS,Amazon SQS, or directly toAWS Lambda.

93
Q

What does it cost to use Amazon S3 event notifications?

A

There are no additional charges for using Amazon S3 for event notifications. You pay only for use of Amazon SNS or Amazon SQS to deliver event notifications, or for the cost of running an AWS Lambda function.

94
Q

S3 supports how many requests per second to add data?

A

3500

95
Q

S3 supports how many requests per second to retrieve data?

A

5500

96
Q

How many buckets can I have per account by default?

A

100