Storage Flashcards
1
Q
S3
A
- Bucket must have a globally unique name
- Bucket are defined at the region level
- Naming convention
- No uppercase
- No underscore
- 3-63 characters long
- Not an IP
- Must start with lowercase letter or number
- Object key is its full path
- Max 5TB
- More than 5GB, must use “multi-part upload”
- Strong Consistency
2
Q
S3 Standard General Purpose
A
- Use for frequently accessed data
- Low latency and high throughput
- Sustain 2 concurrent facility failures
3
Q
Infrequent Access
A
- Use for less frequently access but require rapid access when needed
- Standard IA
- Use Cases : Disaster Recovery and Backups
- One Zone IA
- Use Cases : Secondary backup
4
Q
Glacier
A
- Glacier Instant Retrieval
- milli second retrieval
- min 90 days storage
- Use Case : Data access once a quarter
- Glacier Flexible Retrieval
- Expedited 1-5 mins
- Standard 3-5 hours
- Bulk 5-12 hours
- min 90 days storage
- Glacier Deep Archive
- Standard 12 hours
- Bulk 48 hours
- min 180 days storage
5
Q
S3 Intelligent Tiering
A
- Small monthly monitoring and auto-tiering fee
- Moves objects automatically between Access Tiers based on usage
6
Q
S3 Moving Between Storage Classes
A
- For IA accessed object, move them to STANDARD_IA
- For archive objects, move to Glacier or Deep_Archive
- Moving objects can be automated using a lifecycle configuration
7
Q
S3 Lifecycle Rules
A
- Transition Rules
- defines when objects are transitioned to another storage class
- Expiration Rules
- Configure objects to expire after some time
- can be used to delete old versions of files if versioning is enabled
- can be used to delete incomplete multi part uploads
- Configure objects to expire after some time
8
Q
S3 Versioning
A
- Enable at bucket level
- Same key overwrite will increment the version
- Use Cases : 1 Protect against unintended delete 2 Easy roll back
- Any file is not versioned prior to enabling versioning will have version “null”
- Suspending versioning does not delete the previous versions
9
Q
S3 Replication
A
- Must enable versioning in source and destination
- Cross Region Replication
- Same Region Replication
- Buckets can be in different account
- Copying is asynchronous
- After activation, only new objects are replicated
- Optionally, you can replicate existing objects using S3 Batch Replication to replicate existing objects and objects that failed replication
- For DELETE operation
- can replicate delete markers from source to target
- deletions with a version ID are not replicated
- There is no chaining replication
10
Q
S3 Performance
A
- Durability 99.999999999%
- Availability 99.99%
- 100-200ms latency
- 3500 Put Copy Post Delete /sec /prefix in a bucket
- 5500 Get Head /sec /prefix in a bucket
- Multi Part Upload
- Recommended for files > 100 MB
- Required for files > 5GB
- S3 Transfer Acceleration
- Increase transfer speed by transferring file on an AWS edge location which will forward the data to the S3 Bucket in the target region
- Compatible with multi-part upload
- S3 Byte-Range Fetches
- Parallelize GETs by requesting specific byte ranges
- Better resilience in case of failures
11
Q
S3 KMS
A
- SSE-KMS will be impacted by KMS limit
- When upload, calls GenerateDataKey KMS API
- When download, calls Decrypt KMS API
12
Q
SSE-S3
A
- “x-amz-server-side-encryption” : “AES256”
13
Q
SSE-KMS
A
- Pros : Use Control + Audit Trail
- “x-amz-server-side-encryption” : “aws:kms”
14
Q
SSE-C
A
- HTTPS must be used
- Encryption key must be provided in HTTP header
15
Q
S3 Bucket Settings for Block Public Access
A
- Block public access to buckets and objects
- Block public and cross-account access to buckets and objects thr any public bucket or access point policies
16
Q
S3 Event Notification with Amazon EventBridge
A
- Advanced Filtering options with JSON rules (metadata, object size, name, …)
- Multiple Destinations –> Step Functions, Kinesis Streams/Firehose
- EventBridge Capability –> Archive, Replay Events, Reliable delivery