S3-Advanced Flashcards
Moving between storage classes
Moving S3 objects between storage classes can be automated using ____.
lifecycle rules
S3 Lifecycle Rules can be made up of ____ actions and ____ actions.
Transition
Expiration
Using S3 Lifecycle Rules, you can setup ____ actions to configure objects to transition to another storage class.
transition
For example:
- move objects to Standard-IA class 60 days after creation
- move to Glacier for archiving after 6 months.
Using S3 Lifecycle Rules, you can setup ____ actions to configure objects to expire (delete) after some time.
expiration
For example:
- access log files can be set to deleted after 365 days
- can be used to delete old versions of files (if versioning is enabled)
- can be used to delete incomplete Multi-Part uploads
Using S3 Lifecycle Rules, rules can be created for a certain ____ or object ____.
prefix
tags
Lifecycle Rules - Scenario
Your application on EC2 creates images of thumbnails after profile photos are uploaded to S3. These thumbnails can be easily recreated, and only need to be kept for 60 days. The source images should be able to be immediately retrieved for these 60 days, and afterwards, the user can wait up to 6 hours. How would you design this?
S3 source images can be on Standard, with lifecycle configuration to transition them to Glacier after 60 days.
S3 thumbnails can be on One Zone-IA, with a lifecycle configuration to expire them (delete) after 60 days
Lifecycle Rules - Scenario
A rule in your company state that you should be able to recover your deleted S3 objects immediately for 30 days, although this may happen rarely. After this time, and for up to 365 days, delted objects should be recoverable within 48 hours.
Enable S3 Versioning in order to have object versions, so that “deleted objects” are in fact hidden by a “delete marker” and can be recovered.
Transition the “noncurrent version” of the object to Standard-IA
Transition afterwards the “noncurrent versions” to Glacier Deep Archive
S3 ____ help you decide when to transition objects to the right storage class.
Analytics
Amazon S3 Analytics provides recommendations for which two storage classes?
Which storage classes does it not work for?
works: Standard / Standard-IA
not work: One Zone-IA / Glacier
Amazon S3 Analytics can take __ to __ hours to start seeing data analysis.
The Amazon S3 Analytics report is updated with what frequency?
24 to 48 hours
daily
In general, ____ pay for all S3 storage and data transfer costs associated with their bucket.
bucket owners
With ___ buckets, the requestor instead of the bucket owner pays the cost of the request and the data download from the bucket.
The requestor must be authenticated in AWS (cannot be anonymous).
Requestor Pays
The S3 ____ bucket feature is helpful when you want to share large datasets with other accounts.
Requestor Pays
S3 event notifications typically get delivered to ther destination in seconds, but can sometimes take a minute or longer.
You can create as many S3 event notifications as you want.
S3 event notifications with Amazon EventBridge:
- advanced filtering options with JSON rules
- multiple destinations - ex step functions, kinesis streams / firehose …
- EventBridge Capabilities - archive, replay events, reliable delivery
S3 event notifications can be sent to what 3 AWS destinations?
You can also send all events to Amazon ____ for further processing.
SNS topic
SQS queue
Lambda function
EventBridge
S3 automatically scales to high request rates, latency __-__ ms
100-200
With S3 your application can achieve at least 3500 put/copy/post/delete or 5500 get/head requests per second per ____ in a bucket.
prefix
There are no limits to the number of prefixes in a bucket.
Multi-part S3 upload is:
- recommended for files > ____
- required for files > ____
Multi-part uploads help ____ uploads (speed up transfers).
100MB
5GB
parallelize
S3 ____ increases transfer speed by transferring files to an AWS edge location which will forward the data to the S3 bucket in the target region.
Compatible with multi-part upload.
Transfer Acceleration
S3 ____ parallelize GETs by requesting specific byte ranges.
This is used to speed up downloads.
Provide better resilience in case of failures.
Can be used to retrieve only partial data, such as the head of a file.
Byte-Range Fetches
Using S3 ____ you can perform bulk operations on existing S3 objects with a single request:
- modify object metadata & properties
- copy objects between buckets
-** encrypt un-encrypted objects - modify ACLs, tags
- restore objects from S3 Glacier
- invoke Lambda function to perform custom action on each object
Batch Operations
An S3 Batch Operations ____ consists of a list of objects, the action to perform, and optional parameters.
job