Advanced Amazon S3 Flashcards
You can transition ________ between storage classes
objects
- For infrequently accessed objects, move them to Standard-IA
- For archive objects that you don’t need fast access to, move them to Glacier or Glacier Deep Archive
- Moving objects can be automated using lifecycle rules
What are some features of Amazon S3 lifecycle rules?
- Transition actions: configure objects to transition to another storage class
  - Move objects to the Standard-IA class 60 days after creation
  - Move to Glacier for archiving after 6 months
- Expiration actions: configure objects to expire (delete) after some time
  - Access log files can be set to delete after 365 days
  - Can be used to delete old versions of files (if versioning is enabled)
  - Can be used to delete incomplete multipart uploads
- Rules can be created for a certain prefix (ex. s3://mybucket/mp3/*)
- Rules can be created for certain object tags (ex. Department: Finance)
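The rules on this card can be sketched as a lifecycle configuration. This is a minimal sketch, assuming the dictionary shape that boto3's `put_bucket_lifecycle_configuration` accepts. The rule ID, the `mp3/` prefix, and the 30-day and 7-day values are illustrative assumptions, not values from the card.

```python
def build_lifecycle_configuration(prefix: str = "mp3/") -> dict:
    """Build a lifecycle configuration mirroring the card's examples."""
    return {
        "Rules": [
            {
                "ID": "archive-then-expire",  # hypothetical rule name
                "Status": "Enabled",
                "Filter": {"Prefix": prefix},  # rule applies to this prefix only
                # Transition actions: Standard-IA after 60 days, Glacier after ~6 months
                "Transitions": [
                    {"Days": 60, "StorageClass": "STANDARD_IA"},
                    {"Days": 180, "StorageClass": "GLACIER"},
                ],
                # Expiration action: delete current versions after 365 days
                "Expiration": {"Days": 365},
                # Delete old versions (only meaningful if versioning is enabled);
                # 30 days is an assumed value
                "NoncurrentVersionExpiration": {"NoncurrentDays": 30},
                # Clean up incomplete multipart uploads; 7 days is an assumed value
                "AbortIncompleteMultipartUpload": {"DaysAfterInitiation": 7},
            }
        ]
    }


config = build_lifecycle_configuration()
# With boto3 installed and credentials configured, this could be applied with:
# boto3.client("s3").put_bucket_lifecycle_configuration(
#     Bucket="mybucket", LifecycleConfiguration=config)
```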
__________ help you decide when to transition objects to the right storage class
Amazon S3 Analytics - Storage Class Analysis
What are some features of Amazon S3 Analytics - Storage Class Analysis?
- Recommendations for Standard & Standard-IA
- Does not work for One Zone-IA or Glacier
- Report is updated daily
- 24-48 hours to start seeing data analysis
- Good first step to put together lifecycle rules (or improve them)
What are some features of S3 Requester Pays?
- In general, bucket owners pay for all Amazon S3 storage & data transfer costs associated with their bucket
- With Requester Pays buckets, the requester (instead of the bucket owner) pays the cost of the request and of the data download from the bucket
- Helpful when you want to share large datasets with other accounts
- The requester must be authenticated in AWS (can’t be anonymous)
What are some features of S3 event notifications?
- Event types: s3:ObjectCreated, s3:ObjectRemoved, s3:ObjectRestore, s3:Replication
- Object name filtering is possible (*.jpg)
- Use case: generate thumbnails of images uploaded to S3
- Can create as many “S3 events” as desired
- S3 Event notifications typically deliver events in seconds but can sometimes take a minute or longer
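The thumbnail use case above can be sketched as a notification configuration. This is a minimal sketch, assuming the dictionary shape that boto3's `put_bucket_notification_configuration` accepts. The function name, the configuration `Id`, and the Lambda ARN are hypothetical.

```python
def build_thumbnail_notification(lambda_arn: str) -> dict:
    """Notify a Lambda function whenever a .jpg object is created."""
    return {
        "LambdaFunctionConfigurations": [
            {
                "Id": "thumbnail-on-jpg-upload",  # hypothetical identifier
                "LambdaFunctionArn": lambda_arn,
                "Events": ["s3:ObjectCreated:*"],
                # Object name filtering: only keys ending in .jpg
                "Filter": {
                    "Key": {"FilterRules": [{"Name": "suffix", "Value": ".jpg"}]}
                },
            }
        ]
    }


notification = build_thumbnail_notification(
    "arn:aws:lambda:us-east-1:123456789012:function:make-thumbnail"  # placeholder ARN
)
# With boto3 installed and credentials configured, this could be applied with:
# boto3.client("s3").put_bucket_notification_configuration(
#     Bucket="mybucket", NotificationConfiguration=notification)
```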
For S3 event notifications to work, you need to have __________
IAM permissions, granted through a resource (access) policy on the destination: an SNS resource (access) policy for an SNS topic, an SQS resource (access) policy for an SQS queue, or a Lambda resource policy for a Lambda function
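As an illustration of the SQS case, a resource (access) policy that lets S3 send messages to a queue might look like the sketch below. The function name and both ARNs are placeholders. The `aws:SourceArn` condition is an assumed extra restriction that limits the grant to one bucket.

```python
import json


def build_sqs_access_policy(queue_arn: str, bucket_arn: str) -> str:
    """Return a JSON queue policy allowing S3 to send event messages."""
    policy = {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                "Principal": {"Service": "s3.amazonaws.com"},
                "Action": "sqs:SendMessage",
                "Resource": queue_arn,
                # Only events originating from this bucket are accepted
                "Condition": {"ArnLike": {"aws:SourceArn": bucket_arn}},
            }
        ],
    }
    return json.dumps(policy, indent=2)


print(build_sqs_access_policy(
    "arn:aws:sqs:us-east-1:123456789012:s3-events",  # placeholder queue ARN
    "arn:aws:s3:::mybucket",                         # placeholder bucket ARN
))
```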
Having ___________ with S3 event notification can give you advanced filtering options with JSON rules
Amazon EventBridge
- Multiple destinations (ex. Step Functions, Kinesis Data Streams / Firehose)
- EventBridge capabilities: archive and replay events, reliable delivery
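A JSON rule for advanced filtering might look like the following sketch. It assumes the event shape S3 sends to EventBridge (`source` of `aws.s3`, a `detail` object with `bucket` and `object` fields). The bucket name, key prefix, and size threshold are illustrative assumptions.

```python
import json

# Hypothetical EventBridge event pattern: match objects created under
# images/ in one bucket, but only when they are larger than 1 MiB.
event_pattern = {
    "source": ["aws.s3"],
    "detail-type": ["Object Created"],
    "detail": {
        "bucket": {"name": ["mybucket"]},          # placeholder bucket name
        "object": {
            "key": [{"prefix": "images/"}],        # prefix matching on the key
            "size": [{"numeric": [">", 1048576]}], # numeric matching on size in bytes
        },
    },
}

pattern_json = json.dumps(event_pattern, indent=2)
print(pattern_json)
```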
What are some features of S3 baseline performance?
- Amazon S3 automatically scales to high request rates, with latency of 100-200 ms
- Your application can achieve at least 3,500 PUT/COPY/POST/DELETE or 5,500 GET/HEAD requests per second per prefix in a bucket
- There is no limit to the number of prefixes in a bucket
- So if you spread reads evenly across four prefixes, you can achieve 22,000 requests per second for GET and HEAD
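The 22,000 figure is the per-prefix rate multiplied by the number of prefixes. A small worked example, with function names chosen only for illustration:

```python
# Baseline per-prefix request rates quoted on the card
WRITES_PER_SECOND_PER_PREFIX = 3_500   # PUT/COPY/POST/DELETE
READS_PER_SECOND_PER_PREFIX = 5_500    # GET/HEAD


def max_read_rate(prefix_count: int) -> int:
    """Aggregate GET/HEAD rate when reads are spread evenly across prefixes."""
    return prefix_count * READS_PER_SECOND_PER_PREFIX


def max_write_rate(prefix_count: int) -> int:
    """Aggregate PUT/COPY/POST/DELETE rate under the same even spread."""
    return prefix_count * WRITES_PER_SECOND_PER_PREFIX


print(max_read_rate(4))   # 4 * 5,500 = 22000
print(max_write_rate(4))  # 4 * 3,500 = 14000
```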
What are some ways to optimize S3 performance?
- Multipart upload:
  - Recommended for files > 100 MB
  - Must be used for files > 5 GB
  - Can help parallelize uploads (speed up transfers)
- S3 Transfer Acceleration:
  - Increases transfer speed by transferring the file to an AWS edge location, which forwards the data to the S3 bucket in the target region
  - Compatible with multipart upload
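To see why multipart upload parallelizes well, here is a minimal sketch that splits a file into independently uploadable parts. The 100 MiB default part size and the function name are assumptions. The 10,000-part ceiling is an S3 limit.

```python
MIB = 1024 * 1024
MAX_PARTS = 10_000  # S3 allows at most 10,000 parts per multipart upload


def plan_parts(file_size: int, part_size: int = 100 * MIB) -> list[tuple[int, int, int]]:
    """Return (part_number, offset, length) tuples covering the whole file."""
    if file_size <= 0 or part_size <= 0:
        raise ValueError("file_size and part_size must be positive")
    parts = []
    offset = 0
    part_number = 1  # part numbers start at 1
    while offset < file_size:
        length = min(part_size, file_size - offset)  # last part may be shorter
        parts.append((part_number, offset, length))
        offset += length
        part_number += 1
    if len(parts) > MAX_PARTS:
        raise ValueError("too many parts; choose a larger part_size")
    return parts


# A 250 MiB file becomes three parts that can be uploaded in parallel
for part in plan_parts(250 * MIB):
    print(part)
```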
How can you read (GET) files from S3 in the most efficient way?
Use S3 byte-range fetches
- Parallelize GETs by requesting specific byte ranges
- Better resilience in case of failures
- Can be used to speed up downloads
- Can be used to retrieve only partial data (for example the head of a file)
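Byte-range fetches rely on the HTTP `Range` header, whose byte positions are zero-based and inclusive. The sketch below, with a hypothetical helper name, computes the header values for splitting one object into parallel GETs.

```python
def range_headers(object_size: int, chunk_size: int) -> list[str]:
    """Return Range header values that together cover the whole object."""
    if object_size <= 0 or chunk_size <= 0:
        raise ValueError("object_size and chunk_size must be positive")
    headers = []
    for start in range(0, object_size, chunk_size):
        end = min(start + chunk_size, object_size) - 1  # end byte is inclusive
        headers.append(f"bytes={start}-{end}")
    return headers


# Each value can be sent on its own GET, so a failed chunk is retried alone
print(range_headers(2500, 1000))

# Retrieving only the head of a file: the first 100 bytes
print(range_headers(100, 100)[0])
```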
What are some features of S3 Select & Glacier Select?
- Retrieve less data using SQL by performing server-side filtering
- Can filter by rows & columns (simple SQL statements)
- Less network transfer and less CPU cost on the client side
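A row-and-column filter could be expressed as the request parameters below. This is a sketch, assuming the keyword arguments of boto3's `select_object_content` and a CSV object with a header row. The bucket, key, and column names are made up for illustration.

```python
def build_select_request(bucket: str, key: str, city: str) -> dict:
    """Build parameters asking S3 to return two columns of matching rows."""
    # NOTE: city is interpolated directly for brevity; real code should
    # validate or escape it before placing it in the SQL expression.
    expression = f"SELECT s.name, s.city FROM S3Object s WHERE s.city = '{city}'"
    return {
        "Bucket": bucket,
        "Key": key,
        "ExpressionType": "SQL",
        "Expression": expression,
        # USE treats the first CSV line as column names
        "InputSerialization": {"CSV": {"FileHeaderInfo": "USE"}},
        "OutputSerialization": {"CSV": {}},
    }


params = build_select_request("mybucket", "customers.csv", "Paris")  # placeholder names
print(params["Expression"])
# With boto3 installed and credentials configured, this could be sent with:
# boto3.client("s3").select_object_content(**params)
```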