Section 14: Advance Amazon S3 & Athena Flashcards
What is S3 MFA-Delete?
- MFA(multi factor authentication) for the user to generate a code on a device(mobile phone) before doing important operations on S3
- Only the bucket owner(root account) can enable/disable MFA-Delete.
Explain the different S3 classes
1) S3 Standard - General Purpose
- 99.99% Available
- Used for frequently accessed data
- Use Case: Big data analytics,mobile and gaming
2) S3 Standard - Infrequently Access
- For data that is less frequently access but requires rapid access when needed
- Lower cost than S3 Standard
a) AWS S3 Standard-Infrequent Access
-99.99% Availability
- Use cases: Disaster recovery,backups
b) AWS S3 One Zone-Infrequent Access
- High Durability(99.999999999%) in a single AZ
- Use Cases:Storing secondary backup copies of no-premise data
3) S3 Glacier Storage Class
- Low cost object storage meant for archiving /backup
a) Amazon S3 Glacier Instant Retrieval
- Millisecond retrieval, great for data access once a quarter
- Min storage duration of 90 days
b) Amazon S3 Glacier Flexible Retrieval
- Expedited(1 to 5minites),Standard(3 to5 hours),Bulk(5 to 12 hour)
- Min storage duration of 90 days
c) Amazon S3 Glacier Deep Archive - for long term storage:
- Standard (12 hours), Bulk(48 hours)
- Min storage duration of 180 days
4) S3 Intelligent - Tier
- Small monthly monitoring and auto-tiering fee
- Move objects automatically between Access Tiers bases on usage
- There are no retrieval charges in S3 Intelligent-Tiering
How to improve S3 performance?
1) Multi-Part upload
- Uploading of files in multiple part
- recommended for files > 100MB, must be used for files > 5GB
2) S3 Transfer Acceleration
- Increase transfer speed by transferring file to an AWS edge location which will forward it to your bucket at a higher speed.
What is S3 Requester Pays buckets?
P.160
- Normally, the onwer pays for the baucket and the cost to transfer the data.
- With Requester Pays bucket, the requester will pay the cost of the data transfer
- Helpeful when you want ot share large datasets with other accounts.
- The requester must be authenticated in AWS.
What is Amazon Athena?
- Serverless query service to perform analytics against S3 objects.
- Use standard SQL language to query the files
- Exam tip: if they mention analyze data in S3 using serverless SQL, use Athena