Data Engineering and Storage Flashcards
What is the maximum size of an S3 object?
5 TB
What are the available S3 storage classes?
- Standard
- Infrequent Access (IA)
- Intelligent Tiering
- One-Zone IA
- Glacier Instant Retrieval
- Glacier Flexible Retrieval
- Glacier Deep Archive
For S3 Intelligent Tiering, what are the available tiers and when does an object enter each of those tiers?
-Standard Tier - Default tier
-IA tier - Not accessed for 30 days
-Archive Instant Access Tier - Not accessed for 90 days
-Archive Access Tier- configurable from 90 days to 700+ days
-Deep Archive Access Tier - configurable from 180 to 700+ days
For S3 Intelligent Tiering, which tiers are automatically configured and which are optional?
Automatic: Standard, IA, Instant Archive Access
Optional: Archive Access, Deep Archive Access
What is the minimum object billable size for S3 Glacier Flexible Retrieval and Glacier Deep Archive?
40KB
What are the available retrieval times fo Glacier Flexible Retrieval?
-Expedited: 1-5 min
-Standard: 3-5 hours
-Bulk: 5-12 hours (Free)
What is the minimum object billable size for S3 Infrequent Access?
128 KB
What S3 feature can you use to automatically transition objects between S3 Storage Classes?
S3 lifecycle rules
What is a default use case for S3 One-Zone IA?
Storing historical data that can easily be regenerated
Aside from automatic storage class transition, what else can S3 Lifecycle rules be used for?
Delete old files or old file versions after a predetermined amount of time has passed
What can you use to discriminate which objects should be affected by an S3 lifecycle rule or not?
Object prefix and tags
What is S3 Analytics?
An S3 feature that analyses S3 objects and recommends storage classes for them.
Which storage classes can S3 Analytics recommend?
Standard or IA.
What are the User-Based security features available for S3?
IAM Policies
What are the resource-based security features available for S3?
-Bucket policies (Allows cross account)
-Object Access Control List (More fine grained)
-Bucket Access Control List (less common)
What are the main uses of Bucket Policies?
-Grant Public Access to files
-Grant Cross-Account access to files
-Enforce object encryption at upload
What are the types of server side encryption available for S3 objects?
-SSE - S3 (Enabled by default)
-SSE - KMS
-SSE - C
True or False: SSE-KMS has no impact on KMS usage quotas
False, SSE-KMS uses both GenerateDataKey and Decrypt APIs, and those may impact the quotas of the KMS service
True or False: SSE-C accepts both HTTP and HTTPS
False, only HTTPS
True or False: You can use bucket policies to force S3 to only accepts transfers through HTTPS
True
What are the retrieval types for Glacier Deep Archive data?
- Standard (12 hours)
- Bulk (48 hours)
What are the possible consumers for Kinesis Data Streams data?
- Applications
- Lambda
- Amazon Kinesis Data Firehose
- Managed Service for Apache Flink
For how long can data stay stored on Kinesis Data Stream?
365 days
What is the size of a data shard on Kinesis Data Streams?
1MB