S3 Storage Flashcards

1
Q

What are S3 use cases?

A
  • Backup / Archive
  • Disaster recovery
  • Hybrid cloud storage
  • Application hosting
  • Media hosting
  • Big data
  • Software delivery
  • Static web site
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

S3 Overview - Buckets

A
  • Objects (files) stored in buckets
  • Buckets have globally unique name
  • Defined at region level - but stored in 3 AZs
  • Follow naming convention
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

S3 Overview - Objects

A
  • Objects (files) have a key
  • Key is the full path: bucket + file name
    s3: //my-bucket/m-file.txt
  • There are no folders/directories in buckets, just looks that way
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How are S3 buckets secured ?

A
  • User based (IAM policies, user with AWS account)
  • Resource based - bucket policies (allow public access, cross account access)
  • Access by EC2 instance requires IAM role
  • Encryption
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What security/control is based on S3 Bucket Policies ?

A
  • JSON based
  • Grant public access
  • Force encryption at upload
  • Grant cross account access
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

S3 Web Site

A
  • Setting to use as web site
  • URL: .s3-website-.amazon.aws.com
  • Make sure bucket policy allow for public access or else 403 error happens
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

S3 Versioning

A
  • Enabled at bucket level
  • Version incremented at overwrite
  • Versioning is best practice
  • Delete markers
  • null version for files added before enabling versioning
  • Suspending versioning does not delete previous entries
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are S3 Access Logs

A
  • Used for access/request auditing
  • a ‘data’ bucket is configured to use a ‘logging’ bucket (use Edit Server Access Logging to enable)
  • logs are written to logging bucket and analysed using AWS tools
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are types of S3 replication and use cases?

A
  • CRR - Cross Region Replication
    • use: compliance, across accounts, lower latency
  • SRR - Same Region Replication
    • use: log aggregation, live replication between prod and test
  • must enable versioning in source and target buckets
  • must have proper IAM permissions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the S3 Storage Classes

A
  • S3 Standard - General purpose
  • S3 Standard - Infrequent Access (IA)
  • S3 One Zone - Infrequent Access
  • S3 Intelligent Tiering
  • Glacier
  • Glacier Deep Archive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

S3 Durability / Availability

A
  • High durability across all classes

- Availability varies on class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

S3 Standard Use Case

A
  • 99.99% Availability
  • Frequently accessed data
  • low latency / high throughput
  • sustain 2 concurrent facility failures
  • Big data analytics, mobile and gaming, content distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

S3 Standard - Infrequent Access

A
  • 99.9% Availability
  • For data that is less frequently accessed, but requires rapid access when needed
  • Lower cost, but has retrieval fee
  • Stored in multiple AZ’s
  • Use: disaster recover. Backups
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

S3 Intelligent Tiering

A
  • 99.9% Availability
  • Low latency / high throughput like S3 standard
  • Moves between 2 access tiers - S3 Standard and S3 Standard IA
  • Good for backups / disaster recovery files
  • Cost optimized - automatically moves objects based on access patters (frequent access/Infrequent access)
  • No retrieval fees
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

S3 One-Zone Infrequent Access

A
  • 99.5 availability (lower because in single AZ, not redundant)
  • Same as IA but single AZ
  • Lower cost that Standard IA
  • Use: storing secondary backup, data that can be recreated
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Glacier and Glacier Deep Archive

A
  • Low cost meant for archiving
  • Long term data retention
  • Low cost + retrieval fees
  • Glacier tiered up to 12 hours retrieval
  • Deep archive 12-48 hours retrieval
17
Q

S3 Moving Data between Classes

A
  • Can be configured to optimize cost
  • Automated through lifecycle configuration. Set on bucket / Create lifecycle rule
  • can move through all tiers and ultimately delete the data
18
Q

What are Snowcone, Snowball Edge and Snowmobile. What is Edge computing

A
  • Snowcone - Small portable device. 8 TB. Edge capable
  • Use case- data collected where there is not network
  • Snowball Edge - TBs or PBs of data. Can process the data while it is being moved. Edge capable
  • Storage Optimized (80 TB)
  • Compute Optimized (42 TB)
  • Snowmobile - truck that moved exabytes of data
  • Edge Computing - process data at location with no internet/cloud access
19
Q

AWS Storage Gateway

A
  • Hybrid cloud storage
  • Bridge between on-premise and S3 cloud storage
  • On-premise can seamlessly access S3
  • Use - disaster recovery, backup/restore, tiered storage
20
Q

What is OpsHub?

A
  • Manage Snowball devices graphically