AWS Storage Flashcards
Amazon Simple Stirage Service (S3) is good for
- Maintain backup archives, log files and DR images
- Running analytics on a big data at rest
- Hosting static websites
What is a difference between EBS and S3
With block-level storage on a raw physical storage device is divided into individual blocks whose use is managed by a file system. NTFS is a common filèsystem used by Windows, and Linux might use btrfs or ext4. The filesystem, on behalf of the installed OS, is responsible for allocating space for the files and data that are saved to the underlying device and for providing access whenever the OS needs to read some data.
An object storage system like S3, on the other hand, provides what you can think of as a
flat surface on which to store your data. This simple design avoids some of the OS-tz
complications of block storage and allows anyone easy access to any amount of probs
ally designed and maintained storage capacity.
When you write files to $3, they’re stored along with up to 2 KB of metadata.
What is default max of how many buckets you can create for a given account
- By default for a given account you can create 100 buckets
- it can be changed for additional fee
What is the unique scope of backer names
- The bucket name should be unique within an entire S3 system
- Though bucket and its content exist only within a single AWS region
What is prefix is used for
- S3 does not have a hierarchical structure but use of prefixes can simulate a more structured organization
- prefixes tell S3 to group objects together as related
- S3 recognizes folder/directory structures as they are uploaded and emulates their hierarchical design within the buckets , automatically converting slashes into delimiters.
- “/“ is a delimiter
Is there a limit to the Tom total amount of data that can be stored at S3
- There is no theoretical limit to the amount of data that can be stored at S3 bucket
- but the single object can not be larger then 5 TB
- individual upload cannot be larger then 5 GB
- Multiparty Upload feature should be used for any large object over 100MB large object upload. It automatically splits object into the smaller pieces and transmission of each pice can be restarted without restarting entire transmission
- Multiparty Upload is automatically used while one uses CLI and high-level API for uploads
How Amazon S3 Transfer Acceleration is working
If you need to transfer large files Amazon S3 Transfer Acceleration configuration could be used
- when buckets are configured to use TA uploaded are routed through geographically nearby AWS edge location and , from there routed using Amazon internal network
- Amazon S3 Acceleration Speed Comparison Tool can be used to find out if its helps
What are types of encryption to be used with S3
Two types on a high level
- Sever side encryption
- Client side encryption
What is SSE-S3
SSE-S3 is a server-side encryption used by S3 to encrypt and decrypt very step using AWS own enterprise standart
What is SSE-KMS
Sever-Side Encryption with AWS KMS-Managed Keys. Beyond SSE-S3 envelop keys are used with additional audit trail of key usage. You can import your own keys using AWS KMS service
What is SSE-C
Server-Side Encryption with Customer-Provided Keys (SSE-C). YTOu provide keys to used by S3 for encryption
Client-side encryption
It is possible to encrypt data before it is transferred to S3
This could be done using an AWS KMS-Managed Customer Master Key (CMK) which produces a unique key for each object before its uploaded.
- You can also use Client-Side Master Key, which you provide through the AWS S3 encryption client
The regulatory demands can force you to use client-side encryption as only viable solution to maintain a full control of keys.
S3 logging
- tracking S3 event to log files is disabled by default
- when you enable logging you would need to specify target bucket as well as bucket where data would be logged
- optionally you can specify delimiters and prefixes to make easier to identify logs from multiple buckets
Basic event logged are: - account IP address of the requestor
- source bucket name
The action that was requested (GET, PUT, POST, DELETE, etc) - the time the request was issued
- the response status (including error code)
S3 storage classes
Different storage classes of S3 and Glasier provide different levels of DURABILITY and AVAILABILITY
- S3 Standard
- S3 Standard-IA
- S3 One-Zone-IA
- S3 Intelligent-Tiering
S3 durability
- S3 standard 99,999999999 (11 nines)
- S3 Reduced Redundancy Storage (RRS) - 99,99 - available for historic purpose only and not recommended.