Store Your Data Flashcards
Types of storage
Block
File
Object
Types of Block storage
An update rewrites only the changed blocks
EBS General Purpose SSD (solid-state drives)
EBS Provisioned IOPS SSD
EBS Throughput Optimized HDD (hard disk drives)
EBS Cold HDD
Types of file storage
EFS STANDARD
EFS infrequent access (IA)
FSx for Windows
FSx for Lustre
Types of object storage
An update replaces the whole old object with the new one
S3 Standard
S3 Standard-IA
S3 One Zone-IA
S3 Intelligent-Tiering
S3 Glacier
S3 Glacier Deep Archive
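Example: the difference between block and object updates can be sketched in a few lines. This is a toy model, not how EBS or S3 work internally; the 4-byte block size and the function names are assumptions chosen for readability.

```python
from __future__ import annotations

BLOCK_SIZE = 4  # bytes per block; tiny on purpose so the example is readable


def split_blocks(data: bytes) -> list[bytes]:
    """Cut data into fixed-size blocks, as a block device would."""
    return [data[i:i + BLOCK_SIZE] for i in range(0, len(data), BLOCK_SIZE)]


def block_update(old: bytes, new: bytes) -> int:
    """Bytes a block store rewrites: only the blocks that changed."""
    old_blocks, new_blocks = split_blocks(old), split_blocks(new)
    changed = [
        nb for i, nb in enumerate(new_blocks)
        if i >= len(old_blocks) or old_blocks[i] != nb
    ]
    return sum(len(b) for b in changed)


def object_update(old: bytes, new: bytes) -> int:
    """Bytes an object store rewrites: the whole new object."""
    return len(new)


old = b"AAAABBBBCCCC"
new = b"AAAAXXXXCCCC"  # only the middle block differs
print(block_update(old, new))   # 4
print(object_update(old, new))  # 12
```

One changed block costs 4 bytes in the block model and 12 bytes in the object model, which is why block storage suits frequently edited data and object storage suits write-once files.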
S3
Simple Storage Service
Global usage
Standard: data is stored across at least 3 Availability Zones (AZs)
Amazon Elastic Block Store (EBS)
Block storage dedicated to EC2 instances
Accessed only by ME
Each EBS volume is automatically replicated within its AZ
SNAPSHOT functionality
Example: transactional data like who what when cookies were bought
S3 one zone IA
One zone, copied three times in one AZ
S3 Intelligent Tiering
Automatic change of type of storage
Elastic File System
EFS
Shared within the same Region; can be accessed by multiple users and instances at the same time
Instance store
Temporary block storage on the EC2 host; use it only for temp files, because the data is lost if the instance stops, crashes, or terminates
S3 Glacier
Retrieval time:
1-5 min expedited
3-5 hours standard
5-12 hours bulk
Size of data does not matter
More expensive than Glacier deep archive
Delete after 5 years
S3 Glacier Deep Archive
Retrieval is rare: only for emergencies or compliance
Size of data doesn’t matter
7-10 years long-term retention
Storage is cheap; retrieval fees are high
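Example: archiving and later deleting data is usually expressed as an S3 lifecycle rule. The sketch below only builds the rule as a Python dict, in the shape the S3 lifecycle configuration API expects; it does not call AWS. The rule IDs, the helper name, and the 90-day and 180-day transition delays are assumptions for illustration.

```python
DAYS_PER_YEAR = 365


def lifecycle_rule(rule_id, storage_class, archive_after_days, delete_after_years):
    """Build one S3 lifecycle rule: archive first, then expire."""
    return {
        "ID": rule_id,
        "Filter": {"Prefix": ""},  # empty prefix: rule applies to every object
        "Status": "Enabled",
        "Transitions": [
            {"Days": archive_after_days, "StorageClass": storage_class}
        ],
        "Expiration": {"Days": delete_after_years * DAYS_PER_YEAR},
    }


# Move to S3 Glacier after 90 days (assumed), delete after 5 years.
glacier_config = {"Rules": [lifecycle_rule("glacier-5y", "GLACIER", 90, 5)]}

# Move to Glacier Deep Archive after 180 days (assumed), delete after 10 years.
deep_archive_config = {
    "Rules": [lifecycle_rule("deep-archive-10y", "DEEP_ARCHIVE", 180, 10)]
}

print(glacier_config["Rules"][0]["Expiration"]["Days"])       # 1825
print(deep_archive_config["Rules"][0]["Expiration"]["Days"])  # 3650
```

With boto3, a dict like this would be passed as the LifecycleConfiguration argument of put_bucket_lifecycle_configuration, one configuration per bucket.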
S3 common scenarios
Back up storage
Application hosting
Media hosting
Software delivery
DynamoDB
Fully managed NoSQL offering, available in most Regions
Key value and document database
Fully managed
Multi region
Built-in security, backup and restore, and in-memory caching for internet-scale apps
Cannot be used for SQL analysis of data stored in S3
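Example: "key value and document" means each item is looked up by a key and may contain nested data. The sketch below converts a plain Python dict into DynamoDB's typed attribute-value form (S, N, BOOL, NULL, L, M). The converter is a hand-written illustration covering only common types, and the order data and the choice of order_id as the key are hypothetical.

```python
def to_attribute_value(value):
    """Convert a Python value to DynamoDB's typed JSON form."""
    if isinstance(value, bool):  # check bool first: bool is a subclass of int
        return {"BOOL": value}
    if isinstance(value, (int, float)):
        return {"N": str(value)}  # numbers are sent as strings
    if isinstance(value, str):
        return {"S": value}
    if value is None:
        return {"NULL": True}
    if isinstance(value, list):
        return {"L": [to_attribute_value(v) for v in value]}
    if isinstance(value, dict):
        return {"M": {k: to_attribute_value(v) for k, v in value.items()}}
    raise TypeError(f"unsupported type: {type(value).__name__}")


order = {
    "order_id": "o-1001",                      # hypothetical partition key
    "customer": {"name": "Ana", "vip": True},  # nested document
    "items": ["cookie", "brownie"],
    "total": 7,
}
item = {k: to_attribute_value(v) for k, v in order.items()}
print(item["order_id"])  # {'S': 'o-1001'}
print(item["total"])     # {'N': '7'}
```

In practice boto3 ships its own serializer for this; the hand-rolled version is here only to show what the stored item looks like.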
Aurora
Fully managed, MySQL- and PostgreSQL-compatible relational database engine
Up to 5x faster than a standard MySQL DB and 3x faster than PostgreSQL, without changes to most of your existing applications
Speed and reliability of a high-end commercial database with the simplicity and cost-effectiveness of an open source database
Self healing
AWS RDS
Relational database service
Automated patches and backups
Stores structured (relational) data, not unstructured data
Cost efficient and resizable capacity while automating admin tasks like hardware provisioning and database setup
Amazon EKS
Elastic Kubernetes Service
Fully managed service that simplifies running Kubernetes on AWS by eliminating the need to install, operate, and maintain your own Kubernetes control plane
Amazon Elastic Container Service (ECS)
Container management service that handles containers on a cluster, including running and stopping them
Container-based apps can be launched and stopped using simple API calls
AWS Fargate
Serverless compute engine for containers, compatible with ECS and EKS
Amazon Aurora
Amazon Aurora is a MySQL- and PostgreSQL-compatible relational database built for the cloud that combines the performance and availability of traditional enterprise databases with the simplicity and cost-effectiveness of open source databases. You cannot use Aurora for SQL analysis on S3-based data.
Redshift
Amazon Redshift is a fully managed, petabyte-scale, cloud-based data warehouse designed for large-scale data set storage and analysis. Analytics can be run on Redshift, but when old data is residing on S3, Athena is the right choice, since analytics can be run directly while the data is sitting on S3. Redshift is not the answer for SQL analysis on S3-based data.
Amazon Redshift is a data warehousing service that you can use for big data analytics. It offers the ability to collect data from many sources and helps you understand relationships and trends across your data.
Athena
The right choice when analytics must be run directly while data is sitting on S3
Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.
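Example: submitting a query to Athena comes down to one API call with the SQL text, a database name, and an S3 location for the results. The sketch below uses the parameter names of boto3's start_query_execution, but runs against a fake client so it works offline. The table, database, and bucket names are hypothetical.

```python
def start_athena_query(client, sql, database, output_location):
    """Submit a SQL query to Athena and return its execution ID."""
    response = client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    return response["QueryExecutionId"]


class FakeAthenaClient:
    """Stand-in for boto3.client("athena") so the sketch runs without AWS."""

    def __init__(self):
        self.calls = []

    def start_query_execution(self, **kwargs):
        self.calls.append(kwargs)  # record the request for inspection
        return {"QueryExecutionId": "fake-id-001"}


client = FakeAthenaClient()
query_id = start_athena_query(
    client,
    sql="SELECT item, COUNT(*) AS sold FROM sales GROUP BY item",
    database="shop_db",                                     # hypothetical
    output_location="s3://example-bucket/athena-results/",  # hypothetical
)
print(query_id)  # fake-id-001
```

With a real client, the query runs asynchronously: the returned ID is then used to poll for status and fetch results.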
Amazon FSx for Windows File Server
Amazon FSx for Windows File Server provides fully managed, highly reliable, and scalable file storage that is accessible over the industry-standard Server Message Block (SMB) protocol. It is built on Windows Server, delivering a wide range of administrative features such as user quotas, end-user file restore, and Microsoft Active Directory (AD) integration.
Read replicas in Amazon RDS
Amazon Relational Database Service (Amazon RDS) makes it easy to set up, operate, and scale a relational database in the cloud. Read replicas are read-only copies that are kept in sync with your primary (master) database through asynchronous replication. You can also place a read replica in a different AWS Region, closer to your users, for better performance. Read replicas are an example of horizontal scaling of resources.
Also improve disaster recovery (DR)
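Example: horizontal scaling with read replicas means the application sends writes to the primary and spreads reads across the replicas. The router below is a minimal sketch of that idea, not an RDS feature; the class name, the endpoint hostnames, and the "starts with SELECT" check are simplifying assumptions.

```python
import itertools


class ReplicaRouter:
    """Send writes to the primary and spread reads across read replicas."""

    def __init__(self, primary, replicas):
        self.primary = primary
        # Round-robin over replicas; fall back to the primary if there are none.
        self._read_cycle = itertools.cycle(replicas or [primary])

    def endpoint_for(self, sql):
        is_read = sql.lstrip().upper().startswith("SELECT")
        return next(self._read_cycle) if is_read else self.primary


router = ReplicaRouter(
    primary="primary.example.internal",
    replicas=["replica-1.example.internal", "replica-2.example.internal"],
)
print(router.endpoint_for("SELECT * FROM orders"))           # replica-1.example.internal
print(router.endpoint_for("INSERT INTO orders VALUES (1)"))  # primary.example.internal
print(router.endpoint_for("SELECT * FROM orders"))           # replica-2.example.internal
```

Adding another replica to the list increases read capacity without touching the primary, which is the horizontal-scaling point of the card.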