Store Your Data Flashcards

1
Q

Types of storage

A

Block
File
Object

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Types of Block storage

A

Stores only the changes

EBS 
general purpose SSD (solid state drives)
Provisions IOPS SSD 
Throughout optimised HDD (hard disk drives)
Cold HDD
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Types of file storage

A

EFS STANDARD
EFS infrequent access (IA)
FSx for Windows
FSx for Lustre

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Types of object storage

A

Replaces old with new

S3 Standard 
S3 standard IA
S3 one zone IA
S3 intelligent tiering 
S3 glacier 
S3 glacier deep archive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

S3

A

Simple storage services
Global usage

Standard - grater than or equal to 3 AZ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Amazon elastic block storage (EBS)

A

Dedicated in EC2
Accessed only by ME
each EBS volume is auto replicated within its AZ
SNAPSHOT functionality

Example: transactional data like who what when cookies were bought

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

S3 one zone IA

A

One zone, copied three times in one AZ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

S3 Intelligent Tiering

A

Automatic change of type of storage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Elastic file storage

A

EFS

Shared in the same region shared by multiple people

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Instance store

A

If EC2 crash stop hang, this temp file can be hosted in instances store

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

S3 Glacier

A

Retrieval time:
3-5 min expedite
3-5 hours standard
3-5 hrs but < 24 hrs bulk

Size of data does not matter

More expensive than Glacier deep archive

Delete after 5 years

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

S3 Glacier Deep Archive

A

No retrieval only for emergencies or compliance

Size of data doesn’t matter

7-10 years long term retention

Storage cheap, fees high

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

S3 common scenarios

A

Back up storage
Application hosting
Media hosting
Software delivery

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

DynamoDB

A

Fully managed noSQL offering, available in most regions for users to consume

Key value and document database
Fully managed
Multi region
Built in security, backup and restore, in memory caching for internet scale apps

Cannot use for SQL on s3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Aurora

A

Fully managed MySQL and PostgreSQL compatible, relational database engine

5x fatter than a traditional mySQL DB and 3x of postgreSQL without changes to most of your existing applications

Speed and reliability of high end commercial database with the simplicity of cost effectiveness of open source database

Self healing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

AWS RDS

A

Relational database service

Automated patches and backups
Allows you to store unstructured data
Cost efficient and resizable capacity while automating admin tasks like hardware provisioning and database setup

17
Q

AWS EKS

A

Elastic kubernetes service

Fully Managed service that simplifies kubernetes deployed on AWS by eliminating the need to install, operate, and or maintain its own Kubrnetes control plane

18
Q

Amazon elastic container service (ECS)

A

Container management that facilitates containers management on the cluster including running and stopping the Containers

The container based apples could be launched / stopped using simple API calls

19
Q

AWS Fargate

A

ECS and EKS compatible Serverless compute engine for containers

20
Q

Amazon aurora

A

Amazon Aurora is a MySQL and PostgreSQL-compatible relational database built for the cloud, that combines the performance and availability of traditional enterprise

databases with the simplicity and cost-effectiveness of open source databases. You cannot use Aurora for SQL analysis on S3 based data.

21
Q

Redshift

A

Amazon Redshift is a fully-managed petabyte-scale cloud-based data warehouse product designed for large scale data set storage and analysis. Amazon Redshift is the most popular and fastest cloud data warehouse. Though analytics can be run on Redshift, in the current use case, old data is residing on S3, and Athena is the right choice since analytics can be run directly while data is sitting on S3. You cannot use Redshift for SQL analysis on S3 based data.

Amazon Redshift is a data warehousing service that you can use for big data analytics. It offers the ability to collect data from many sources and helps you to understand relationships and trends across your data

22
Q

Athena

A

since analytics can be run directly while data is sitting on S3

Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.

23
Q

Amazon FSx for Windows File Server

A

Amazon FSx for Windows File Server provides fully managed, highly reliable, and scalable file storage that is accessible over the industry-standard Service Message Block (SMB) protocol. It is built on Windows Server, delivering a wide range of administrative features such as user quotas, end-user file restore, and Microsoft Active Directory (AD) integration.

24
Q

Read replicas in Amazon RDS

A

Amazon Relational Database Service (Amazon RDS) makes it easy to set up, operate, and scale a relational database in the cloud. Read replicas allow you to create read-only copies that are synchronized with your master database. You can also place your read replica in a different AWS Region closer to your users for better performance. Read replicas are an example of horizontal scaling of resources.

Also for Improved DR

25
Q

Well architected framework pillars

A

Operational excellence, Security, Reliability, Performance efficiency, Cost optimization

26
Q

AWS cost and usage report

A

One stop shop for accessing most granular data on your AWS costs and usage, can also load your costs and usage information into Athena, redshift, quick sight, or a tool of your choice

27
Q

Minimum number of AZ to set up your application load balancer

A

2 AZs

28
Q

AWS OpWorks

A

Config management services that helps customers configure and operate applications both on premises and in AWS cloud

29
Q

Programmatically interaction

A

Via an API to send and receive messages

SDK and access keys

30
Q

Amazon Neptune

A

Fast reliable fully managed grah database service that makes it easier to build and run ap that work with highly connected data sets