L6b - Cloud Systems Flashcards
Amazon’s elastic block storage (EBS)
What is it?
Is it persistent?
Amazon Elastic Block Store (Amazon EBS) is a storage service for Amazon EC2 instances (VM).
short: EBS is a disk for VM
persistent
Amazon’s elastic file system (EFS)
What is it?
Is it persistent?
It allows you to create and share file systems that can be accessed concurrently by multiple EC2 instances, providing a shared file storage solution for your applications.
persistent
Amazon’s S3 storage
What is it?
Is it persistent?
persistent object storage (object storage means, objects are accessed via HTTP GET/PUT/DELETE)
short: slow but cheap storage
persistent
What is S3 commonly used for?
- backup
- data archiving
- content storage for websites
- as a data lake for analytics
What are two purposes of load balancers?
performance: distribute load across machines
availability: send work only to available machines
What is DMZ?
It acts as a buffer zone to add an extra layer of security by isolating publicly accessible services, such as web servers or email servers, from the internal network to minimize the potential impact of security breaches.
What is the purpose of DMZ? What does it provide?
- protect the rest of the system
- firewall access
- create network groups
- controlled SSH access
- virtual network
What is Azure’s Jump Box?
you access only to one VM (the Jump Box) and then access the rest of the machine over the Jump Box
You can control and monitor access, restrict IPs that can connect, etc.
extra: Bastion even better (no public IP needed to connect)
The tiers in charge of processing requests are built as a pool of VMs. What are those called?
compute tiers
Why do we have them?
For isolation. Tiers are isolated from each other for security.
extra: often, a tier can only be accessed form the tier above
What are different approached to implement data tier?
- using an VM and running a database on it
- using a data service (no need to maintain the system, many additional features such as automatic replication, etc.)
- growing trend is towards managed data services (cloud native databases)
Amazon’s Route 53?
A distributed, reliable DNS
- maps systems inside the cloud to URLs and external addresses
- implements flow and traffic control according to user defined policies (location, latency, round robin, failures status, etc.), includes geo-routing (routing requests to other data centers)
Why do we need DNS resolution for the cloud using an additional service?
Most of the system in the cloud sit on virtual private networks…
but systems needs to be visible from the outside
What are the different storage needs?
- local disk (EBS)
- local file system (EFS)
- large storage (S3)
- archival (S3 Glacier)
What is a CDN?
A CDN manages traffic to web sites so that the content is downloaded from a near-by location (stored or cached there).