Domain 2 Flashcards
What is scaling?
Scaling in Amazon Web Services (AWS) refers to the ability to adjust resources allocated for your applications to meet demand efficiently. AWS provides two main types of scaling: vertical scaling and horizontal scaling.
Explain vertical scaling
This involves changing the instance type within the same instance family to a larger or smaller size. For example, moving from a t2.micro to a t2.large instance.
Vertical scaling is limited by the size of the server and typically involves downtime since the instance needs to be stopped and restarted.
Explain horizontal scaling
This involves adding more instances of the same size (scaling out) or removing instances (scaling in) to adjust to demand.
AWS services like Auto Scaling Groups in EC2 enable horizontal scaling by automatically adding or removing instances based on defined policies, thresholds, and metrics.
Horizontal scaling is usually more flexible and can be achieved without downtime.
Explain elasticity
Elasticity in the context of Amazon Web Services (AWS) refers to the ability of a system to dynamically adjust its resource allocation to match the current demand.
Key word ‘Auto’
Explain auto scaling
Auto Scaling in AWS is a feature that automatically adjusts the number of compute resources available to your applications based on demand.
Scales based on predfined rules
Explain EC2 auto scaling
This service automatically adjusts the number of EC2 instances in an Auto Scaling group. It uses scaling policies and rules based on metrics like CPU utilization, request count, or custom CloudWatch metrics to make scaling decisions.
Explain Lambda scaling
AWS Lambda automatically scales the number of concurrent executions in response to incoming requests. There is no need to manage the underlying infrastructure, and it scales horizontally to accommodate varying workloads.
True or False: ECS and EKS support auto scaling
True
Explain RDS auto scaling
RDS supports read replica auto scaling, allowing it to add or remove read replicas based on demand, which helps in scaling read-heavy workloads.
Explain Dynamo DB auto scaling
DynamoDB offers auto scaling for its read and write capacity units, automatically adjusting the throughput based on traffic patterns.
Explain EMR auto scaling
EMR can automatically scale clusters based on demand, adding or removing nodes to optimize the performance and cost of big data processing.
Explain Elasticache auto scaling
ElastiCache can automatically scale clusters by adding or removing nodes in response to changing demands for in-memory data caching.
How does auto scaling work (name 3)
- Scaling plans and policies
- Health checks
- Integration with cloud watch
Explain amazon edge
Amazon Edge refers to a network of distributed servers and services that are part of Amazon Web Services (AWS) and designed to bring content and computing capabilities closer to end users. This proximity helps improve response times, reduce latency, and enhance the overall performance of applications and content delivery.
Services could include
Cloud Front
Global Accelerator
Outposts
Wavelength
Local zones
Lambda Edge
Explain transfer family
Amazon Transfer Family is a fully managed service from AWS that enables you to transfer files securely into and out of Amazon S3 or Amazon EFS using Secure File Transfer Protocol (SFTP), File Transfer Protocol Secure (FTPS), and File Transfer Protocol (FTP). It is designed to simplify and automate the exchange of files between your organization and external parties, making it easier to manage data transfers securely, efficiently, and in compliance with regulatory requirements.