ELB & ASG Flashcards

1
Q

What does vertical scalability mean in AWS?

Explain the concept of vertical scalability.

How does vertical scalability work in AWS?

A

Vertical scalability in AWS refers to increasing the capacity or power of a single instance (such as an EC2 instance) by enhancing its resources, like adding more CPU, RAM, or storage to handle increased demands. It involves upgrading the existing instance’s size to accommodate more workload.

Vertical scalability is like upgrading your own superhero suit when you need more powers. It means making your computer in the cloud stronger by giving it more muscles (like adding more strength or memory) so it can do bigger tasks without needing more friends to help.

Rather than adding more instances, AWS makes instances more powerful

Vertical scalability is like giving your computer in the cloud a stronger superhero suit when it needs more powers.

2
Q

What is horizontal scalability?

Explain the concept of horizontal scalability.

How does horizontal scalability work in expanding resources?

A

Horizontal scalability refers to the capability of adding more instances (like more servers or computers) to a system to handle increased workload or demand. It involves scaling out by adding more similar units, spreading the load across multiple machines rather than increasing the power of a single machine.

Horizontal scalability is like inviting more friends to help build a huge Lego castle. Instead of making one person build faster, everyone brings their own Legos and builds together, making the castle bigger and stronger.

It’s about adding more computers or servers to share the work

Horizontal scalability is like inviting more friends to build a bigger Lego castle together.

3
Q

What is Load Balancing?

Explain the concept of load balancing.

How does load balancing work in computing?

A load balancer is a server that carries out the load-balancing process.

A

Load balancing is the process of distributing incoming network traffic across multiple servers or resources to ensure efficient utilization, optimal performance, and preventing any single server from getting overloaded. It helps evenly distribute work to handle varying levels of demand.

Load balancing is like sharing candies equally among friends so that no one feels left out. It’s about making sure that all the computers helping out with a task get an equal number of jobs to do, so none of them gets too tired or overwhelmed.

Ensure that all resources involved in a task share the work evenly

Load balancing is like sharing candies equally among friends, so no computer gets overloaded with too much work.
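The candy-sharing idea above can be sketched as a simple round-robin distributor. This is an illustrative toy in Python, not an AWS API; the server names are made up:

```python
from itertools import cycle

class RoundRobinBalancer:
    """Toy round-robin load balancer: requests rotate through the pool."""
    def __init__(self, servers):
        self._pool = cycle(servers)

    def route(self, request):
        # Each request goes to the next server in turn,
        # so no single server gets overloaded.
        return next(self._pool)

lb = RoundRobinBalancer(["ec2-a", "ec2-b", "ec2-c"])
assignments = [lb.route(f"req-{i}") for i in range(6)]
# Six requests split evenly: each server handles exactly two.
```

Round-robin is only the simplest strategy; real load balancers can also weigh targets by load or health.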

4
Q

What is the purpose of using load balancers?

Why are load balancers used in computing?

How do load balancers benefit systems?

A

Load balancers are used to evenly distribute incoming network traffic across multiple servers or resources, ensuring no single server gets overwhelmed and optimizing performance. They help in achieving high availability, scalability, and reliability by preventing overloads and providing redundancy.

Load balancers are like traffic managers for computers, making sure that no one computer gets too much work, just like a teacher making sure each student gets a fair share of activities in class.

They ensure no single EC2 instance gets too busy and that the fleet can handle lots of user requests.

Load balancers are like traffic managers for computers, ensuring they all share work equally and the system runs smoothly.

5
Q

What are the different types of load balancers available on AWS?

Explain the variations of load balancers on AWS.

How do the different load balancers on AWS function?

A

On AWS, there are four main types of load balancers:
1. Application Load Balancer (ALB): operates at the application layer (Layer 7).
2. Network Load Balancer (NLB): operates at the network/transport layer (Layer 4).
3. Gateway Load Balancer (GWLB): operates at Layer 3, distributing traffic to fleets of virtual appliances.
4. Classic Load Balancer (CLB): the legacy load balancer handling both layers but with fewer features. It's deprecated.

ALB is good for websites.
NLB is faster for more technical, raw-traffic workloads.
CLB is older and less feature-rich but still gets the job done.

AWS has ALB for websites, NLB for technical tasks, GWLB for virtual appliances, and CLB, the older legacy option.

ALB Use Cases: Web applications, API services, microservices architectures, content-based routing, modern application deployments using containers.
NLB Use Cases: High-performance scenarios, gaming applications, IoT (Internet of Things) setups, situations requiring static IP addresses, handling non-HTTP(S) protocols.

Choosing between ALBs and NLBs depends on the specific requirements of your application, the type of traffic you’re dealing with, and the level of functionality and features needed for your load balancing setup. Often, a combination of both types of load balancers is used within a system to cater to different traffic types and application needs.

6
Q

What is an Application Load Balancer (ALB)?

Explain the purpose and functionalities of an Application Load Balancer.

How is an Application Load Balancer used in AWS?

A

An Application Load Balancer (ALB) is a type of load balancer on AWS that operates at the application layer (Layer 7) of the OSI model. It intelligently directs incoming web traffic and routes requests to specific targets (such as EC2 instances or containers) based on content, allowing for more advanced routing and support for features like path-based routing and host-based routing.

ALB on AWS directs web traffic intelligently, sending requests to specific parts of an application for better performance.

7
Q

Application Load Balancer

Use-case Question: How can an Application Load Balancer be used to direct incoming traffic to different microservices within an application architecture?

use-cases

A

An ALB can utilize its advanced routing capabilities to examine incoming requests and route them to specific microservices within an application based on URL paths, headers, or hostnames. For instance, if an application has different microservices handling user authentication, profile management, and payments, the ALB can intelligently route requests to these services based on the URL paths or headers, ensuring efficient handling of different functionalities within the application.

ALBs are a great fit for microservices & container-based applications.

ALB manages web traffic, directing requests based on their content to different parts of an application running on servers
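The content-based routing described above can be sketched as a first-match rule table, the way ALB listener rules are evaluated. The service names, paths, and rule shapes here are hypothetical, not real ALB configuration syntax:

```python
def route_request(path, host=None):
    """Illustrative ALB-style rule evaluation: the first matching
    rule wins; unmatched requests hit the default action."""
    rules = [
        (lambda p, h: p.startswith("/auth"), "auth-service"),
        (lambda p, h: p.startswith("/profile"), "profile-service"),
        (lambda p, h: p.startswith("/pay"), "payment-service"),
    ]
    for matches, target_group in rules:
        if matches(path, host):
            return target_group
    return "default-service"  # like an ALB listener's default action
```

For example, `route_request("/pay/checkout")` lands on the hypothetical `payment-service` target group, while an unknown path falls through to the default.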

8
Q

What is a Target Group in AWS?

Explain the purpose and functionality of a Target Group in AWS.

How is a Target Group used within the AWS ecosystem?

A

In AWS, a Target Group is a logical grouping of targets, typically instances (like EC2 instances), containers, or IP addresses, for routing requests from a load balancer. It defines where the load balancer sends traffic by directing requests to registered targets based on configured rules and health checks.

A Target Group in AWS is like a team in a treasure hunt game. The team members (targets) are grouped together, and the Target Group (team) follows specific rules to guide the treasure (incoming requests) to the right team members, making sure the treasure hunt goes smoothly.

TG helps LB send requests to specific targets based on rules.

AWS Target Groups group together targets so load balancers can efficiently direct traffic to specific instances or resources.
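A minimal sketch of the idea, assuming made-up instance IDs: a target group is a set of registered targets with health status, and the load balancer only routes to the healthy ones.

```python
class TargetGroup:
    """Toy target group: registered targets plus health-check status."""
    def __init__(self):
        self.targets = {}  # target id -> healthy?

    def register(self, target_id):
        self.targets[target_id] = True  # healthy until a check fails

    def mark_unhealthy(self, target_id):
        self.targets[target_id] = False

    def healthy_targets(self):
        # The load balancer only sends traffic to passing targets.
        return [t for t, ok in self.targets.items() if ok]

tg = TargetGroup()
for instance in ("i-aaa", "i-bbb", "i-ccc"):
    tg.register(instance)
tg.mark_unhealthy("i-bbb")  # health check failed; stop routing to it
```

After `i-bbb` fails its health check, only `i-aaa` and `i-ccc` remain eligible for traffic.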

9
Q

What is a Network Load Balancer (NLB)?

Explain the purpose and functionalities of a Network Load Balancer.

How is a Network Load Balancer utilized within the AWS infrastructure?

A

A Network Load Balancer (NLB) in AWS is a high-performance load balancer that operates at the network layer (Layer 4) of the OSI model.

NLB has one static IP per AZ and supports assigning elastic IPs.

NLBs are used for extreme-performance TCP/UDP traffic.

NLB efficiently manages traffic at a network level, swiftly sending requests to different targets without much processing overhead.

Lower latency compared to ALB.

AWS Network Load Balancers efficiently direct traffic at a network level, ensuring fast and reliable routing to different servers.

10
Q

What is a Gateway Load Balancer (GWLB)?

Explain the purpose and functionalities of a Gateway Load Balancer.

How is a Gateway Load Balancer used within the AWS environment?


A

A Gateway Load Balancer (GWLB) in AWS is a highly scalable load balancing service that allows users to deploy, scale, and manage virtual appliances, such as firewalls, intrusion detection systems, and other network appliances, easily. It handles incoming traffic, distributing it across multiple virtual appliances to enhance security and performance.

A Gateway Load Balancer is like a superhero team leader assigning tasks to different superheroes (security appliances) to protect the city. It makes sure each superhero (virtual appliance) gets the right job (traffic) to keep the city (network) safe and running smoothly.

GWLB manages traffic flow to different security appliances to ensure better security and performance of the network.

AWS Gateway Load Balancers efficiently manage traffic among different security appliances to enhance network security and performance.

11
Q

What are Sticky Sessions?

Explain the concept of Sticky Sessions in web applications.

How do Sticky Sessions impact user sessions in web environments?

A

Sticky Sessions, also known as session affinity, is a mechanism in web applications where a load balancer directs a user’s requests to the same server for the duration of their session. It ensures that subsequent requests from the same user are sent to the server that initially served their first request, maintaining session persistence.

Sticky Sessions are like a waiter at a restaurant who remembers your table number and always brings your food to the same table, making sure you always sit in your favorite spot and don’t have to move around.

Sticky Sessions (session affinity) ensure that your requests go to the same server, making your website experience more consistent.
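Session affinity can be sketched as a cookie-to-server lookup table. This is a toy illustration: real ALBs generate their own AWSALB* cookies, and the `pick_server` helper and server names below are invented for the example.

```python
def pick_server(servers, session_cookie, assignments):
    """Sticky-session sketch: a session cookie pins a user to one server."""
    if session_cookie not in assignments:
        # First request from this session: pick the least-used server.
        counts = {s: list(assignments.values()).count(s) for s in servers}
        assignments[session_cookie] = min(servers, key=counts.get)
    # Every later request with the same cookie hits the same server.
    return assignments[session_cookie]

servers = ["ec2-a", "ec2-b"]
table = {}
first = pick_server(servers, "user-42", table)
second = pick_server(servers, "user-42", table)
# first == second: the session sticks to one server.
```

The trade-off: stickiness keeps server-side session state working, but it can unbalance load if a few sessions are much heavier than others.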

12
Q

What is Cross-Zone Load Balancing?

Explain the concept of Cross-Zone Load Balancing in AWS.

How does Cross-Zone Load Balancing impact load balancing within AWS?

A

Cross-Zone Load Balancing in AWS refers to the distribution of traffic evenly across instances in multiple Availability Zones. It ensures that incoming requests are directed across all available instances in different zones, optimizing performance and ensuring better fault tolerance by utilizing resources across zones.

Use-case Explanation: For instance, if an online shopping website utilizes Cross-Zone Load Balancing, it ensures that customer traffic is evenly distributed across servers in different Availability Zones. If one zone experiences high traffic or goes down, the other zones can handle the load, ensuring the website remains accessible and responsive.

Cross-Zone Load Balancing spreads traffic across different zones to make sure no single zone gets too busy, improving performance and reliability.

AWS Cross-Zone Load Balancing evenly spreads traffic across different Availability Zones, enhancing performance and fault tolerance.
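The effect of the setting can be shown with a small share calculation. This sketch assumes each AZ's load-balancer node receives an equal slice of the incoming traffic; the AZ names and target counts are hypothetical.

```python
def per_target_share(az_targets, cross_zone):
    """Fraction of total traffic each target receives.
    az_targets: {az_name: number_of_targets in that AZ}."""
    shares = {}
    n_azs = len(az_targets)
    total = sum(az_targets.values())
    for az, n in az_targets.items():
        if cross_zone:
            # Traffic is pooled evenly across ALL targets in every AZ.
            shares[az] = 1 / total
        else:
            # Each AZ's slice (1/n_azs) is split only among its own targets.
            shares[az] = (1 / n_azs) / n
    return shares

# Two AZs: one with 1 target, one with 4.
uneven = per_target_share({"us-east-1a": 1, "us-east-1b": 4}, cross_zone=False)
even = per_target_share({"us-east-1a": 1, "us-east-1b": 4}, cross_zone=True)
```

Without cross-zone balancing, the lone target in the first AZ absorbs 50% of all traffic; with it enabled, every target gets an equal 20% share.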

13
Q

What are TLS Certificates?

Explain the role and importance of TLS Certificates in web security.

How do TLS Certificates contribute to secure communication on the internet?

A

TLS Certificates, also known as SSL Certificates, are digital certificates that facilitate secure communication between a web browser and a server. They encrypt data transmitted over the internet, verifying the identity of websites and ensuring data integrity and confidentiality.

TLS Certificates are like secret codes that only the right people (websites and browsers) know to keep their messages safe from spies (hackers). It’s like using a special lock and key to keep a treasure box’s content secret while sending it across the internet.

TLS Certificates secure data transferred between websites and browsers.

TLS Certificates encrypt internet data, keeping it safe and ensuring that websites are trustworthy.

14
Q

What is Server Name Indication (SNI)?

Explain the purpose and functionality of Server Name Indication.

How does SNI enhance web server functionality?

Under the TLS protocol

A

Server Name Indication (SNI) is an extension of the TLS protocol that allows a server to host multiple SSL certificates for different domains on the same IP address. It enables the server to identify which certificate to present to the client during the SSL/TLS handshake, facilitating secure communication with multiple websites on a single server.

Analogy: Server Name Indication is like a magic tag attached to different doors in a house (server), telling guests (web browsers) which room (website) they want to visit. It helps the server show the right certificate to the browser, making sure everyone goes to the correct place securely.

  • Only works with ALB, NLB, or CloudFront; doesn't work with CLB.

SNI allows a server to handle multiple secure websites on the same IP.

Server Name Indication helps a server manage multiple secure websites on one IP address by showing the right certificate to web browsers.
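The "magic tag on the door" can be sketched as a hostname-to-certificate lookup. This is a conceptual toy, not a real TLS implementation; the certificate labels and domains are invented.

```python
def select_certificate(sni_hostname, cert_store):
    """SNI sketch: the server picks the certificate matching the
    hostname the client sent in the TLS ClientHello."""
    if sni_hostname in cert_store:
        return cert_store[sni_hostname]
    # No exact match: fall back to a default certificate.
    return cert_store.get("default")

certs = {
    "www.example.com": "cert-example",
    "api.example.com": "cert-api",
    "default": "cert-default",
}
```

Usage: `select_certificate("api.example.com", certs)` returns the certificate for that domain, even though both domains share one IP address; that single-IP multi-cert serving is exactly what SNI enables.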

15
Q

What is Connection Draining or Deregistration Delay?

Explain the concept and purpose of Connection Draining or Deregistration Delay.

How does Connection Draining/Deregistration Delay impact load balancers?

A

Connection Draining or Deregistration Delay is a feature in load balancers that allows existing connections to in-service instances to complete before removing an instance from the pool of available targets.

It ensures ongoing requests are completed before taking an instance out of service, preventing disruption and loss of data during the transition.
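The decision logic can be sketched in a few lines: stop sending new requests immediately, but only remove the target once its in-flight requests finish or the delay expires. The function name and return strings are invented for illustration; 300 seconds is used as a typical deregistration delay.

```python
def deregister(target, in_flight, max_wait, elapsed):
    """Connection-draining sketch. New requests stop being routed to
    the target at once; it is fully removed only when its in-flight
    requests finish or the deregistration delay (max_wait) expires."""
    if in_flight == 0 or elapsed >= max_wait:
        return f"{target} removed"
    return f"{target} draining"
```

So a busy instance keeps "draining" until its work is done, but a stuck one is still forced out when the delay runs out.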

16
Q

What is an Auto Scaling Group (ASG)?

Explain the purpose and function of an Auto Scaling Group.

How does an Auto Scaling Group operate within the AWS environment?

A

An Auto Scaling Group (ASG) is a feature in AWS that automatically adjusts the number of instances in response to varying demand. It helps maintain application availability, distributing traffic evenly and efficiently across multiple instances based on conditions set by the user, such as CPU utilization or network traffic.

Analogy: An Auto Scaling Group is like a magical team of robots that can make more copies of themselves when there’s a lot of work and go back to sleep when the job is done. They make sure there are always enough robots (instances) working without wasting energy.

ASGs ensure there are enough instances to handle varying levels of demand.

Auto Scaling Groups in AWS automatically manage the number of instances to match demand, ensuring efficiency and availability.

17
Q

Differences between Auto Scaling Group (ASG) and Target Group (TG)?

Explain the distinctions between ASG and TG in AWS.

How do ASG and TG serve different purposes within the AWS environment?

A

An Auto Scaling Group (ASG) manages the number of instances to match varying demand, automatically adjusting the number of servers based on pre-set conditions. On the other hand, a Target Group (TG) is used by load balancers to route incoming traffic to a group of registered instances.
- ASG handles instance management
- TG manages how traffic is distributed among instances.

Analogy: An Auto Scaling Group is like the leader of a robot team, making sure there are enough robots working or resting based on the job’s difficulty. A Target Group is like the traffic controller telling cars (requests) which roads (instances) to take to reach their destination (servers) safely.

ASG adjusts instance numbers,
TG directs traffic to instances managed by Auto Scaling Groups.

18
Q

What are ASG Dynamic Scaling Policies?

Explain the concept and purpose of Dynamic Scaling Policies in Auto Scaling Groups (ASG).

How do Dynamic Scaling Policies impact the behavior of ASGs?

A

ASG Dynamic Scaling Policies are rules or configurations set within an Auto Scaling Group (ASG) to automatically adjust the number of instances based on changing demand or conditions.
These policies define criteria like:
* CPU utilization
* network traffic
* custom metrics
triggering the ASG to add or remove instances accordingly to maintain performance and meet defined thresholds.

Analogy: ASG Dynamic Scaling Policies are like a game with a volume control that gets louder when more players join and quieter when they leave. It’s a way for the game (ASG) to adjust its volume (number of instances) based on how many players (demand) are playing.

ASG Dynamic Scaling Policies automatically adjust the number of instances based on demand to maintain performance and meet specified thresholds monitored by CloudWatch alarms.

19
Q

What are some types of ASG Dynamic Scaling Policies?

Explain various types of Dynamic Scaling Policies used in Auto Scaling Groups (ASG).

How do these different policies impact the behavior of Auto Scaling Groups in response to changing demand?

A
  1. Target Tracking Scaling: scales the number of instances to maintain a specific target metric, like CPU utilization or request count per instance.
    Example: keep average ASG CPU at 40%.
  2. Step Scaling: adjusts the number of instances based on a set of specified thresholds and corresponding scaling adjustments.
    Example: if CPU > 70%, add 2 instances; if CPU < 20%, remove 1 instance.
  3. Simple Scaling: increases or decreases the number of instances by a single specified adjustment when a CloudWatch alarm triggers.
    Example: when the high-CPU alarm fires, add 1 instance.
  4. Predictive Scaling: forecasts load based on historical usage patterns.

CloudWatch alarms are used for monitoring target metric in all scenarios

ASG offers various Dynamic Scaling Policies like Target Tracking, Step Scaling, and Simple Scaling to manage the number of instances based on different conditions or thresholds.
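The step-scaling example above (CPU > 70% adds 2, CPU < 20% removes 1) can be written out directly. This is a conceptual sketch of the policy's decision, not the EC2 Auto Scaling API:

```python
def step_scaling_adjustment(cpu_percent):
    """Step-scaling sketch matching the example policy:
    CPU > 70%  -> add 2 instances
    CPU < 20%  -> remove 1 instance
    otherwise  -> no change."""
    if cpu_percent > 70:
        return +2
    if cpu_percent < 20:
        return -1
    return 0
```

In AWS, the metric evaluation itself is done by a CloudWatch alarm; the ASG just applies the configured adjustment when the alarm fires.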

20
Q

What is a Scaling Metric?

Define the concept and role of a Scaling Metric in Auto Scaling.

How does a Scaling Metric influence the behavior of ASG?

A

A Scaling Metric is a parameter or measure used by Auto Scaling Groups (ASGs) to make decisions about scaling instances up or down.
It can be various factors like
* CPU utilization
* network traffic
* memory usage
* custom application-specific metrics.

ASGs use these metrics to determine when to add or remove instances to maintain optimal performance and meet defined thresholds.

CloudWatch alarms are used for monitoring target metric in all scenarios

Scaling Metrics are measures used by ASGs to decide when to add or remove instances based on factors like CPU usage or traffic.

21
Q

What are ASG Scaling Cooldowns?

Explain the concept and purpose of Scaling Cooldowns in Auto Scaling Groups (ASG).

How do Scaling Cooldowns impact behavior of ASG in response to changes?

A

ASG Scaling Cooldowns are time periods during which Auto Scaling Groups (ASGs) avoid launching or terminating additional instances after a scaling activity. They prevent rapid fluctuations in instance count by imposing a wait time between scaling actions, ensuring stability and avoiding unnecessary instance changes.

ASG Scaling Cooldowns are like breaks between levels in a game. It gives the team (ASG) time to rest before playing the next level (scaling activity), so they’re not tired or confused and can perform better.

The default cooldown period is 300 seconds.

ASG Scaling Cooldowns prevent rapid instance changes by imposing wait times between scaling actions, ensuring stability in the Auto Scaling process.
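The cooldown gate is essentially one comparison: a new scaling action is only allowed once enough time has passed since the last one. A minimal sketch, using the 300-second default mentioned above (timestamps are plain seconds for illustration):

```python
def should_scale(now, last_scale_time, cooldown=300):
    """Cooldown sketch: allow a new scaling action only after the
    cooldown period has elapsed since the previous one."""
    return (now - last_scale_time) >= cooldown
```

So a scaling trigger 200 seconds after the last action is suppressed, while one 400 seconds later goes through. This is what prevents the ASG from thrashing up and down on a noisy metric.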

22
Q

Application Load Balancer

You are using an Application Load Balancer to distribute traffic to your website hosted on EC2 instances. It turns out that your website only sees traffic coming from private IPv4 addresses which are in fact your Application Load Balancer’s IP addresses. What should you do to get the IP address of clients connected to your website?

A

Modify your website’s backend to get the client IP address from the X-Forwarded-For header.

When using an Application Load Balancer to distribute traffic to your EC2 instances, the IP addresses you’ll receive requests from will be the ALB’s private IP addresses. To expose the client’s IP address, the ALB adds an additional header called “X-Forwarded-For” which contains the client’s IP address.
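The backend-side fix can be sketched as a small header parser. Since proxies append to X-Forwarded-For, the left-most entry is the original client (assuming upstream values aren't spoofed); the sample IPs below are from documentation ranges:

```python
def client_ip(headers):
    """Read the original client IP behind an ALB from the
    X-Forwarded-For header: 'client, proxy1, proxy2, ...'."""
    xff = headers.get("X-Forwarded-For", "")
    if xff:
        # Left-most entry is the original caller.
        return xff.split(",")[0].strip()
    return None  # no proxy header: caller connected directly

ip = client_ip({"X-Forwarded-For": "203.0.113.7, 10.0.0.5"})
```

Most web frameworks offer this parsing built in (often behind a "trusted proxy" setting), which is safer than trusting the raw header blindly.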

23
Q

Sticky Session

You want to create a custom application-based cookie in your Application Load Balancer. Which of the following can you use as a cookie name?
1. AWSALBAPP
2. APPUSERC
3. AWSALBTG
4. AWSALB

A

2) APPUSERC

The following cookie names are reserved by the ELB (AWSALB, AWSALBAPP, AWSALBTG).

24
Q

ALB vs NLB:

Significance of each in terms of use-case

A

The major difference lies in the layer at which they operate and the types of traffic they handle.

Layer of Operation:
- ALB (Application Load Balancer): Operates at Layer 7 of the OSI model, also known as the application layer. It understands HTTP and HTTPS traffic and can make routing decisions based on the content of the packets, allowing for more advanced routing capabilities.
- NLB (Network Load Balancer): Operates at Layer 4 of the OSI model, the transport layer. It handles TCP and UDP traffic and focuses on efficiently distributing traffic across servers at a lower level than ALBs. NLBs are more suited for raw network traffic without inspecting the content.

Handling Traffic:
- ALB: Designed for applications that require content-based routing, such as routing based on URL paths, headers, or cookies. It’s well-suited for modern web applications, APIs, and microservices architectures.
- NLB: Primarily used for handling TCP and UDP traffic without the need for inspecting the content. NLBs excel in scenarios requiring high-performance, low-latency, and handling non-HTTP(S) protocols like gaming, IoT, or other specialized applications.

Functionality and Features:
- ALB: Offers advanced features like content-based routing, support for path-based and host-based routing, integration with container services, and SSL termination, making it suitable for modern web applications and microservices.
- NLB: Provides high throughput, low-latency performance, static IP addresses, and efficient handling of TCP and UDP traffic, making it ideal for scenarios where performance and scalability are critical.

Static IP Addresses:
- ALB: Does not provide static IP addresses. Clients connect to ALBs through DNS names.
- NLB: Can be associated with static IP addresses, allowing clients to connect to the applications using fixed IP addresses.

Choosing Between ALB and NLB;
- Select an ALB when you need content-based routing, handling HTTP(S) traffic, advanced routing features, and integration with modern application architectures.
- Choose an NLB for high-performance, low-latency requirements, handling TCP/UDP traffic without inspecting content, and scenarios requiring static IP addresses or non-HTTP(S) protocols.

Both use sophisticated architectures to cater to different types of traffic and application requirements.

ALB vs NLB:

ALB:
- Layer 7 (Application layer)
- Handles HTTP/HTTPS traffic
- Content-based routing
- SSL termination
- Ideal for modern apps, microservices

NLB:
- Layer 4 (Transport layer)
- Manages TCP/UDP traffic
- High performance, low latency
- Static IP support
- Great for gaming, IoT, raw network traffic