L5 - Autoscaling 2/2 Flashcards

Question

Server-side load balancing

Answer 1

The load balancer (LB) receives requests on a given port and distributes them across the backend servers.

Answer 2

Session persistence (also called stickiness) ensures that a client stays connected to the same server throughout a session or period of time. By default, load balancing may send a user to a different server on each connection, which can slow down complicated or repeated requests. Session persistence instead creates a "sticky session" between a user and a particular server: the load balancer uses logic to establish an affinity between a specific server and a client for the length of the entire session, defined by the amount of time a unique IP address stays on the site.

Answer 3

A load balancer creates sticky sessions by either tracking a user’s IP details or using a cookie to assign that user an identifying attribute. This allows the load balancer to use the tracking ID to route all of that user’s requests to a specific server throughout the session.
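The two mechanisms described above can be sketched as follows. This is a minimal illustration, not how any particular load balancer implements stickiness: the server names, the `pick_by_ip` function, and the `CookieStickyBalancer` class are all hypothetical.

```python
import hashlib

SERVERS = ["server-a", "server-b", "server-c"]  # hypothetical backend names


def pick_by_ip(client_ip: str) -> str:
    """IP-based affinity: a stable hash maps the same IP to the same server."""
    digest = hashlib.sha256(client_ip.encode()).digest()
    return SERVERS[int.from_bytes(digest[:4], "big") % len(SERVERS)]


class CookieStickyBalancer:
    """Cookie-based affinity: remember which server each session ID was given."""

    def __init__(self, servers):
        self.servers = list(servers)
        self.sessions = {}   # session ID (the cookie value) -> assigned server
        self.next_index = 0  # used to spread new sessions over the servers

    def route(self, session_id: str) -> str:
        if session_id not in self.sessions:
            # First request of this session: assign the next server in turn.
            server = self.servers[self.next_index % len(self.servers)]
            self.sessions[session_id] = server
            self.next_index += 1
        # Every later request with the same ID goes to the same server.
        return self.sessions[session_id]
```

In both variants the tracking ID (the IP address or the cookie value) is the only thing the balancer needs to look at to keep a user on one server.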

Answer 4

Class-aware, content-aware, client-aware

Answer 5

Based on classifying requests into the classes sensitive, best-effort, and undesired. The classification can use, for example, port numbers (e.g. port 1 is sensitive).
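A minimal sketch of class-aware balancing, assuming a port-to-class table. Only "port 1 is sensitive" comes from the card; the other port, the pool names, and the policy of dropping undesired traffic are assumptions made for illustration.

```python
import itertools

# Hypothetical port -> class table; only the port 1 entry comes from the card.
PORT_CLASSES = {1: "sensitive", 2: "best-effort"}

# Hypothetical server pools per class, each cycled in round-robin order.
POOLS = {
    "sensitive": itertools.cycle(["fast-1", "fast-2"]),
    "best-effort": itertools.cycle(["shared-1"]),
}


def classify(port: int) -> str:
    """Unknown ports fall into the 'undesired' class."""
    return PORT_CLASSES.get(port, "undesired")


def route(port: int):
    """Return a server for the request, or None if the class is not served."""
    pool = POOLS.get(classify(port))
    if pool is None:
        return None  # assumption: undesired traffic is simply dropped
    return next(pool)
```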

Answer 6

Based on request content, e.g. the URL. For example, directing similar requests to the same server exploits access to the same information.
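As a rough illustration of content-aware routing, the sketch below picks a server from the URL path so that similar requests land on the same machine. The prefixes and server names are hypothetical.

```python
from urllib.parse import urlparse

# Hypothetical routing table: path prefix -> server that holds that content.
ROUTES = [
    ("/images/", "image-server"),
    ("/api/", "api-server"),
]
DEFAULT_SERVER = "web-server"


def route(url: str) -> str:
    """Send requests with the same path prefix to the same server."""
    path = urlparse(url).path
    for prefix, server in ROUTES:
        if path.startswith(prefix):
            return server
    return DEFAULT_SERVER
```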

Answer 7

Based on the packet source (the client). As with content-aware balancing, this can improve performance.

Answer 8

- Round Robin (RR) and Weighted Round Robin
- Least connection and weighted least connection
- Resource based
- Weighted response time

Answer 9

RR is very simple: it allocates requests to servers sequentially, in circular order, without defining any priority. This gives fast responses when the workload is distributed similarly across requests. However, requests differ in how long they occupy a server, so some nodes may become heavily loaded while others remain under-utilized; in that case, use smarter load balancing. In Weighted Round Robin, each server's weight represents its capability.
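Both variants can be sketched in a few lines. The weighted version here uses the simplest possible approach (repeating each server according to its weight), which is an assumption for illustration; real balancers often interleave the servers more smoothly.

```python
import itertools


def round_robin(servers):
    """Yield servers in circular order, with no priority."""
    return itertools.cycle(servers)


def weighted_round_robin(weights):
    """Yield servers in circular order, each repeated `weight` times per cycle.

    `weights` maps server name -> positive integer capability weight.
    """
    expanded = [server for server, w in weights.items() for _ in range(w)]
    return itertools.cycle(expanded)


rr = round_robin(["a", "b", "c"])
print([next(rr) for _ in range(4)])   # ['a', 'b', 'c', 'a']

wrr = weighted_round_robin({"big": 2, "small": 1})
print([next(wrr) for _ in range(6)])  # ['big', 'big', 'small', 'big', 'big', 'small']
```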

Answer 10

Distributes each request to the server with the fewest active connections: the load balancer checks which servers have the fewest connections open at the time and sends traffic to those servers. This assumes all connections require roughly equal processing power.
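A minimal sketch of least-connection balancing, assuming the balancer itself counts open connections per server. The class and method names are hypothetical.

```python
class LeastConnectionBalancer:
    """Track open connections per server and pick the least loaded one."""

    def __init__(self, servers):
        self.active = {server: 0 for server in servers}

    def acquire(self) -> str:
        """Open a connection on the server with the fewest active connections."""
        # On a tie, min() returns the first such server in insertion order.
        server = min(self.active, key=self.active.get)
        self.active[server] += 1
        return server

    def release(self, server: str) -> None:
        """Close a connection on the given server."""
        self.active[server] -= 1
```

A weighted variant could compare `active[server] / weight[server]` instead of the raw count, so that more capable servers are allowed more connections.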

Answer 11

The CPU load of the servers is taken into account: load is distributed based on the resources each server has available at the time. Specialized software (called an "agent") running on each server measures that server's available CPU and memory, and the load balancer queries the agent before distributing traffic to that server.
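The selection step might look like the sketch below. The report format and the scoring rule (rank each server by its scarcest resource) are assumptions; the card only says that the agent reports available CPU and memory.

```python
def pick_server(reports):
    """Pick the server whose scarcest resource has the most headroom.

    `reports` maps server name -> {"cpu_free": 0..1, "mem_free": 0..1},
    a hypothetical format for what each agent returns when queried.
    """
    def headroom(server):
        report = reports[server]
        return min(report["cpu_free"], report["mem_free"])

    return max(reports, key=headroom)


agent_reports = {
    "s1": {"cpu_free": 0.9, "mem_free": 0.1},  # plenty of CPU, almost no memory
    "s2": {"cpu_free": 0.5, "mem_free": 0.6},
}
print(pick_server(agent_reports))  # s2
```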

Answer 12

The response time of each server to a health check is used to compute the weights.
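One plausible way to turn health-check response times into weights is to make each weight inversely proportional to the measured time. The exact formula is an assumption; the card does not specify one.

```python
def weights_from_response_times(times_ms):
    """Return normalized weights that are inversely proportional to response time.

    `times_ms` maps server name -> health-check response time in milliseconds
    (must be positive). Faster servers receive larger weights; weights sum to 1.
    """
    inverse = {server: 1.0 / t for server, t in times_ms.items()}
    total = sum(inverse.values())
    return {server: value / total for server, value in inverse.items()}
```

For example, a server answering in 10 ms gets three times the weight of one answering in 30 ms, i.e. weights of 0.75 and 0.25.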

Answer 13

- Distributes incoming traffic across the instances in the Auto Scaling Group
- Can use load balancer metrics (request counts per target) for auto scaling
- Can be used for health checks (the elastic load balancer sends health check messages to instances to find out whether they are active)

Answer 14

Distributes requests evenly across availability zones or evenly across all registered instances in the target group.

Answer 15

Routes HTTP requests to specific target groups based on their content.

Answer 16

Forwards TCP packets for a certain port to a target group.

Answer 17

- Forwards ingress traffic to network appliances, such as intrusion detection or monitoring
- Forwards response traffic from the network appliances to the target after inspection

Answer 18

HTTP is a protocol for fetching resources such as HTML documents. It is used for exchanging data on the Web and is a client-server protocol, which means requests are initiated by the client, usually the Web browser. HTTP defines how this data is formatted so that it can be read and processed once it arrives. When you type a URL into your web browser, you are sending an HTTP request to a web server. That server then responds, again using the HTTP format.
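To make the request and response format concrete, the sketch below builds the text of a simple HTTP/1.1 GET request and reads the status line of a response. The helper names are hypothetical, and no network connection is made.

```python
def build_get_request(host: str, path: str = "/") -> str:
    """Return the text a client sends to fetch `path` from `host`."""
    return (
        f"GET {path} HTTP/1.1\r\n"   # request line: method, path, version
        f"Host: {host}\r\n"          # required header in HTTP/1.1
        "Connection: close\r\n"
        "\r\n"                       # blank line ends the headers
    )


def parse_status_line(response_text: str):
    """Return (version, status code, reason) from the first response line."""
    status_line = response_text.split("\r\n", 1)[0]
    version, code, reason = status_line.split(" ", 2)
    return version, int(code), reason


response = "HTTP/1.1 200 OK\r\nContent-Type: text/html\r\n\r\n<html></html>"
print(parse_status_line(response))  # ('HTTP/1.1', 200, 'OK')
```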

(42 cards)