Load Balancing and Autoscaling Flashcards
Cloud Load Balancing is
a fully distributed, software-defined managed service that lets you distribute load-balanced compute resources in a single region or multiple regions to meet your high-availability requirements, put your resources behind a single anycast IP address, and scale your resources up or down with intelligent autoscaling.
different types of load balancers
GCP offers different types of load balancers that can be divided into two categories, global and regional.
The global load balancers are the HTTP, HTTPS, SSL proxy and TCP proxy load balancers.
The regional load balancers are the internal and network load balancers:
internal TCP/UDP, network TCP/UDP, internal HTTP/HTTPS
What is a managed instance group?
A managed instance group is a collection of identical VM instances that you control as a single entity using an instance template.
You can easily update all the instances in a group by specifying a new template in a rolling update.
Also, when your applications require additional compute resources, managed instance groups can automatically scale the number of instances in the group.
what they are used for
Managed instance groups can work with load balancing services to distribute network traffic to all of the instances in the group.
Managed instance groups can automatically identify and recreate unhealthy instances in a group to ensure that all instances are running optimally.
How can a managed instance group identify unhealthy instances?
using a health check
Which autoscaling policies are applicable?
Applicable autoscaling policies include scaling based on:
1. CPU utilization
2. load-balancing capacity
3. monitoring metrics, or a queue-based workload such as Cloud Pub/Sub
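To make the CPU utilization policy concrete, here is a minimal Python sketch of target-based sizing. It is a simplification and not the actual Cloud autoscaler algorithm: the function name `recommended_size`, the integer-percent inputs, and the `min_size`/`max_size` defaults are all assumptions made for illustration.

```python
def recommended_size(current_size: int, observed_pct: int, target_pct: int,
                     min_size: int = 1, max_size: int = 10) -> int:
    """Return a group size that brings average utilization to or below the target.

    Hypothetical helper: utilization values are whole percentages (0-100).
    """
    if target_pct <= 0:
        raise ValueError("target_pct must be positive")
    # Total work, measured in "percent of one instance".
    total_load = current_size * observed_pct
    # Integer ceiling division: smallest count with average load <= target.
    needed = (total_load + target_pct - 1) // target_pct
    # Clamp to the configured bounds of the group.
    return max(min_size, min(max_size, needed))


print(recommended_size(4, 90, 60))  # prints 6
```

With 4 instances at 90% and a 60% target, the sketch asks for 6 instances. With 4 instances at 30%, it shrinks the group to 2.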
Which metrics can you monitor on your instance group?
CPU utilization over the past hour, but you can change the time frame and visualize other metrics like disk and network usage.
You can even set up alerts through several notification channels.
how to define a health check
You define a protocol, port and health criteria
The health criteria define:
- how often to check whether an instance is healthy (the check interval)
- how long to wait for a response (the timeout)
- how many successful attempts are decisive (the healthy threshold)
- how many failed attempts are decisive (the unhealthy threshold)
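The way the two thresholds interact can be sketched in Python. This is an illustrative model only: the class names `HealthCheckConfig` and `HealthTracker` are hypothetical, and the sketch assumes the thresholds count consecutive probe results. The check interval and timeout are stored for completeness but would only matter to whatever code actually sends the probes.

```python
from dataclasses import dataclass


@dataclass
class HealthCheckConfig:
    check_interval_sec: int = 5   # how often to probe (not used by the tracker)
    timeout_sec: int = 5          # how long to wait for a reply (not used here)
    healthy_threshold: int = 2    # consecutive successes needed to become healthy
    unhealthy_threshold: int = 2  # consecutive failures needed to become unhealthy


class HealthTracker:
    """Tracks one instance's state from a stream of probe results."""

    def __init__(self, config: HealthCheckConfig, initially_healthy: bool = True):
        self.config = config
        self.healthy = initially_healthy
        self._streak = 0  # consecutive probes that contradict the current state

    def record(self, probe_succeeded: bool) -> bool:
        if probe_succeeded == self.healthy:
            # Result agrees with the current state, so any streak is broken.
            self._streak = 0
            return self.healthy
        self._streak += 1
        threshold = (self.config.unhealthy_threshold if self.healthy
                     else self.config.healthy_threshold)
        if self._streak >= threshold:
            # Enough contradicting probes in a row: flip the state.
            self.healthy = not self.healthy
            self._streak = 0
        return self.healthy


tracker = HealthTracker(HealthCheckConfig(healthy_threshold=2, unhealthy_threshold=3))
states = [tracker.record(ok) for ok in [False, False, False, True, True]]
print(states)  # [True, True, False, False, True]
```

In the example, the instance is marked unhealthy only after the third failed probe in a row, and healthy again only after two successes in a row.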
about HTTPS load balancing
HTTPS load balancing acts at layer 7 of the OSI model.
This is the application layer which deals with the actual content of each message allowing for routing decisions based on the URL.
Main features of GCP's HTTPS load balancing
- provides global load balancing for HTTPS requests (applications are available at a single Anycast IP address)
- balances HTTP and HTTPS traffic across multiple regions.
- HTTP requests are load balanced on port 80 or 8080 and HTTPS requests are load balanced on port 443.
- supports both IPv4 and IPv6 clients
- is scalable
- requires no prewarming
- You can configure URL maps that route some URLs to one set of instances and route other URLs to other instances.
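The URL map idea can be illustrated with a small Python sketch. It is a simplified stand-in, not the real URL map schema (which also involves host rules and path matchers): the `pick_backend` function, the longest-prefix rule, and the backend names are all hypothetical.

```python
# Hypothetical mapping from URL path prefixes to backend names.
URL_MAP = {
    "/video/": "video-backend",
    "/video/hd/": "video-hd-backend",
    "/static/": "static-backend",
}


def pick_backend(path: str, url_map: dict, default_backend: str) -> str:
    """Return the backend for the longest prefix matching the path."""
    best_prefix = None
    for prefix in url_map:
        if path.startswith(prefix):
            # Prefer the most specific (longest) matching prefix.
            if best_prefix is None or len(prefix) > len(best_prefix):
                best_prefix = prefix
    if best_prefix is None:
        # No rule matched, so fall back to the default backend.
        return default_backend
    return url_map[best_prefix]


print(pick_backend("/video/hd/clip.mp4", URL_MAP, "web-backend"))  # video-hd-backend
print(pick_backend("/index.html", URL_MAP, "web-backend"))         # web-backend
```

Requests under `/video/hd/` go to one set of instances, other `/video/` requests to another, and anything unmatched to the default.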
What are global load balancers
HTTP, HTTPS, SSL proxy, TCP proxy
These load balancers leverage the Google Front Ends, which are software-defined, distributed systems that sit in Google's points of presence and are distributed globally.
What are regional load balancers
The regional load balancers are the internal and network load balancers, and they distribute traffic to instances that are in a single GCP region.
What types of instance groups can you create?
managed stateless instance group
managed stateful instance group
unmanaged instance group
cross-region load balancing
If there are no healthy instances with available capacity in a given region, the load balancer instead sends the request to the next closest region with available capacity.
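The fallback behaviour described above can be sketched in Python as follows. This is a toy model under stated assumptions: the `Region` fields, the use of a distance number as a stand-in for "closest", and the `choose_region` helper are hypothetical and do not reflect how the load balancer measures proximity or capacity internally.

```python
from typing import List, NamedTuple, Optional


class Region(NamedTuple):
    name: str
    distance_km: float      # hypothetical proximity of the region to the client
    healthy_instances: int  # instances currently passing health checks
    spare_capacity: int     # additional requests the region can still accept


def choose_region(regions: List[Region]) -> Optional[str]:
    """Pick the closest region that has healthy instances and spare capacity."""
    candidates = [r for r in regions
                  if r.healthy_instances > 0 and r.spare_capacity > 0]
    if not candidates:
        return None  # nothing can serve the request
    return min(candidates, key=lambda r: r.distance_km).name


regions = [
    Region("us-central1", 100.0, healthy_instances=0, spare_capacity=50),
    Region("us-east1", 900.0, healthy_instances=3, spare_capacity=20),
    Region("europe-west1", 7000.0, healthy_instances=5, spare_capacity=80),
]
print(choose_region(regions))  # us-east1
```

The closest region has no healthy instances, so the request goes to the next closest region that can take it.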
HTTPS load balancers by balancing strategy?
- location- and capacity-based load balancer
- content-based load balancer