Scalability and High Availability Flashcards

Question 1

Q

What is scalability?

Answer

A

The ability of an app to handle increased load without sacrificing latency.
For instance, if your app takes x seconds to respond to a single user request, it should take the same x seconds to respond to each of a million concurrent user requests.
The backend infrastructure of the app should not crumble under a load of a million concurrent requests.

Question 2

Q

What is latency?

Answer

A

The time taken to process a request and respond to it.

No matter how much the traffic load on a system builds up, the latency should not go up.

Question 3

Q

What are the two types of latency?

Answer

A

Network latency: the time taken to send a packet from point A to point B. To cut down network latency, businesses use a CDN (Content Delivery Network) and try to deploy their servers across the globe, as close to the end user as possible.
Application latency: the time taken by the application to process the user request.

Question 4

Q

How to reduce application latency?

Answer

A

Run stress and load tests on the application and scan for bottlenecks.
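
A load test can be sketched in a few lines. The example below is a minimal sketch, not a substitute for a real load-testing tool: `handle_request` is a hypothetical stand-in for the application, and the request count and concurrency level are arbitrary assumptions.

```python
import statistics
import time
from concurrent.futures import ThreadPoolExecutor


def handle_request(i: int) -> int:
    # Hypothetical handler; the sleep simulates I/O-bound work.
    time.sleep(0.001)
    return i * 2


def timed(i: int) -> float:
    # Latency of a single request in seconds.
    start = time.perf_counter()
    handle_request(i)
    return time.perf_counter() - start


def load_test(total_requests: int, concurrency: int) -> dict:
    # Fire the requests from a pool of worker threads and collect latencies.
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        latencies = list(pool.map(timed, range(total_requests)))
    cuts = statistics.quantiles(latencies, n=100)  # 99 percentile cut points
    return {
        "count": len(latencies),
        "p50": cuts[49],
        "p95": cuts[94],
        "max": max(latencies),
    }


report = load_test(total_requests=200, concurrency=20)
print(report)
```

If p95 grows much faster than p50 as `concurrency` is raised, some shared resource is likely a bottleneck worth profiling.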

Question 5

Q

What is vertical scaling?

Answer

A

Adding more power (for example CPU or RAM) to the existing resources.
It is usually the first step towards scaling up when traffic increases.
It doesn't require code refactoring or any complex configuration at the code level.
There is a limit to vertical scaling: a single machine can only be upgraded so far.

Question 6

Q

What is horizontal scaling?

Answer

A

Adding more hardware, such as server nodes, to the existing pool of hardware resources.
Unlike vertical scaling, there is no fixed upper limit to horizontal scaling.

Question 7

Q

What is cloud elasticity?

Answer

A

If the site has a heavy traffic influx, more server nodes are added; when traffic drops, the dynamically added nodes are removed.
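
The scale-out and scale-in decision described above can be sketched as a small function. This is a minimal sketch under assumptions: the proportional rule, the 60% CPU target, and the node limits are illustrative choices, not taken from any particular cloud provider.

```python
def desired_nodes(current_nodes: int, avg_cpu_pct: int,
                  target_cpu_pct: int = 60,
                  min_nodes: int = 2, max_nodes: int = 10) -> int:
    # Proportional rule: size the pool so average CPU moves toward the target.
    # Integer ceiling division avoids floating-point rounding surprises.
    wanted = -(-(current_nodes * avg_cpu_pct) // target_cpu_pct)
    # Clamp to the allowed pool size.
    return max(min_nodes, min(max_nodes, wanted))


print(desired_nodes(4, 90))   # heavy traffic: nodes are added
print(desired_nodes(4, 15))   # light traffic: nodes are removed, down to the minimum
```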

Question 8

Q

What are the points to consider when running code in a distributed environment?

Answer

A

The code needs to be stateless: no static instances (shared in-memory state) in a class.
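
The difference can be illustrated with a short sketch. The class names are hypothetical, and the plain dictionary is only a stand-in for an external store such as a database or a distributed cache.

```python
class StatefulCounter:
    # Anti-pattern in a distributed setup: the count lives in class-level
    # (static) memory, so every server node would keep its own copy.
    visits = 0

    def handle(self, user: str) -> int:
        StatefulCounter.visits += 1
        return StatefulCounter.visits


class StatelessCounter:
    # The handler keeps no state of its own; state lives in a store
    # that is shared by all nodes and passed in from outside.
    def __init__(self, store: dict):
        self.store = store

    def handle(self, user: str) -> int:
        self.store["visits"] = self.store.get("visits", 0) + 1
        return self.store["visits"]


shared_store = {}  # stand-in for an external store shared by all nodes
node_a = StatelessCounter(shared_store)
node_b = StatelessCounter(shared_store)
node_a.handle("alice")
print(node_b.handle("bob"))  # 2: both nodes see the same count
```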

Question 9

Q

Major bottlenecks for scaling an app

Answer

A

Database: the app is horizontally scaled out, but all instances communicate with one single DB. This is a bottleneck. Use database partitioning, sharding, or multiple database servers to make this layer efficient.
Application architecture: a common architectural mistake is scheduling all processes sequentially instead of using asynchronous processes and modules wherever required.
Not using caching wisely in the application.
Inefficient configuration and setup of the load balancer (LB).
Adding business logic to the DB.
Not picking the right DB: use a relational DB when transactions and strong consistency are needed; use NoSQL when HA matters more and weaker consistency is acceptable.
Bad coding standards.
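
Sharding, mentioned above as a fix for the single-DB bottleneck, can be sketched as a routing function. This is a minimal sketch: the shard names are hypothetical, and hash-modulo routing is just one simple scheme.

```python
import hashlib

SHARDS = ["db-shard-0", "db-shard-1", "db-shard-2"]  # hypothetical shard names


def shard_for(key: str, shards=SHARDS) -> str:
    # Hash the key with a stable hash so every app node routes it the same way.
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    index = int.from_bytes(digest[:8], "big") % len(shards)
    return shards[index]


print(shard_for("user-42"))
```

With plain modulo routing, changing the number of shards remaps most keys; consistent hashing is a common alternative when shards are added or removed often.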

Question 10

Q

Steps to Improve the Scalability of the app

Answer

A

Profiling (memory profiling, code profiling).
Caching: cache wisely and cache everywhere. Cache all static content. Hit the DB only when it is really required. Try to serve all read requests from the cache. Use a write-through cache (write to both the DB and the cache while updating or inserting).
CDN: using a CDN further reduces the latency of the application because the data is closer to the requesting user.
Data compression: use apt algorithms to compress the data and store it in compressed format. Compressed data consumes less bandwidth, so downloads at the client are faster.
Avoid unnecessary client-server requests.
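
The write-through cache mentioned above can be sketched as follows. This is a minimal in-memory sketch: the class name is hypothetical and the dictionary passed in only stands in for a real database.

```python
class WriteThroughCache:
    def __init__(self, database: dict):
        self.database = database  # stand-in for a real database
        self.cache = {}
        self.db_reads = 0         # counts how often reads reach the DB

    def write(self, key, value):
        # Write-through: update the DB and the cache in the same operation.
        self.database[key] = value
        self.cache[key] = value

    def read(self, key):
        # Serve from the cache when possible; hit the DB only on a miss.
        if key in self.cache:
            return self.cache[key]
        self.db_reads += 1
        value = self.database[key]
        self.cache[key] = value
        return value


store = WriteThroughCache({"a": 1})
store.read("a")        # cache miss, reads the DB
store.read("a")        # cache hit
store.write("b", 2)    # goes to both the DB and the cache
store.read("b")        # cache hit, no DB read
print(store.db_reads)  # 1
```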

Question 11

Q

Parameters to be taken into account while scalability testing:

Answer

A

CPU Usage
Network Bandwidth Consumption
Throughput: number of requests processed within a stipulated time
Latency
Memory usage of the program
End-user experience when the system is under heavy load
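
Throughput and latency can be derived from the raw measurements of a test run. The sketch below assumes the latencies were already recorded in milliseconds; the sample values and the nearest-rank percentile method are illustrative choices.

```python
def summarize(latencies_ms: list, window_seconds: float) -> dict:
    ordered = sorted(latencies_ms)
    # Nearest-rank 95th percentile, using integer ceiling of 0.95 * n.
    rank = (95 * len(ordered) + 99) // 100
    return {
        "throughput_rps": len(ordered) / window_seconds,
        "p95_ms": ordered[rank - 1],
        "max_ms": ordered[-1],
    }


# Hypothetical measurements: 10 requests completed in a 2-second window.
sample = [12, 15, 11, 40, 13, 14, 16, 12, 90, 13]
stats = summarize(sample, window_seconds=2.0)
print(stats)
```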

Question 12

Q

What is High Availability?

Answer

A

The ability of a system to stay online despite infrastructure failures in real-time.
To make systems highly available, they are made fault-tolerant using redundancy.
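
The effect of redundancy on availability can be estimated with a short calculation. This sketch assumes node failures are independent, which is a simplification, since real failures are often correlated; the 99% figure is an illustrative value.

```python
def combined_availability(node_availability: float, replicas: int) -> float:
    # Probability that at least one of `replicas` independent nodes is up.
    return 1 - (1 - node_availability) ** replicas


for n in (1, 2, 3):
    print(n, f"{combined_availability(0.99, n):.6f}")
```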

Question 13

Q

Reasons for System failures

Answer

A

Software Crashes
Hardware failures (overloaded CPU, RAM, hard disk failures, nodes going down, network outages).
Human errors: flawed configurations and similar mistakes.
Planned Downtime

Question 14

Q

What is fault tolerance?

Answer

A

The ability of a system to stay up despite taking hits.
A fault-tolerant (FT) system is equipped to handle faults.
In case of internal failures, the system may work at a reduced level, but it will not go down entirely.
Example: a social media application. In the case of backend node failures, a few services of the app, such as image upload or post likes, may stop working, but the application as a whole will still be up.
This approach is called fail soft.
Microservice architecture is a good example of this.
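
The fail-soft behavior can be sketched with a wrapper that isolates each service call. The service functions and the fallback values are hypothetical; the failure of the image upload service is simulated.

```python
def news_feed():
    return ["post-1", "post-2"]


def image_upload():
    # Simulated backend node failure.
    raise ConnectionError("image service node is down")


def call_with_fallback(service, fallback):
    # Isolate the failure of one service so the rest of the page still works.
    try:
        return {"ok": True, "data": service()}
    except Exception as exc:
        return {"ok": False, "data": fallback, "error": str(exc)}


page = {
    "feed": call_with_fallback(news_feed, []),
    "upload": call_with_fallback(image_upload, None),
}
print(page["feed"]["ok"], page["upload"]["ok"])  # True False
```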

Question 15

Q

What is redundancy?

Answer

A

Duplicating components or instances and keeping them on standby to take over in case the active instance goes down.
This approach is called Active-Passive HA Mode.
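
Active-passive failover can be sketched as follows. The `Node` and `ActivePassivePair` classes are hypothetical, and the failure is simulated by flipping a flag rather than detected by a real health check.

```python
class Node:
    def __init__(self, name: str):
        self.name = name
        self.healthy = True

    def serve(self, request: str) -> str:
        if not self.healthy:
            raise RuntimeError(f"{self.name} is down")
        return f"{self.name} handled {request}"


class ActivePassivePair:
    def __init__(self, active: Node, standby: Node):
        self.active = active
        self.standby = standby

    def serve(self, request: str) -> str:
        if not self.active.healthy and self.standby.healthy:
            # Failover: promote the standby to active.
            self.active, self.standby = self.standby, self.active
        return self.active.serve(request)


pair = ActivePassivePair(Node("primary"), Node("standby"))
first = pair.serve("req-1")
pair.active.healthy = False  # simulate the active instance going down
second = pair.serve("req-2")
print(first)
print(second)
```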

Question 16

Q

What is replication?

Answer

A

A number of similar nodes running the workload together.
There are no standby or passive instances.
When a single node or a few nodes go down, the remaining nodes bear the load of the service.
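
The way the remaining nodes absorb the load can be sketched with a round-robin picker that skips unhealthy replicas. The `ReplicaPool` class and node names are hypothetical, and health is set by hand instead of by a real health check.

```python
import itertools


class ReplicaPool:
    def __init__(self, names):
        self.health = {name: True for name in names}
        self._cycle = itertools.cycle(names)

    def pick(self) -> str:
        # Round-robin over all replicas, skipping the ones that are down.
        for _ in range(len(self.health)):
            name = next(self._cycle)
            if self.health[name]:
                return name
        raise RuntimeError("no healthy replicas")


pool = ReplicaPool(["n1", "n2", "n3"])
pool.health["n2"] = False  # simulate one node going down
picks = [pool.pick() for _ in range(4)]
print(picks)  # ['n1', 'n3', 'n1', 'n3']
```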

Question 17

Q

What is an HA cluster?

Answer

A

An HA cluster, or failover cluster, contains a set of nodes running in conjunction with each other to ensure HA (e.g., a Kafka cluster).
The nodes are connected by a private network called the heartbeat network, which continuously monitors the health and status of each node in the cluster.
A single state across all nodes in the cluster is maintained with the help of shared distributed memory and a distributed coordination service such as ZooKeeper.
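
Heartbeat-based health monitoring can be sketched as a table of last-seen timestamps. This is a minimal sketch: the class, the node names, and the 3-second timeout are assumptions, and timestamps are passed in explicitly to keep the example deterministic.

```python
class HeartbeatMonitor:
    def __init__(self, timeout_seconds: float):
        self.timeout = timeout_seconds
        self.last_seen = {}

    def beat(self, node: str, now: float) -> None:
        # Record the time of the latest heartbeat received from a node.
        self.last_seen[node] = now

    def failed_nodes(self, now: float) -> list:
        # A node is considered failed if its last heartbeat is too old.
        return sorted(node for node, seen in self.last_seen.items()
                      if now - seen > self.timeout)


monitor = HeartbeatMonitor(timeout_seconds=3.0)
monitor.beat("broker-1", now=0.0)
monitor.beat("broker-2", now=0.0)
monitor.beat("broker-1", now=4.0)     # broker-2 has gone silent
print(monitor.failed_nodes(now=5.0))  # ['broker-2']
```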

Question 18

Q

What steps help ensure HA in a cluster?

Answer

A

Disk Mirroring
Redundant network connections
Redundant electrical power
Multiple HA clusters run together in one geographical zone, ensuring minimum downtime and continual service (e.g., Kafka mirroring).

(18 cards)