Infrastructure Flashcards
What is a load balancer, and what are common load balancing algorithms?
A load balancer distributes incoming traffic across multiple servers so no single server becomes a bottleneck. Common algorithms (sketched after this list):
- Round Robin: Requests are distributed sequentially.
- Least Connections: Routes to the server with the fewest active connections.
- IP Hash: Routes based on a hash of the client IP, so the same client consistently reaches the same server.
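A minimal Python sketch of all three (the server list is hypothetical, and a real balancer would update connection counts as connections open and close):

```python
import hashlib
from itertools import cycle

servers = ["10.0.0.1", "10.0.0.2", "10.0.0.3"]  # hypothetical backend pool

# Round Robin: cycle through the servers in a fixed, repeating order.
rr = cycle(servers)
def round_robin() -> str:
    return next(rr)

# Least Connections: pick the server with the fewest active connections.
active = {s: 0 for s in servers}  # a real balancer updates these counts live
def least_connections() -> str:
    return min(active, key=active.get)

# IP Hash: a stable hash of the client IP pins each client to one server.
def ip_hash(client_ip: str) -> str:
    digest = int(hashlib.md5(client_ip.encode()).hexdigest(), 16)
    return servers[digest % len(servers)]

print(round_robin(), least_connections(), ip_hash("203.0.113.7"))
```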
What is the CAP theorem, and what are its three components?
The CAP theorem states that a distributed system can provide at most two of three guarantees at once; because network partitions cannot be ruled out in practice, the real trade-off is between consistency and availability:
- Consistency: Every read sees the latest write.
- Availability: Every request receives a response (even if the data is stale).
- Partition Tolerance: The system keeps working despite network partitions.
How do horizontal and vertical scaling differ in system design?
Horizontal Scaling (scaling out): Adding more servers to the system.
Vertical Scaling (scaling up): Adding more resources (CPU, RAM) to existing servers.
What is database sharding, and when is it used?
Sharding splits a database into smaller, independent pieces (shards), each holding a subset of the data. It is used when a single node can no longer handle the data volume, write throughput, or storage growth.
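As an illustration, a naive hash-modulo shard router in Python (the shard count and key format are assumed for the example):

```python
import hashlib

NUM_SHARDS = 4  # assumed shard count for illustration

def shard_for(key: str) -> int:
    """Map a key to a shard with a stable hash (naive modulo scheme)."""
    digest = int(hashlib.sha1(key.encode()).hexdigest(), 16)
    return digest % NUM_SHARDS

# The same key always routes to the same shard's connection pool.
print(shard_for("user-42"))
```

Note that modulo sharding remaps most keys whenever the shard count changes; consistent hashing is the usual remedy.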
Why is caching used, and what are common caching strategies?
Caching stores frequently accessed data in a faster storage layer. Strategies:
- Write-Through: Writes update the cache and the database simultaneously (consistent, but slower writes).
- Write-Back: Writes update the cache first and the database asynchronously later (fast writes, but risk of loss on failure).
- Least Recently Used (LRU): An eviction policy rather than a write strategy; when the cache is full, it discards the item accessed least recently (sketched after this list).
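A minimal LRU cache sketch in Python, using OrderedDict to track recency:

```python
from collections import OrderedDict

class LRUCache:
    """Minimal LRU cache: evicts the least recently used entry when full."""
    def __init__(self, capacity: int):
        self.capacity = capacity
        self.data = OrderedDict()

    def get(self, key):
        if key not in self.data:
            return None
        self.data.move_to_end(key)  # mark as most recently used
        return self.data[key]

    def put(self, key, value):
        if key in self.data:
            self.data.move_to_end(key)
        self.data[key] = value
        if len(self.data) > self.capacity:
            self.data.popitem(last=False)  # evict the least recently used item
```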
How does a CDN improve system performance?
A CDN caches static assets (e.g., images, scripts) on edge servers geographically closer to users, reducing latency, bandwidth costs, and load on the origin servers.
What are the key challenges of distributed systems?
Challenges include:
- Data consistency: Keeping data synchronized across nodes.
- Fault tolerance: Handling node failures.
- Latency: Minimizing delays in communication.
- Complexity: Operating and coordinating many interacting services and their supporting infrastructure.
What is microservices architecture, and what are its benefits?
Microservices split applications into independent services, each responsible for a specific domain. Benefits:
- Scalability: Scale individual services independently.
- Resilience: A fault in one service can be isolated so it does not cascade to the others.
What is an API gateway, and why is it used?
An API gateway acts as a single entry point for client requests, handling authentication, routing, and rate limiting. Example: AWS API Gateway.
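A toy sketch of what a gateway does at the edge (the routes, backend URLs, and API-key check are all hypothetical):

```python
# Toy gateway: authenticate at the edge, then route by path prefix.
ROUTES = {
    "/users": "http://user-service:8080",    # hypothetical backends
    "/orders": "http://order-service:8080",
}

def handle(path: str, api_key: str) -> str:
    if api_key != "expected-key":        # authentication before routing
        return "401 Unauthorized"
    for prefix, backend in ROUTES.items():
        if path.startswith(prefix):
            return f"forward to {backend}{path}"
    return "404 Not Found"

print(handle("/orders/17", "expected-key"))
```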
What is rate limiting, and how can it be implemented?
Rate limiting controls how many requests a client can make in a given time window. Two common implementations (the token bucket is sketched after this list):
- Token Bucket Algorithm: Tokens are added to a bucket at a fixed rate. Each request consumes a token. If the bucket is empty, the request is rejected or delayed. Allows bursts up to the bucket’s capacity.
- Leaky Bucket Algorithm: Requests enter a queue (bucket) and are processed at a fixed rate. Excess requests overflow and are dropped. Ensures a steady outflow rate, smoothing traffic spikes.
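A minimal token bucket sketch in Python (the rate and capacity values are illustrative):

```python
import time

class TokenBucket:
    """Token bucket: refill at a fixed rate, allow bursts up to capacity."""
    def __init__(self, rate: float, capacity: float):
        self.rate = rate            # tokens added per second
        self.capacity = capacity    # maximum burst size
        self.tokens = capacity
        self.last = time.monotonic()

    def allow(self) -> bool:
        now = time.monotonic()
        # Refill in proportion to elapsed time, capped at capacity.
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False

bucket = TokenBucket(rate=5, capacity=10)  # 5 req/s, bursts up to 10
if not bucket.allow():
    print("429 Too Many Requests")
```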
What is data partitioning, and how is it different from sharding?
Partitioning is the general practice of dividing data into independent subsets, by range, hash, or list, within one database or across machines; sharding is a specific form of horizontal partitioning that places those subsets on separate database instances.
What is eventual consistency in distributed systems?
Eventual consistency guarantees that, in the absence of new writes, all replicas in a distributed system converge to the same value over time. It is common in systems that prioritize availability over strict consistency (e.g., DynamoDB).
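As a toy illustration, a last-write-wins read repair, one simple way replicas can converge (the values and timestamps are made up):

```python
# Toy last-write-wins reconciliation: replicas converge to the newest write.
replicas = [
    {"value": "A", "timestamp": 100},
    {"value": "B", "timestamp": 105},  # the newest write wins on read repair
    {"value": "A", "timestamp": 100},
]

def reconcile(replicas: list) -> str:
    return max(replicas, key=lambda r: r["timestamp"])["value"]

print(reconcile(replicas))  # -> "B"; all replicas are then repaired to "B"
```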
What is leader election in distributed systems, and why is it needed?
Leader election designates one node as the leader to coordinate tasks such as committing writes or assigning work. It is needed so that exactly one node makes authoritative decisions, preventing conflicting updates; consensus protocols (Raft, Paxos) and coordination services (ZooKeeper) implement it.
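A toy "lowest ID wins" election over a known membership list; real systems (Raft, Paxos, ZooKeeper) add terms, heartbeats, and quorum voting:

```python
# Toy election: among reachable nodes, the one with the lowest ID leads.
nodes = {1: "alive", 2: "alive", 3: "down", 4: "alive"}  # hypothetical cluster

def elect_leader(nodes: dict) -> int:
    alive = [nid for nid, state in nodes.items() if state == "alive"]
    return min(alive)

print(elect_leader(nodes))  # -> 1
```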
What is a proxy server, and what are its types?
A proxy server acts as an intermediary between a client and server. Types:
- Forward Proxy: Sits in front of clients and forwards their requests outward (used for anonymity, filtering, and caching).
- Reverse Proxy: Sits in front of servers and accepts requests on their behalf (e.g., Nginx; used for load balancing, TLS termination, and caching).
What is a message queue, and why is it used?
A message queue allows asynchronous communication between services, decoupling producers and consumers. Examples: Kafka, RabbitMQ.
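A minimal in-process sketch using Python's queue module as a stand-in for a broker like Kafka or RabbitMQ; the producer publishes without waiting for the consumer:

```python
import queue
import threading

q = queue.Queue()  # in-process stand-in for a message broker

def producer():
    for i in range(3):
        q.put(f"order-{i}")  # publish and move on; no waiting for consumers

def consumer():
    while True:
        msg = q.get()          # blocks until a message arrives
        print(f"processing {msg}")
        q.task_done()

threading.Thread(target=consumer, daemon=True).start()
producer()
q.join()  # wait until every published message has been processed
```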