1.3 Availability vs consistency Flashcards

Question

What is Partition Tolerance in distributed systems?

Answer 1

The system will continue to function when network partitions occur.

Answer 2

Networks are reliable.

Answer 3

Partitions

Answer 4

Consistency and Availability

Answer 5

Consistency/Partition Tolerance

Answer 6

Wait for a response from the partitioned node, which could result in a timeout error.

Answer 7

Choose Consistency over Availability when atomic reads and writes are needed.

Answer 8

Availability/Partition Tolerance

Answer 9

The most recent version of the data available on the reachable node, which could be stale.
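
As a rough illustration of the two behaviours described in these answers, the sketch below contrasts a read that favours consistency (it gives up with a timeout error when the peer cannot be reached) with a read that favours availability (it returns whatever the local node has, even if stale). The names `Node`, `read_cp`, and `read_ap` are hypothetical and exist only for this example.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """A single replica holding one value (hypothetical, for illustration)."""
    value: str
    version: int


def read_cp(local: Node, peer_reachable: bool) -> str:
    """Consistency-first read: refuse to answer if the peer cannot confirm."""
    if not peer_reachable:
        # Waiting on the partitioned node ends in a timeout error.
        raise TimeoutError("peer unreachable; cannot confirm latest write")
    return local.value


def read_ap(local: Node, peer_reachable: bool) -> str:
    """Availability-first read: always answer from local state."""
    # The value may be stale if the peer accepted a newer write.
    return local.value
```

With `peer_reachable=False`, `read_ap` still returns the local value, while `read_cp` raises `TimeoutError`.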

Answer 10

When business requirements allow flexibility around data synchronization.

Answer 11

When the system needs to continue to function despite external errors.

Answer 12

A software trade-off.

Answer 13

They are a fact of life and occur unexpectedly.

Answer 14

Many advantages, but also adds complexity.

Answer 15

Understanding the trade-offs available in the face of network errors.

Answer 16

Your application could fail before deployment.

Answer 17

Consistency in distributed systems refers to how data synchronization is handled when there are multiple copies of the same data.

CAP Theorem Definition:
- Consistency means every read operation receives either the most recent write or an error
- It is one of the three key components of the CAP theorem (Consistency, Availability, Partition Tolerance)
- Systems must balance these properties, as you can only guarantee two out of three

Implementation Considerations:
- Multiple data copies require synchronization strategies
- Different consistency models offer different trade-offs
- Choice of consistency model impacts system behavior and user experience
- Must consider network latency and partition scenarios
- Business requirements often dictate consistency needs

Answer 18

Weak consistency is the most relaxed consistency model.

Core Characteristics:
- Reads may or may not reflect the most recent write
- No guarantees about when data will be consistent
- Best-effort approach to data synchronization
- Fastest performance among consistency models
- Lowest consistency guarantees

Use Cases:
- Real-time communication systems (VoIP)
- Video chat applications
- Multiplayer games
- Systems where temporary data inconsistency is acceptable
- Applications where speed is more important than accuracy

Example Scenario:
- Phone call with lost reception
- When connection resumes, missed audio is not replayed
- System prioritizes real-time communication over data consistency

Implementation Example:
- Memcached uses this model
- Provides high performance
- Sacrifices data consistency for speed
- No guarantee of data synchronization timing

Answer 19

Eventual consistency provides a middle-ground approach.

Core Characteristics:
- Data will become consistent over time
- Reads will eventually reflect all completed writes
- Typically achieves consistency within milliseconds
- Data replication happens asynchronously
- Better performance than strong consistency

Implementation Details:
- Updates propagate gradually through the system
- No immediate synchronization requirement
- Systems can continue operating during network partitions
- May serve stale data temporarily
- Conflicts resolved through various mechanisms (vector clocks, etc.)

Common Applications:
- DNS systems
- Email systems
- Distributed databases
- Social media platforms
- Content delivery networks

Advantages:
- Higher availability
- Better scalability
- Lower latency
- Continues functioning during network partitions
- Good for systems that don't require immediate consistency
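
The asynchronous propagation described above can be sketched with a small in-memory simulation. This is a minimal sketch, not a real database: it resolves conflicts with a simple last-write-wins rule on a caller-supplied timestamp (a simpler stand-in for the vector clocks mentioned above), and the names `Replica` and `Cluster` are hypothetical.

```python
class Replica:
    """One copy of the data; keeps (timestamp, value) per key."""

    def __init__(self):
        self.store = {}

    def apply(self, key, timestamp, value):
        # Last-write-wins: keep the entry with the highest timestamp.
        current = self.store.get(key)
        if current is None or timestamp > current[0]:
            self.store[key] = (timestamp, value)

    def read(self, key):
        entry = self.store.get(key)
        return None if entry is None else entry[1]


class Cluster:
    """Replicas plus a queue of updates that have not been delivered yet."""

    def __init__(self, replica_count):
        self.replicas = [Replica() for _ in range(replica_count)]
        self.pending = []  # (replica_index, key, timestamp, value)

    def write(self, replica_index, key, timestamp, value):
        # The write is acknowledged after reaching a single replica.
        self.replicas[replica_index].apply(key, timestamp, value)
        for i in range(len(self.replicas)):
            if i != replica_index:
                self.pending.append((i, key, timestamp, value))

    def propagate(self):
        # Deliver queued updates; afterwards all replicas agree.
        for i, key, timestamp, value in self.pending:
            self.replicas[i].apply(key, timestamp, value)
        self.pending.clear()
```

Between `write` and `propagate`, a read from another replica returns the old value (or `None`), which is the temporary staleness the card refers to.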

Answer 20

Strong consistency is the most rigid consistency model.

Core Characteristics:
- All reads reflect the most recent write
- Data is replicated synchronously
- Provides immediate consistency across all nodes
- Highest consistency guarantees
- Most impactful on performance

Implementation Requirements:
- Synchronous replication
- Coordination between all nodes
- Consensus protocols
- Transaction management
- Conflict prevention mechanisms

Common Applications:
- File systems
- Relational databases (RDBMS)
- Banking systems
- Financial transactions
- Systems requiring ACID properties

Trade-offs:
- Highest consistency guarantees
- Lower availability during partitions
- Higher latency for operations
- More complex implementation
- Resource intensive
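
To make the availability cost of synchronous replication concrete, here is a minimal sketch in which a write is accepted only when every replica can take it. It deliberately skips consensus protocols and transactions; `SyncReplica`, `ReplicaUnavailable`, and `synchronous_write` are hypothetical names for this example.

```python
class ReplicaUnavailable(Exception):
    """Raised when a write cannot reach every replica."""


class SyncReplica:
    def __init__(self):
        self.data = {}
        self.reachable = True  # flip to False to simulate a partition


def synchronous_write(replicas, key, value):
    # Check all replicas first so a failed write changes nothing anywhere.
    if not all(r.reachable for r in replicas):
        raise ReplicaUnavailable("write rejected: a replica is unreachable")
    for r in replicas:
        r.data[key] = value
```

Any read after a successful `synchronous_write` sees the new value on every replica; the price is that one unreachable replica blocks all writes.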

Answer 21

Availability patterns focus on ensuring system uptime.

Primary Approaches:
- Fail-over: Systems switch to backup when primary fails
  - Requires redundant hardware
  - Can be active-passive or active-active
  - Requires heartbeat monitoring
  - Needs failover automation
- Replication: Data copied across multiple nodes
  - Can be synchronous or asynchronous
  - Supports different consistency models
  - Provides redundancy
  - Enables load distribution

Implementation Considerations:
- Hardware requirements
- Network configuration
- Data synchronization
- Monitoring systems
- Recovery procedures

Answer 22

Active-passive failover (also known as master-slave failover) is a high availability pattern.

Core Components:
- Active server handling all traffic
- Passive server on standby
- Heartbeat mechanism between servers
- IP address takeover capability
- Monitoring system

Operational Details:
- Normal Operation:
  - Active server handles all requests
  - Passive server maintains standby state
  - Regular heartbeat checks between servers
  - Continuous data synchronization
  - System monitoring active
- Failover Process:
  - Heartbeat interruption detected
  - Passive server activates
  - IP address migration occurs
  - Services resume on passive server
  - System alerts generated

Standby Modes:
- Hot Standby:
  - Passive server running and ready
  - Minimal startup time
  - Higher resource usage
  - Faster failover
  - More expensive
- Cold Standby:
  - Passive server inactive
  - Longer startup time
  - Lower resource usage
  - Slower failover
  - More cost-effective

Disadvantages:
- Additional hardware costs
- Complex configuration
- Potential data loss during failover
- Resource underutilization
- Higher maintenance overhead
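
The failover process above can be sketched as a small state machine driven by heartbeat timestamps. This is a simplified illustration, assuming time is passed in explicitly and that "IP address migration" is represented by changing a label; `FailoverMonitor` and its methods are hypothetical names.

```python
class FailoverMonitor:
    """Promotes the standby when the primary's heartbeat goes quiet."""

    def __init__(self, timeout_s: float):
        self.timeout_s = timeout_s
        self.last_heartbeat = 0.0
        self.active = "primary"
        self.alerts = []

    def record_heartbeat(self, now: float) -> None:
        # Called whenever the primary reports that it is alive.
        self.last_heartbeat = now

    def check(self, now: float) -> str:
        # Heartbeat interruption detected: activate the passive server.
        silent_for = now - self.last_heartbeat
        if self.active == "primary" and silent_for > self.timeout_s:
            self.active = "standby"  # stands in for IP address takeover
            self.alerts.append(f"failover after {silent_for:.1f}s of silence")
        return self.active
```

The sketch does not fail back automatically once the primary returns; in practice that step is often left to an operator.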

Answer 23

Active-active failover (also known as master-master failover) is a more complex availability pattern.

Core Characteristics:
- Multiple active servers
- Load distribution across servers
- Simultaneous traffic handling
- Synchronized data states
- No standby resources

Implementation Requirements:
- Public-Facing Systems:
  - DNS configuration for multiple IPs
  - Load balancer configuration
  - Health checking mechanisms
  - Traffic distribution rules
  - Failover procedures
- Internal Systems:
  - Application awareness of multiple servers
  - Connection management
  - Load distribution logic
  - State synchronization
  - Conflict resolution

Advantages:
- Better resource utilization
- Higher throughput capacity
- Natural load balancing
- Improved fault tolerance
- Easier maintenance procedures

Challenges:
- Complex data synchronization
- Potential consistency issues
- More sophisticated monitoring needed
- Higher implementation complexity
- Increased operational overhead
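
For the load distribution and health checking listed above, a minimal sketch is a round-robin picker that skips servers marked unhealthy. It covers traffic distribution only, not state synchronization or conflict resolution; `ActiveActivePool` is a hypothetical name for this example.

```python
class ActiveActivePool:
    """Round-robin over servers, skipping any that failed a health check."""

    def __init__(self, servers):
        self.servers = list(servers)
        self.healthy = set(self.servers)
        self._next = 0

    def mark_down(self, server):
        self.healthy.discard(server)

    def mark_up(self, server):
        if server in self.servers:
            self.healthy.add(server)

    def pick(self):
        # Try each server at most once per call, starting from the cursor.
        for _ in range(len(self.servers)):
            server = self.servers[self._next % len(self.servers)]
            self._next += 1
            if server in self.healthy:
                return server
        raise RuntimeError("no healthy servers available")
```

When one server is marked down, the remaining servers absorb its share of the traffic without any promotion step, which is the main contrast with active-passive failover.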

Answer 24

Availability measurements and calculations involve the following.

Availability Metrics:
- Three Nines (99.9%):
  - 8h 45min 57s downtime per year
  - 43m 49.7s downtime per month
  - 10m 4.8s downtime per week
  - 1m 26.4s downtime per day
  - Suitable for non-critical systems
- Four Nines (99.99%):
  - 52min 35.7s downtime per year
  - 4m 23s downtime per month
  - 1m 5s downtime per week
  - 8.6s downtime per day
  - Required for critical systems

Calculation Patterns:
- Sequential Systems:
  - Availability decreases with each component
  - Formula: A(total) = A(component1) * A(component2)
  - Example: Two 99.9% components = 99.8% total
  - More components reduce overall availability
  - Requires higher component reliability
- Parallel Systems:
  - Availability increases with redundancy
  - Formula: A(total) = 1 - (1 - A(comp1)) * (1 - A(comp2))
  - Example: Two 99.9% components = 99.9999% total
  - Better fault tolerance
  - Higher overall availability

Implementation Considerations:
- Cost vs availability requirements
- System architecture decisions
- Component reliability needs
- Monitoring and alerting thresholds
- Maintenance windows impact
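
The two formulas and the downtime figures can be checked with a few lines of Python. This is a small sketch; the function names are hypothetical, and a year is assumed to be 365.25 days, which is what reproduces the per-year figures on this card.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 60 * 60  # assumption: average year length


def sequential(*availabilities):
    """Components in series: all must be up, so availabilities multiply."""
    total = 1.0
    for a in availabilities:
        total *= a
    return total


def parallel(*availabilities):
    """Components in parallel: the system fails only if every one fails."""
    all_fail = 1.0
    for a in availabilities:
        all_fail *= 1.0 - a
    return 1.0 - all_fail


def downtime_seconds_per_year(availability):
    """Allowed downtime per year for a given availability fraction."""
    return (1.0 - availability) * SECONDS_PER_YEAR


print(f"sequential: {sequential(0.999, 0.999):.6f}")
print(f"parallel:   {parallel(0.999, 0.999):.6f}")
print(f"99.9% downtime per year: {downtime_seconds_per_year(0.999):.1f} s")
```

Two 99.9% components in sequence give 0.998001 (about 99.8%), in parallel they give 0.999999 (99.9999%), and 99.9% availability allows about 31,557.6 seconds of downtime per year, which is 8h 45min 57.6s.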

(49 cards)