Replication Flashcards

Question

3 Main Issues with Replication Lag

Answer 1

1. If a user views data shortly after writing it, the new data may not yet have reached the replica. (Reading your own writes)
2. A user can see things moving backward in time when successive reads are served by different replicas. (Monotonic reads)
3. There is no global ordering of writes, so users may see some parts of the DB in an older state and some in a newer state. (Consistent-prefix reads)

Answer 2

It provides read-after-write consistency: a guarantee that if the user reloads the page, they will always see any updates they committed themselves.

Answer 3

- Read anything the user may have modified from the leader (Ex: the user's own profile is read from the leader; all other profiles are read from replicas).
- The client can remember the timestamp of its most recent write, and the system can ensure that the replica serving any reads for that user reflects updates at least up to that timestamp.
- With multiple DCs, reads that must be served by the leader have to be routed to the DC that contains the leader, which adds complexity. (Requests from the same user on different devices also need to be handled.)
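The timestamp technique in the second bullet can be sketched roughly as follows. This is only an illustration: the `Replica` record, its `applied_up_to` field, and the `choose_replica` helper are hypothetical names, not part of any real database API, and the timestamp is assumed to be a logical position such as a log sequence number.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Replica:
    name: str
    applied_up_to: int  # hypothetical: logical timestamp of the last write this replica applied


def choose_replica(replicas: List[Replica], last_write_ts: int) -> Optional[Replica]:
    """Return a replica that already reflects the user's latest write.

    Returns None when every replica is too stale; the caller would then
    fall back to the leader (or wait for a replica to catch up).
    """
    for replica in replicas:
        if replica.applied_up_to >= last_write_ts:
            return replica
    return None


followers = [
    Replica("follower-a", applied_up_to=95),
    Replica("follower-b", applied_up_to=120),
]
fresh = choose_replica(followers, last_write_ts=100)  # follower-b is caught up
stale = choose_replica(followers, last_write_ts=130)  # nobody is caught up yet
```

Here `fresh` is `follower-b` and `stale` is `None`, which signals that the read should go to the leader.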

Answer 4

A guarantee weaker than strong consistency but stronger than eventual consistency: when a user reads data they may see an old value, but if they make several reads in sequence they won't see time go backward (that is, they won't read data older than what a previous read returned).

Answer 5

By making sure that each user always makes their reads from the same replica. Ex: The replica can be chosen based on a hash of the user ID rather than randomly.
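A minimal sketch of hash-based replica selection, assuming replicas are identified by plain strings and that `replica_for_user` is a hypothetical helper in the application's routing layer. A stable hash (SHA-256 here) is used because Python's built-in `hash()` for strings can differ between processes.

```python
import hashlib
from typing import List


def replica_for_user(user_id: str, replicas: List[str]) -> str:
    """Map a user ID to one replica so the same user always reads from the same node."""
    digest = hashlib.sha256(user_id.encode("utf-8")).digest()
    # Interpret the first 8 bytes as an integer and reduce it to a list index.
    index = int.from_bytes(digest[:8], "big") % len(replicas)
    return replicas[index]


replicas = ["replica-0", "replica-1", "replica-2"]
first_read = replica_for_user("user-42", replicas)
second_read = replica_for_user("user-42", replicas)  # same replica as first_read
```

If the chosen replica fails, the user's reads have to be rerouted to another replica, so this sketch would need a fallback in practice.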

Answer 6

A guarantee that says if a sequence of writes happens in a certain order, then anyone reading those writes will see them appear in the same order. (Particularly important in Partitioned/Sharded DBs).

Answer 7

Master-Master or Active-Active replication. This is because each leader simultaneously acts as a follower to the other leaders.

Answer 8

1. Reading your own writes / read-after-write consistency 2. Monotonic Reads 3. Consistent Prefix Reads

Answer 9

When you are only using a single datacenter, because the benefits rarely outweigh the added complexity.

Answer 10

1. Multi-datacenter operation 2. Clients with offline operation 3. Collaborative Editing

Answer 11

In a single-leader configuration every write must go over the internet to the DC with the leader. In a multi-leader configuration every write can be processed in the local DC and is replicated asynchronously to the other datacenters, which hides the inter-datacenter network delay from the user (the perceived performance may be better).

Answer 12

In a single-leader configuration, if the DC with the leader fails, failover promotes a new leader in another DC. In a multi-leader configuration each DC can continue operating independently of the others, and replication catches up once the failed DC is back online.

Answer 13

A single-leader configuration is sensitive to problems in the inter-datacenter link because writes are made synchronously over this link. A multi-leader configuration with asynchronous replication can usually tolerate network problems better.

Answer 14

Deciding how to handle write conflicts between two leaders.

Answer 15

In single-leader replication the second writer will either block and wait, or the second write transaction is aborted, forcing the user to retry the write. In multi-leader replication both writes are successful and the conflict is detected asynchronously at some later point.

Answer 16

No, by doing this you lose the main advantage of multi-leader replication: allowing each replica to accept writes independently. For synchronous conflict detection you might as well use single-leader replication.

Answer 17

Conflict avoidance. Since most multi-leader implementations handle write conflicts poorly, it's best to avoid them. Ex: Ensure that requests from a particular user are always routed to the same DC and use the leader in that DC for reading and writing. From the user's POV the configuration is essentially single-leader.

Answer 18

1. On write: when the DB detects a conflict in the log of replicated changes, it calls a conflict handler to pick the winning value (Ex: give each write a UUID, timestamp, or hash value and pick the highest one).
2. On read: when a conflict is detected, all the conflicting writes are stored. The next time the data is read, these values are returned to the application, and the user can be prompted to select the appropriate value.

Answer 19

Circular, Star, and All-to-All

Answer 20

If one node fails, it can interrupt the flow of replication messages between other nodes, leaving them unable to communicate until the node is fixed. (There is a single point of failure.)

Answer 21

There can be a problem of **causality** where some nodes receive updates in a different order than others. This could be due to a difference in speed between network links.

Answer 22

It doesn't, because failover does not exist. As long as the minimum number of replicas respond with an OK, the client's write is successful. When a failed node comes back online it may hold stale values, so the client reads from several nodes in parallel, receives multiple responses, and uses version numbers to determine which value is the newest.

Answer 23

Two mechanisms are often used: 1. Read Repair 2. Anti-Entropy Process

Answer 24

When a client makes a read from several nodes in parallel, it can detect any stale responses and write the newer value back to the stale replicas. This approach works well for values that are frequently read.
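Read repair can be illustrated with a small in-memory sketch. It assumes each replica is a plain dictionary mapping a key to a `(version, value)` pair and that at least one replica holds the key; `read_with_repair` is a hypothetical helper, not the API of any particular datastore.

```python
from typing import Dict, Tuple

Versioned = Tuple[int, str]  # (version number, value)


def read_with_repair(replicas: Dict[str, Dict[str, Versioned]], key: str) -> str:
    """Read a key from all replicas, return the newest value, and repair stale copies."""
    responses = [store[key] for store in replicas.values() if key in store]
    newest = max(responses, key=lambda pair: pair[0])
    for store in replicas.values():
        # A missing key is treated as older than any real version.
        current_version = store.get(key, (-1, ""))[0]
        if current_version < newest[0]:
            store[key] = newest  # write the newer value back to the stale replica
    return newest[1]


cluster = {
    "replica-1": {"user:1": (7, "new")},
    "replica-2": {"user:1": (7, "new")},
    "replica-3": {"user:1": (6, "old")},  # missed the latest write
}
value = read_with_repair(cluster, "user:1")
```

After the call, `replica-3` holds version 7 as well. A key that is never read would never be repaired this way.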

Answer 25

Some datastores have a background process that constantly looks for differences in the data between replicas and copies any missing data from one replica to another.

Answer 26

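If there are *n* replicas, then every write must be confirmed by *w* nodes and we must query *r* nodes for every read. As long as w + r > n, we expect reads to return an up-to-date value, because the set of nodes written to and the set of nodes read from must overlap in at least one node.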
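The overlap argument can be checked by brute force for small clusters. The sketch below enumerates every possible set of w acknowledging nodes and every possible set of r queried nodes; `quorums_always_overlap` is a hypothetical helper written for this illustration.

```python
from itertools import combinations


def quorums_always_overlap(n: int, w: int, r: int) -> bool:
    """Return True if every w-node write set shares a node with every r-node read set."""
    nodes = range(n)
    return all(
        set(write_set) & set(read_set)
        for write_set in combinations(nodes, w)
        for read_set in combinations(nodes, r)
    )


overlap_ok = quorums_always_overlap(n=3, w=2, r=2)   # w + r > n
overlap_bad = quorums_always_overlap(n=3, w=1, r=2)  # w + r == n
```

With n = 3, w = 2, r = 2 every pair of sets overlaps. With w = 1, r = 2 a write to node 0 and a read from nodes 1 and 2 miss each other.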

Answer 27

Typically n is an odd number and w = r = (n + 1) / 2. A workload with few writes and many reads may benefit from setting w = n and r = 1. (This can be a disadvantage, though, since a single failed node causes all writes to fail.)
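The trade-off between these settings comes down to simple arithmetic: writes keep succeeding while at least w nodes are reachable, and reads while at least r nodes are reachable. The helper names below are hypothetical and exist only for this illustration.

```python
def write_failures_tolerated(n: int, w: int) -> int:
    """Number of nodes that can be down while writes still reach w nodes."""
    return n - w


def read_failures_tolerated(n: int, r: int) -> int:
    """Number of nodes that can be down while reads still reach r nodes."""
    return n - r


# Majority quorum with n = 5: w = r = (5 + 1) // 2 = 3
majority = (write_failures_tolerated(5, 3), read_failures_tolerated(5, 3))

# Read-optimized setting with n = 3: w = n, r = 1
read_heavy = (write_failures_tolerated(3, 3), read_failures_tolerated(3, 1))
```

The majority setting tolerates two failed nodes for both reads and writes, while the read-optimized setting tolerates two failed nodes for reads but none for writes.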

Answer 28

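- A sloppy quorum is used, so the w writes and the r reads may end up on different nodes with no guaranteed overlap.
- Two writes occur concurrently, so it is not clear which happened first.
- A node carrying a new value fails and its data is restored from a replica carrying an old value.
- A write happens concurrently with a read, so the write may be reflected on only some of the replicas.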

Answer 29

In a large cluster it's likely that a client can connect to *some* database nodes during the interruption, but not to the nodes it needs to assemble a quorum for a particular value. Designers can choose to accept writes anyway and write them to nodes that are reachable but aren't among the *n* nodes on which the value usually lives. Once the network is fixed, any writes that a node temporarily accepted are sent to the appropriate "home" nodes (hinted handoff).

Answer 30

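Yes, they are particularly useful for increasing write availability: as long as any w nodes are available, the DB can accept writes. However, this means that even when w + r > n you cannot be sure to read the latest value for a key, because it may have been temporarily written to nodes outside of the usual *n*. A sloppy quorum is more an assurance of durability than an actual quorum.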

Answer 31

The number of replicas *n* includes nodes in all DCs. Each write from a client is sent to all replicas regardless of DC, but the client usually only waits for the ACK from a quorum of nodes within its local DC, so it is unaffected by delays and interruptions on the cross-datacenter link.

Answer 32

- **Last Write Wins**: each replica stores only the most "recent" value. As long as there is a way to determine which write is more "recent", and every write is eventually copied to every replica, the replicas will eventually converge to the same value. (Cassandra)
- **Merging Concurrently Written Values**: take the union of the concurrent values (Ex: a shopping cart), or include extra code in the application to merge them.
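Both approaches can be sketched in a few lines. The example assumes each write carries a timestamp and that shopping carts are plain sets of item names; `last_write_wins` and `merge_siblings` are hypothetical helpers, not functions from Cassandra or any other datastore.

```python
from typing import List, Set, Tuple


def last_write_wins(writes: List[Tuple[float, str]]) -> str:
    """Keep only the value with the highest timestamp; all other writes are discarded."""
    return max(writes, key=lambda write: write[0])[1]


def merge_siblings(siblings: List[Set[str]]) -> Set[str]:
    """Merge concurrently written carts by taking the union of their items."""
    merged: Set[str] = set()
    for cart in siblings:
        merged |= cart
    return merged


winner = last_write_wins([(1001.0, "cart-v1"), (1002.5, "cart-v2")])
cart = merge_siblings([{"milk", "eggs"}, {"milk", "flour"}])
```

Last write wins silently drops `cart-v1` even if the two writes were concurrent, which is the price of convergence. A plain union keeps every added item but cannot express removals, so an application that lets users delete items would need extra bookkeeping such as deletion markers.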

Answer 33

Using **Version Vectors** and the properties of causality (An operation A happens before another operation B if B knows about A or depends on A or builds upon A in some way).

Answer 34

It's a data structure used for determining the partial ordering of events in a distributed system and for detecting causality violations. We use a version number per replica as well as per key. Each replica increments its own version number when processing a write and also keeps track of the version numbers it has seen from each of the other replicas. This information indicates which values to overwrite and which values to keep as siblings.
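A minimal sketch of how two version vectors can be compared to decide between overwriting and keeping siblings. It assumes a version vector is a dictionary from replica name to counter, with missing entries treated as zero; `dominates` and `compare` are hypothetical helper names.

```python
from typing import Dict

VersionVector = Dict[str, int]


def dominates(a: VersionVector, b: VersionVector) -> bool:
    """True if a has seen at least every write that b has seen."""
    return all(a.get(replica, 0) >= count for replica, count in b.items())


def compare(a: VersionVector, b: VersionVector) -> str:
    """Classify the relationship between two version vectors."""
    a_covers_b = dominates(a, b)
    b_covers_a = dominates(b, a)
    if a_covers_b and b_covers_a:
        return "equal"
    if a_covers_b:
        return "a supersedes b"  # b can be overwritten
    if b_covers_a:
        return "b supersedes a"  # a can be overwritten
    return "concurrent"          # keep both values as siblings


newer = compare({"r1": 2, "r2": 1}, {"r1": 1, "r2": 1})
siblings = compare({"r1": 2}, {"r2": 1})
```

In the first comparison one vector has seen everything the other has, so the older value can be overwritten. In the second, neither vector covers the other, so the writes are concurrent.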

Answer 35

- Initially all clocks are zero.
- Each time a process experiences an internal event, it increments its own logical clock in the vector by one.
- Each time a process sends a message, it increments its own logical clock in the vector by one (as in the bullet above, but not twice for the same event) and then sends a copy of its own vector.
- Each time a process receives a message, it increments its own logical clock in the vector by one and updates each element in its vector by taking the maximum of the value in its own vector clock and the value in the vector in the received message (for every element).
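The update rules above can be sketched as a small class. This is an illustration only: the `VectorClock` class and its method names are hypothetical, and a missing entry in the dictionary stands for a clock that is still zero.

```python
from typing import Dict


class VectorClock:
    def __init__(self, process_id: str) -> None:
        self.process_id = process_id
        self.clock: Dict[str, int] = {}  # missing entries count as zero

    def tick(self) -> None:
        """Internal event: increment this process's own entry by one."""
        self.clock[self.process_id] = self.clock.get(self.process_id, 0) + 1

    def send(self) -> Dict[str, int]:
        """Send event: increment own entry, then ship a copy of the vector."""
        self.tick()
        return dict(self.clock)

    def receive(self, incoming: Dict[str, int]) -> None:
        """Receive event: increment own entry, then take the element-wise maximum."""
        self.tick()
        for pid, count in incoming.items():
            self.clock[pid] = max(self.clock.get(pid, 0), count)


p1 = VectorClock("p1")
p2 = VectorClock("p2")
p1.tick()              # p1: {p1: 1}
message = p1.send()    # p1: {p1: 2}, and the message carries {p1: 2}
p2.receive(message)    # p2: {p2: 1, p1: 2}
```

After the exchange, `p2` knows about both of `p1`'s events in addition to its own receive event.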
