Module 9a - Consistency and Replication (part 1) Flashcards by Alex Jabbour

Why do we need to replicate Data?

Improves dependability. Data loss can be prevented in the event of a replica failing, since there are numerous copies
Increases throughput. Replicas can be read/written to in parallel
Decrease Latency. We can keep a data replica close to the client in different geographical regions

How well did you know this?

Not at all

Perfectly

What makes it difficult to design systems that deal with shared mutable states?

We have to account for both Concurrency and Failures

How well did you know this?

Not at all

Perfectly

In a replicated data store, each data object (ex: a row in a table) is _______ at multiple ______.

replicated

hosts

How well did you know this?

Not at all

Perfectly

A replica of an object may be ______ to a process meaning that it resides on the same host, or it may be _______

local

remote

How well did you know this?

Not at all

Perfectly

Why is replicating read-only data straightforward?

Because we don’t have to worry about keeping the data perfectly synchronized across hosts

How well did you know this?

Not at all

Perfectly

If a data store holds ______ state, then we can never keep _______ of this state perfectly _______

mutable
replicas
sychronized

How well did you know this?

Not at all

Perfectly

Why can’t mutable state/object replicas be perfectly synchronized all the time?

Variations in processing speeds

- Network Delays

How well did you know this?

Not at all

Perfectly

What does a consistency model help us with? and how does it do it?

Make a sense of concurrent reading and updating data objects in a distributed system by describing the extent to which replicas are permitted to disagree on the state of data

How well did you know this?

Not at all

Perfectly

What makes it difficult to select a good consistency model?

Application requirements in general do not map neatly to a specific consistency model

How well did you know this?

Not at all

Perfectly

Under what condition is a data store sequentially consistent?

Whenever the result of any execution is the same as if the read/write operations by all processes on the data store were executed in some sequential order and the operations of each individual process appear in this sequence in the order specified by its program.

i.e: the order of execution is the same for sequential or concurrent

There are no contradictions in the order of operations in an execution.

How well did you know this?

Not at all

Perfectly

How do you prove if an execution is sequentially consistent in practice?

You must brute force every possible outcome of the execution order, and find a path which corresponds to a outcome in which the result is the same for reads after writes

Alternatively, you can construct a graph with all the order dependencies of all operations. If there are no cycles in this graph, then the execution is not sequentially consistent

How well did you know this?

Not at all

Perfectly

Under what condition is a data store causally consistent?

Whenever the “causally precedes” condition is met for all processes which read/write the data object in the execution

Op1 occurs before Op2 in the same process
Op2 reads a value written by Op1

How well did you know this?

Not at all

Perfectly

How do you prove that an execution is causally consistent in practice?

You must show that the total order (Ti) for each process (Pi) has these 3 properties:

Ti contains all the operations executed by Pi as well as the writes of any values read by Pi, and nothing else
Each read in Ti returns the value of the most recent write to the same object in Ti (i.e., order is legal)
If 2 operations occur in Ti and also occur in some other Tj, then they must be consistent in order

How well did you know this?

Not at all

Perfectly

______ consistency, _______, and ______ consistency are different ways to define the correct behaviour of operations on shared objects under concurrent access

Sequential
Linearizability
Causal

How well did you know this?

Not at all

Perfectly

The _______ property assumes that operations have well-defined start and finish events. which are ordered by a _____ _____

Linearizability
global
clock

How well did you know this?

Not at all

Perfectly

Under what condition is a data store / execution linearizable?

Study These Flashcards

Whenever the result of the execution is the same as if the operations by all processes on the data store were executed in some sequential order that extends the “happens before” relation - with respect to a global clock:

In other words, if Op1 finishes before Op2 begins, then Op2 must precede Op1 in the sequential order.

Essentially, the order of execution must be serializable sequentially

extra note: linearizability is with context to a global clock whereas sequential consistency is not

How do you prove that an execution is linearizable in practice?

Study These Flashcards

You must find some total order of execution (T) that satisfy 3 properties:

T must only contain all the operations present in the execution from all processes
Each read in T returns the value of the most recent write to that same object in T (i.e. the order is legal)
Draw linearization points from start to finish within the intervals of time which are legal

To prove that an execution is ________ consistent or ________, it suffices to exhibit a total order on the operations of that execution that satisfies certain properties. For ________ consistency, a total order is defined separately for each process.

Study These Flashcards

sequentially
linearizable
causal

To prove that an execution violates _______ consistency or _______, we draw a special graph and exhibit a cycle. For _______ consistency, a graph is defined for each process.

Study These Flashcards

sequential
linearizability
causal

Define eventual consistency

Study These Flashcards

In a replicated system, if no updates take place for a long time, then all replicas will gradually become consistent.

If replicas are never consistent even after infinite time, then they do not have “eventual consistency”

Eventual consistency is when in the ______ of new writes from ______, all servers will ______ hold the same data

Study These Flashcards

absence
clients
eventually

Give a negative example of eventual consistency

Study These Flashcards

the following steps:

P1 executes Write(a, x)
P2 executes Write(b, x)
P3 executes Read(x), gives a
P4 executes Read(x), gives b
step 3 and step 4 repeat forever infinitely

Session guarantees are used to augment eventual consistency. One property of a session guarantee is monotonic reads. What are monotonic reads?

Study These Flashcards

Whenever a process reads the value of a data item key x, then any successive read operations on x by that process will always return that same value or a more recent value.

i.e. there is no stale data

Session guarantees are used to augment eventual consistency. One property of a session guarantee is “read your own writes”. What is this property of “read your own writes”?

Study These Flashcards

Whenever a process writes data item key x and then reads it, the read operation should return either the value written by the same process earlier, or a more recent value.

______ consistency does not promise anything in the case when updates are applied continuously. Therefore, it is augmented with ______ ______ which ______ the behaviour of operations applied by a single process in a single session

eventual session guarantees restrict

Is Eventual consistency a property of a strong system or a weak system?

Weak system. It does not guarantee that a resource in a system will be consistent across all replicas at a given period of time

Module 9a - Consistency and Replication (part 1) Flashcards

(26 cards)