CS7210 Flashcards

Question

In the context of state in a distributed system, what is a postrecording event?

Answer 1

Events that happen after a cut

Answer 2

* Instant recording is impossible
* We can't assume a global clock
* Networks have delays
* They're not deterministic

Answer 3

If and only if, once it becomes true in a state *S*, it remains true for all states *S'* reachable from *S*

Answer 4

Deadlock and termination

Answer 5

If, once it becomes true in a state *S*, it is possibly (but not necessarily) true in a later state reachable from *S*

Answer 6

Agreement among distributed processes about something, e.g. the outcome of a transaction

Answer 7

All non-faulty processes eventually decide on a value, all processes decide on a single value, and the value that's decided on must have been proposed by some process
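The three properties above (often called termination, agreement, and validity) can be checked against the record of a finished run. This is a minimal sketch, not a protocol: the function name `check_consensus` and the dictionary layout are assumptions made for illustration, and "eventually" is approximated by looking at the run after it has ended.

```python
def check_consensus(proposed, decided):
    """Check one finished run against the three consensus properties.

    proposed: dict mapping every process to the value it proposed.
    decided:  dict mapping each non-faulty process to the value it decided,
              or None if it never decided (hypothetical layout).
    """
    # Termination: every non-faulty process has decided.
    termination = all(v is not None for v in decided.values())
    # Agreement: at most one distinct value was decided.
    values = {v for v in decided.values() if v is not None}
    agreement = len(values) <= 1
    # Validity: every decided value was proposed by some process.
    validity = values <= set(proposed.values())
    return {"termination": termination, "agreement": agreement, "validity": validity}
```

For example, `check_consensus({"p1": 0, "p2": 1}, {"p1": 0, "p2": 1})` reports that agreement is violated, because two different values were decided.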

Answer 8

A run with at most one faulty processor in which all messages are eventually delivered (it matches the system model)

Answer 9

An admissible run where some non-faulty processors reach a decision

Answer 10

When all admissible runs are also deciding runs

Answer 11

When only one decision is possible

Answer 12

When more than one decision is possible

Answer 13

The Fischer/Lynch/Paterson (FLP) theorem says that in an asynchronous system with even one fault, no consensus protocol can be totally correct.

Answer 14

They change some of the assumptions and system properties

Answer 15

To make state available at more than one node

Answer 16

In active replication, every replica can handle requests. In stand-by replication, one replica takes requests and the others are kept consistent in preparation for a failover.

Answer 17

State replication, and replicated state machine

Answer 18

State replication is where one replica processes a request, and then copies the new resulting state to other replicas. Replicated state machine is where a copy of each operation is sent to and executed at each replica, to produce the same state update.
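The difference between the two techniques can be sketched with a toy replica whose state is a dictionary of counters. The `Replica` class and both helper functions are hypothetical names used only for illustration; real systems would send the state or the operation over the network rather than call methods directly.

```python
import copy


class Replica:
    """Toy replica: state is a dict of counters, an operation is (key, delta)."""

    def __init__(self):
        self.state = {}

    def apply(self, op):
        key, delta = op
        self.state[key] = self.state.get(key, 0) + delta


def state_replication(primary, backups, op):
    # Only the primary executes the operation...
    primary.apply(op)
    # ...then the resulting state is copied to every other replica.
    for backup in backups:
        backup.state = copy.deepcopy(primary.state)


def replicated_state_machine(replicas, op):
    # Every replica executes the same (deterministic) operation itself.
    for replica in replicas:
        replica.apply(op)
```

Both approaches leave the replicas with the same state here because `apply` is deterministic; the second approach would diverge if it were not.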

Answer 19

Pro: no need to re-execute the operation multiple times. Con: the state may be large, or it may be hard to identify where all the updates are.

Answer 20

Pro: no need to send large state, and operation logs may be smaller. Con: every replica must re-execute the operations, and the operations must be deterministic.

Answer 21

The head node receives a write request, and forwards it down a chain of replicas. This prevents the head replica from having to interact with every other replica.

Answer 22

The tail replica (last one in the chain), because it ensures that all of the replicas have received the updates.
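A minimal in-memory sketch of the chain described above: writes enter at the head and are forwarded node by node, and reads are served by the tail. `ChainNode`, `build_chain`, and `read` are hypothetical names, and failures and acknowledgements are left out.

```python
class ChainNode:
    """One replica in the chain; `next` is the following replica, if any."""

    def __init__(self):
        self.store = {}
        self.next = None

    def write(self, key, value):
        # Apply locally, then forward down the chain.
        self.store[key] = value
        if self.next is not None:
            self.next.write(key, value)


def build_chain(n):
    """Link n nodes head -> ... -> tail and return them in chain order."""
    nodes = [ChainNode() for _ in range(n)]
    for node, successor in zip(nodes, nodes[1:]):
        node.next = successor
    return nodes


def read(chain, key):
    # Reads go to the tail: anything it holds has reached every replica.
    return chain[-1].store.get(key)
```

With `chain = build_chain(3)`, a call to `chain[0].write("k", "v")` only has to talk to the second node, yet the value is visible through `read(chain, "k")` once the write has reached the tail.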

Answer 23

Pros: the head (write) node scales well and can handle a lot of writes, and strong consistency is possible. Cons: many workloads are read-heavy, and the scheme is inefficient in that the middle nodes may be underutilized.

Answer 24

It keeps both the old and new versions of data at each replica while the update propagates through the chain. If both versions are present, it checks with the tail to see if the new version has finished propagating. If it hasn't, it returns the old version. This makes it possible to send reads to the replicas in the middle of the chain.
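The read rule described above can be sketched as follows. This is an illustration under simplifying assumptions, not the actual protocol implementation: `CraqNode`, its `clean`/`dirty` dictionaries, and the direct access to the tail's data are hypothetical stand-ins for what would be network messages.

```python
class CraqNode:
    """Replica that keeps an old (clean) and a new (dirty) version per key."""

    def __init__(self, tail=None):
        self.clean = {}   # key -> (version, value) known to be fully propagated
        self.dirty = {}   # key -> (version, value) still moving down the chain
        self.tail = tail  # None means this node is itself the tail

    def write(self, key, version, value):
        if self.tail is None:
            # A write that reaches the tail has finished propagating.
            self.clean[key] = (version, value)
        else:
            self.dirty[key] = (version, value)

    def ack(self, key, version):
        # The tail's acknowledgement promotes the new version to clean.
        if key in self.dirty and self.dirty[key][0] == version:
            self.clean[key] = self.dirty.pop(key)

    def read(self, key):
        if key not in self.dirty:
            return self.clean.get(key, (None, None))[1]
        # Both versions present: ask the tail which version it has.
        new_version, new_value = self.dirty[key]
        tail_version = self.tail.clean.get(key, (None, None))[0]
        if tail_version == new_version:
            return new_value
        return self.clean.get(key, (None, None))[1]
```

A middle node can therefore answer a read on its own whenever it holds only one version, and needs the tail only while an update is in flight.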

Answer 25

propagates

Answer 26

A fault that appears and then disappears

Answer 27

A fault that appears on and off

Answer 28

A fault that stays once it occurs

Answer 29

One or more components of the distributed system stop working/responding, like a crash

Answer 30

The system components behave outside of some timing expectations, which can interfere with things like timeouts

Answer 31

When some action is missing

Answer 32

Detection, Removal, and Recovery

Answer 33

Checkpoints and logging

Answer 34

Where the node periodically saves its state, so that it can reload from there

Answer 35

Where the node logs information about the operations performed, so that it can undo or redo changes. The log is kept on persistent storage, but the changes are smaller so less I/O time is needed. The downside is that recovery takes longer than it does in checkpointing.

Answer 36

Checkpoint every so often, and keep logs since the last checkpoint
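Combining the two techniques can be sketched as below. The `Node` class is hypothetical, and plain Python attributes stand in for persistent storage, so this only shows the bookkeeping, not real durability.

```python
class Node:
    """Toy node that recovers from its last checkpoint plus the log since then."""

    def __init__(self):
        self.state = {}
        self.checkpoint = {}  # stand-in for a snapshot on persistent storage
        self.log = []         # stand-in for a persistent operation log

    def apply(self, key, value):
        # Log the operation first, then apply it to the in-memory state.
        self.log.append((key, value))
        self.state[key] = value

    def take_checkpoint(self):
        # Save the full state; older log entries are no longer needed.
        self.checkpoint = dict(self.state)
        self.log.clear()

    def recover(self):
        # Reload the checkpoint, then redo only the operations logged after it.
        self.state = dict(self.checkpoint)
        for key, value in self.log:
            self.state[key] = value
```

Checkpointing more often shortens the log that has to be replayed during recovery, at the cost of writing the full state more frequently.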

Answer 37

Uncoordinated, coordinated, and communication-induced

Answer 38

When processes take checkpoints independently

Answer 39

* You might have to go very far back before finding a combination of the processes' checkpoints that makes up a consistent cut (the domino effect)
* Processes may capture checkpoints that can't be part of any consistent cut
* You may need to keep more than the most recent checkpoint
* It creates the need to identify obsolete checkpoints

Answer 40

When the processes coordinate their checkpoints so that they get a consistent cut.
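Whether a set of checkpoints forms a consistent cut can be checked with one rule: every message received inside the cut must also have been sent inside the cut. The sketch below assumes a hypothetical encoding in which each process's events are numbered from 1 and a cut records how many events of each process it includes.

```python
def is_consistent_cut(cut, messages):
    """Return True if no message is received inside the cut but sent outside it.

    cut:      dict mapping each process to the number of its events in the cut.
    messages: list of (sender, send_index, receiver, recv_index) tuples,
              with 1-based event indices (hypothetical encoding).
    """
    return all(
        send_index <= cut[sender]
        for sender, send_index, receiver, recv_index in messages
        if recv_index <= cut[receiver]
    )
```

For a single message sent as the second event of `p1` and received as the first event of `p2`, the cut `{"p1": 1, "p2": 1}` is inconsistent, while `{"p1": 2, "p2": 1}` and `{"p1": 2, "p2": 0}` are consistent.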

Answer 41

* Blocking: an initiator uses 2-phase commit or some other consensus algorithm to stop the underlying application work and take a snapshot
* Non-blocking: uses the global snapshot algorithm, piggybacking the information on application messages instead of using a marker, which eliminates the need for FIFO networking

Answer 42

Spending compute vs I/O

Answer 43

Logging everything to persistent storage before allowing an event to propagate and commit

Answer 44

Assuming that the log will be persisted before a failure occurs, but making it possible to remove the effects if an abort is needed

Answer 45

Ensuring causally related events are deterministically recorded

(72 cards)