chapter 7 Flashcards

Question

describe a flat group

Answer 1

- all processes are equal - good for fault tolerance since information exchange immediately occurs with all group members - imposes higher overhead as control is completely distributed - hard to implement

Answer 2

- All communication go through a single coordinator - Loss of the coordinator brings the entire group to a halt - not really fault tolerant and scalable - easier to implement

Answer 3

When a group can mask any k concurrent member failures (k is called degree of fault tolerance).

Answer 4

- Assume crash/performance failure semantics ⇒ a total of k + 1 members are needed to survive k member failures. - Assume arbitrary failure semantics, and group output defined by voting ⇒ a total of 2k + 1 members are needed to survive k member failures.

Answer 5

Assume: - Fail-stop semantics - when a process crashes, this can be reliably detected. - Reliable failure detection - a process P can indeed reliably detect that Q crashed - Unreliable communication Basic idea: - A client contacts a Pi requesting it to execute a command - Every Pi maintains a list of proposed commands - A process group P = {P1,...,Pn} - In round r, Pi multicasts its known set of commands C to all other processes

Answer 6

- An asynchronous system - Communication may be unreliable (meaning that messages may be lost, duplicated, or reordered) - Corrupted messages are detectable (and can thus be discarded) - All operations are deterministic ( can't be interrupted ) - Process may exhibit halting failures, but not arbitrary failures, nor do they collude.

Answer 7

1. client - a thread that requests to have an operation performed 2. proposer - a thread that takes a client’s request and attempts to have the requested operation accepted for execution 3. acceptor - a thread that operates in a quorum to vote for the execution of an operation 4. learner - a thread that eventually performs an operation

Answer 8

1. Safety (nothing bad will happen): - Only proposed operations will be learned - At most one operation will be learned (and subsequently executed before a next operation is learned) 2. Liveness (something good will eventually happen): - If sufficient processes remain non faulty, then a proposed operation will eventually be learned

Answer 9

- Each process is equipped with a failure detection module - A process p probes another process q for a reaction: ---- q reacts →q is alive ---- q does not react within t time units → q is suspected to have crashed Note: in a synchronous system: - a suspected crash is a known crash - referred to as a perfect failure detector

Answer 10

- the eventually perfect failure detector 1. Strong completeness : every crashed process is eventually suspected to have crashed by every correct process. 2. Eventual strong accuracy : eventually, no correct process is suspected by any other correct process to have crashed.

Answer 11

- If p did not receive heartbeat from q within time t → p suspects q. - If q later sends a message (received by p): ---- p stops suspecting q ---- p increases timeout value t - Note: if q does crash, p will keep suspecting q.

Answer 12

1: Client cannot locate server 2: Client request is lost 3: Server crashes 4: Server response is lost 5: Client crashes

Answer 13

1: report back to client 2: Just resend message

Answer 14

3: We need to decide on what we expect from the server: A. At-least-once-semantics: The server guarantees it will carry out an operation at least once, no matter what. [ read ] B. At-most-once-semantics: The server guarantees it will carry out an operation at most once. [ write, transfer 10k ]

Answer 15

4: Detecting lost replies can be hard, because it can also be that the server had crashed. You don’t know whether the server has carried out the operation Solution: None, except that you can try to make your operations: - idempotent: repeatable without any harm done if it happened to be carried out before.

Answer 16

Client crashes but The server is doing work and holding resources for nothing

Answer 17

- Orphan is killed (or rolled back) by client when it reboots - Broadcast new epoch number when recovering ⇒ servers kill orphans - Require computations to complete in a T time units. Old ones are simply removed.

chapter 7 Flashcards

(41 cards)