Distributed Systems Flashcards

Question 1

Q

What is a distributed system?

Answer

A

Software in which components are located on networked computers and can coordinate their actions by sending messages.

Question 2

Q

What is the difference between a parallel system and a distributed system.

Answer

A

Parallel systems still share memory, while distributed systems don’t share any components.

Question 3

Q

Pros of a DS?

Answer

A

Scalability
Reduced latency
Fault tolerance
Mobility

Question 4

Q

Characteristics of DS?

Answer

A

Each entity has its own memory, distributed state needs to be synchronized
Entities communicate using message passing
Each entity maintains parts of the complete picture
Fault tolerant

Question 5

Q

Challenges with a DS?

Answer

A

Partial failure
Unreliable networks
Unreliable time
No single source of truth

Question 6

Q

What is a synchronous system?

Answer

A

The process execution speeds or message delivery times are bounded. This means that timed failure detection, time-based coordination, and worst case performance can exist.

Question 7

Q

What is a asynchronous system?

Answer

A

There are no assumptions about process execution speeds or message delivery times.

Question 8

Q

Why is waiting for a reponse in an asynchronous system ambiguous?

Answer

A

Because you cannot tell if
The request was lost
The remote node is down
The response was lost

The usual remedy is to set timeouts and retry until success

Question 9

Q

What are the two ways to keep time?

Answer

A

Real time clocks (RTCs), which are kept in sync with the NTP protocol with centralized servers.
Monotonic clocks which only move forward.

Question 10

Q

Why are monotonic clocks useful and who maintains them?

Answer

A

They are maintained by the OS, and are helpful for maintaining order within a node.

Question 11

Q

Why are RTCs useful.

Answer

A

THey can synchronize time across nodes with an accuracy of milliseconds, whereas modern CPUs can do millions of operations / ms.

Question 12

Q

External ordering schemes?

Answer

A

Total order: Message rate is globally bounded, synchronized RTCs guarantee order
Causal order: Rely on the happens-before relationship

Question 13

Q

When do we call events concurrent?

Answer

A

When they don’t have a happens-before relationship.

Question 14

Q

What are lamport timestamps?

Answer

A

Each process p maintains a counter LT(p)
* p performs action, increments LT(p)
* p sends a message, includes LT(p)
* P receives a message from q. LT(p) = max(LT(p), LT(q)) + 1

For two events a -> b then LT(a) < LT(b). The reverse is not true.

Question 15

Q

What are vector clocks

Answer

A

On a system of N nodes, each node i maintains a vector Vi of size N. As such:
* Vi[i] is the number of events that occurred in node i
* Vi[j] is the number of events that node i knows occurred at node j

Question 16

Q

How do VC updates work?

Answer

Study These Flashcards

A

Local events at node i increment Vi[i]
WHen node i sends a message to node j, it includes Vi
When node j received Vi, it updates all elements of Vj to Vj[x] = max(Vi[x], Vj[x])

Question 17

Q

What are the guarantees of VCs?

Answer

Study These Flashcards

A

If a -> b then VC(a) < VC(b)
If VC(a) < VC(b) then a -> b
If VC(a) < VC(b) then RT(a) < RT(b)

Distributed Systems Flashcards

(17 cards)