Long Exam 2 - Distributed Systems Flashcards
What is a distributed system?
A collection of autonomous computing elements that appear as a single coherent system with autonomous computing elements (nodes).
What is it meant by a collection of autonomous nodes?
Each node is autonomous and has its own notion of time without a global clock. This however leads to fundamental synchronization and coordination problems.
What is an overlay network?
Each node in the collection communicated only with the other nodes in the system.
What are the two types of overlay networks?
Structured (well-defined set of neighbors through trees and rings) and unstructured (randomly select other nodes)
What are the four goals of a distributed system?
- sharing of resources
- distribution transparency
- openness
- scalability
What are the three types of scalability?
- size scalability
- geographical scalability
- administrative scalability
How to design fault-tolerant systems?
- Identify all possible faults
- Detect and contain the fault
- Handle the fault
What is the acronym RAID for?
Redundant Array of Inexpensive Disks
What is RAID 1?
- mirroring
- can recover form single-disk failure
- requires 2N disks
What is RAID 4?
- dedicated parity disk
- can recover from single-disk failure
- requires N+1 disk
- performance benefits if you stripe a single file across multiple data disks
- all writes hit the parity disk
What is RAID 5?
- spread out parity
- can recover from single-disk failure
- requires N+1 disk
- performance benefits if you stripe a single file across multiple data disks
- writes are spread across disks
What is isolation?
Occurs either completely before or completely after every other concurrent threads
What is the golden rule to achieve atomicity?
Never modify the only copy.
How to make renaming shadow copies atomic?
By using single-sector writes.
What is a shadow copy?
Shadow copies work because they perform updates/changes on a copy and automatically install a new copy using an atomic operation