12.13.Fault Tolerance - Transactions Flashcards by Katie Dimitropoulaki

What is a characteristic that distinguishes distributed systems from single-machine systems?

Partial failure

What is the goal when partial failure occurs?

Tolerate faults

What is being fault tolerant related to?

Dependability

What is dependability?

the trustworthiness of a computing system which allows reliance to be justifiably placed on the service it delivers

What are the requirements for Dependability?

Availability
Reliability
Maintainability
Safety

What does Safety mean?

If and when FAILURES occur the CONSEQUENCES are not catastrophic for the system

What does Availability mean?

the probability that the system operates correctly at ANY GIVEN MOMENT

What does Reliability mean?

LENGTH OF TIME that it can run continuously without failure

What does Maintainability mean?

how EASILY a failed system can be REPAIRED

What are the different types of failures?

Crash
Omission
Response
Timing
Arbitrary (Byzantine)

What is a technique for failure masking?

Redundancy

What are the three types of redundancy?

Physical
Information (send extra bits to allow for recovery if need be)
Time (repeat action if need be)
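
As a rough illustration of the last two kinds, the sketch below uses a triple-repetition code for information redundancy and a retry loop for time redundancy. The function names (`encode`, `decode`, `retry`) and the choice of a repetition code are assumptions made for this example, not something the cards prescribe.

```python
from collections import Counter


def encode(bits):
    """Information redundancy: send every bit three times."""
    return [b for b in bits for _ in range(3)]


def decode(received):
    """Recover each original bit by majority vote over its three copies."""
    groups = [received[i:i + 3] for i in range(0, len(received), 3)]
    return [Counter(group).most_common(1)[0][0] for group in groups]


def retry(action, attempts=3):
    """Time redundancy: repeat the action until it succeeds or attempts run out."""
    last_error = None
    for _ in range(attempts):
        try:
            return action()
        except Exception as err:  # a real system would catch narrower errors
            last_error = err
    raise last_error
```

With three copies per bit, a single flipped copy in any group is outvoted by the other two; two flips in the same group would not be recovered.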

What is one of our most important considerations in failure masking?

Making sure that a failure won’t leave the system in an inconsistent state

How is avoiding leaving the system in an inconsistent state achieved?

Atomic operations

“The sequence of operations must execute as an ATOMIC operation”

When do concurrent executions not interfere with each other?

If their execution is equivalent to a serial one (as if they did not interleave)

What does the property of Isolation in distributed systems refer to?

Isolated execution (concurrent applications do not interfere with each other)

What should the distributed application not violate in order to achieve Consistency?

Database’s integrity constraints

What does durability mean in distributed systems?

Changes to the database are persistent

What concept allows for the enforcement of the ACID properties?

transactions

What is a transaction?

A set of operations that is either fully committed or aborted as a whole. If it is aborted, none of the operations in the set takes effect.
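
The all-or-nothing behaviour can be sketched in memory as follows. This is a minimal illustration, not a real database mechanism: the `Transaction` class and the `deposit` and `withdraw` helpers are hypothetical names, and the commit step is not itself crash-safe.

```python
import copy


class Transaction:
    """Applies a set of operations to a dict all-or-nothing (hypothetical helper)."""

    def __init__(self, store):
        self.store = store
        self.operations = []

    def add(self, operation):
        self.operations.append(operation)

    def run(self):
        working = copy.deepcopy(self.store)  # work on a private copy
        try:
            for operation in self.operations:
                operation(working)
        except Exception:
            return False  # abort: the real store was never touched
        self.store.clear()
        self.store.update(working)  # commit: publish all changes together
        return True


def deposit(account, amount):
    def operation(accounts):
        accounts[account] += amount
    return operation


def withdraw(account, amount):
    def operation(accounts):
        if accounts[account] < amount:
            raise ValueError("insufficient funds")
        accounts[account] -= amount
    return operation
```

If the withdrawal fails after the deposit has already run on the private copy, the real store still shows neither change.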

What algorithms in the implementation of transactions allow for ISOLATION?

Concurrency control

What do concurrency control algorithms do to ensure ISOLATION?

Ensure execution is equivalent to “serial” execution

What algorithms in the implementation of transactions allow for DURABILITY?

Recovery algorithms

What do recovery algorithms do to ensure DURABILITY?

- Replay actions of committed transactions
- Undo effects of aborted transactions
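
One simplified way to picture this is recovery from a log after a crash. The sketch below assumes a made-up log format and a redo-only scheme, in which the effects of aborted or unfinished transactions are "undone" by not replaying them. Real recovery algorithms are considerably more involved.

```python
def recover(log):
    """Rebuild the database state from a log after a crash.

    Log records (a made-up format for this sketch):
      ("write", tx_id, key, value), ("commit", tx_id), ("abort", tx_id)
    """
    # First pass: find out which transactions reached commit.
    committed = {record[1] for record in log if record[0] == "commit"}

    # Second pass: replay only the writes of committed transactions,
    # in log order, so later committed writes win.
    state = {}
    for record in log:
        if record[0] == "write" and record[1] in committed:
            _, _, key, value = record
            state[key] = value
    return state
```

Writes from a transaction that aborted, or that never logged a commit before the crash, leave no trace in the recovered state.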

Two ways to improve on concurrency control with locking?

- Optimistic concurrency control (transaction executed normally, checked at commit, aborted if problematic)
- Timestamp ordering (operations in transactions validated when carried out)
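
A minimal single-threaded sketch of the optimistic approach, assuming a versioned key-value store: the class names `VersionedStore` and `OptimisticTransaction` are hypothetical, and a real implementation would also need to make validation and write-back atomic.

```python
class VersionedStore:
    """Key-value store where every key carries a version number (hypothetical)."""

    def __init__(self):
        self.data = {}  # key -> (value, version)

    def read(self, key):
        return self.data.get(key, (None, 0))


class OptimisticTransaction:
    def __init__(self, store):
        self.store = store
        self.read_versions = {}  # versions seen while executing
        self.writes = {}         # buffered, not yet visible to others

    def read(self, key):
        if key in self.writes:
            return self.writes[key]
        value, version = self.store.read(key)
        self.read_versions.setdefault(key, version)
        return value

    def write(self, key, value):
        self.writes[key] = value

    def commit(self):
        # Validation at commit: abort if anything we read has changed since.
        for key, seen in self.read_versions.items():
            if self.store.read(key)[1] != seen:
                return False
        # No conflict: publish the buffered writes with bumped versions.
        for key, value in self.writes.items():
            _, version = self.store.read(key)
            self.store.data[key] = (value, version + 1)
        return True
```

Two transactions that both read the same key and then write it will conflict: whichever commits second fails validation and has to be retried.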

Two ways to do recovery when transaction needs to be aborted?

- Backwards (through state checkpoints -> previously correct state)
- Forwards (correct new state)

What problem arises when trying to make transactions where more than one server is involved?

The distributed commit problem (ATOMICITY): either all servers commit or all abort

Protocol to support distributed transactions? (more than one server)

1. Pick coordinator
2. Client communicates transaction to coordinator
3. One- or two-phase commit (coordinator communicates commit or abort of transaction to servers)

What is the difference between a one phase commit and a two phase commit when dealing with distributed transactions?

In a two-phase commit the servers can first accept (or refuse) the commit and only then execute it, rather than just receiving the command
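
The two phases can be sketched as below. This is a simplified in-process model under stated assumptions: the `Participant` class, its `can_commit` flag, and the `two_phase_commit` function are hypothetical, and timeouts, message loss, logging, and coordinator failure are all ignored.

```python
class Participant:
    """A server taking part in the transaction (hypothetical interface)."""

    def __init__(self, name, can_commit=True):
        self.name = name
        self.can_commit = can_commit
        self.state = "working"

    def prepare(self):
        # Phase 1: vote on whether this server is able to commit.
        self.state = "prepared" if self.can_commit else "aborted"
        return self.can_commit

    def commit(self):
        self.state = "committed"

    def abort(self):
        self.state = "aborted"


def two_phase_commit(participants):
    # Phase 1: the coordinator collects a vote from every participant.
    votes = [p.prepare() for p in participants]
    # Phase 2: commit only if all voted yes, otherwise abort everywhere.
    if all(votes):
        for p in participants:
            p.commit()
        return "committed"
    for p in participants:
        p.abort()
    return "aborted"
```

A one-phase commit would correspond to skipping the voting step and sending the command directly, which leaves a server no way to refuse.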

What are the drawbacks of the 2-phase commit?

- Coordinator failure (three-phase commit and multicast?)
- Participants must trust coordinator
- Transactions must be short
- Distributed deadlock risk

How can deadlock be resolved?

By aborting one of the transactions

When does deadlock occur?

When there is a cycle in the wait-for graph of transactions for locks
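
For a wait-for graph held in one place, finding such a cycle is a depth-first search. The sketch below assumes the graph is a dictionary mapping each transaction to the transactions it waits for; the function name `find_deadlock` is made up for the example.

```python
def find_deadlock(wait_for):
    """Return a list of transactions forming a cycle, or None.

    wait_for maps each transaction to the transactions it is waiting on.
    """
    visiting, done = [], set()

    def visit(tx):
        if tx in visiting:
            return visiting[visiting.index(tx):]  # cycle found
        if tx in done:
            return None
        visiting.append(tx)
        for other in wait_for.get(tx, ()):
            cycle = visit(other)
            if cycle:
                return cycle
        visiting.pop()
        done.add(tx)
        return None

    for tx in wait_for:
        cycle = visit(tx)
        if cycle:
            return cycle
    return None
```

Any transaction in the returned cycle is a candidate for being aborted to break the deadlock.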

What complicates detecting deadlock in a distributed system?

- Locks are held on different servers
- A loop in the entire wait-for graph will not be apparent to any one server

One (bad) way of detecting distributed deadlock?

Coordinator stores entire wait-for graph (central point of failure)

What is a better way to detect distributed deadlock?

Edge chasing (Path pushing)
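
The idea behind edge chasing can be simulated in a single process as below. This is only a sketch under assumptions: real edge chasing exchanges probe messages between servers, whereas here the forwarding is modelled with a queue, and the `local_edges` layout and the `edge_chasing` function name are invented for the example.

```python
from collections import deque


def edge_chasing(initiator, local_edges):
    """Return True if a probe started by `initiator` comes back to it.

    local_edges maps a server name to the wait-for edges known only to
    that server: {server: {transaction: [transactions it waits for]}}.
    """
    def successors(tx):
        # Each server answers only from its own local edges.
        for edges in local_edges.values():
            yield from edges.get(tx, ())

    probes = deque(successors(initiator))  # probes sent on outgoing edges
    seen = set()
    while probes:
        tx = probes.popleft()
        if tx == initiator:
            return True  # probe returned: cycle through the initiator
        if tx in seen:
            continue
        seen.add(tx)
        probes.extend(successors(tx))  # forward the probe along the edges
    return False
```

No server ever holds the whole graph here: each one contributes only the edges it knows about, and the deadlock is detected when the probe arrives back at the transaction that started it.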

(35 cards)