unit 8: distributed DBMS reliability Flashcards
Topics included in this unit
reliability concepts
failures and fault tolerance
failures in Distributed DBMS
Local reliability protocol
Distributed reliability protocol
dealing with site failure
dealing with network partitioning
what happens when a system is given some inputs
the system produces some outputs
what is reliability
reliability is a measure of the extent to which a system works as intended
OR
It is a measure used to indicate how successful a system is in providing the service it was intended to provide
proper definition of a reliable system
A system is considered reliable if it functions as per its specifications and produces correct output values for a given set of input values
what is the purpose of reliability
the purpose of reliability is to ensure that the atomicity and durability of the database can be maintained
both atomicity and durability are properties of a transaction
what is a reliable DDBMS
a reliable DDBMS is one that continues to process user requests even when the underlying system is unreliable, i.e. even if components of the distributed computing environment fail, the DDBMS should be able to continue executing user requests without violating database consistency
what does reliability refer to?
reliability refers to atomicity and durability of transaction.
explain what availability is
- the fraction of the time that a system meets its specification
- the probability that the system is operational at a given time t.
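The "fraction of the time" definition can be illustrated with a small calculation. This is a minimal sketch: the function name `availability` and the interval data are hypothetical, chosen only for illustration.

```python
def availability(up_intervals, total_time):
    """Fraction of an observation window during which the system met its specification.

    up_intervals: list of (start, end) pairs during which the system was operational.
    total_time:   length of the whole observation window, in the same time unit.
    """
    if total_time <= 0:
        raise ValueError("total_time must be positive")
    up_time = sum(end - start for start, end in up_intervals)
    return up_time / total_time


# Hypothetical log: the system was up from t=0 to t=40 and from t=45 to t=100.
observed = [(0, 40), (45, 100)]
print(availability(observed, 100))
```

With this made-up log the system was up for 95 of 100 time units, so its measured availability is 95%.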
which protocols address these issues of reliability
the commit and recovery protocols, which address atomicity and durability respectively
what is the meaning of failure
a failure occurs when the system does not function according to its specifications
OR
the failure of a system occurs when the system does not function according to its specifications and fails to deliver the service for which it was intended
what is an erroneous state
an internal state of a system such that there exist circumstances in which further processing, by the normal algorithms of the system, will lead to a failure which is not attributed to a subsequent fault
===
in other words: an internal condition in which, because of some internal fault or incorrect state, further processing cannot proceed correctly and can lead to a failure
what is an error
an error in the system occurs when a component of the system assumes a state that is not desirable; whether a state is undesirable is a subjective judgement
what are the types of errors / how are errors distinguished
errors are distinguished as those which are permanent and those which are non-permanent
what is a fault / how is a fault detected
a fault is detected either when an error is propagated from one component to another or when the failure of a component is observed
fault to failure diagram
Fault [causes] Error [results in] Failure
types of faults
Hard faults
Soft faults
explain hard faults
- permanent
- resulting failures are called hard failures
explain soft faults
- transient or intermittent
- account for more than 90% of all failures
- resulting failures are called soft failures
fault classification
three types of errors
1. permanent error
2. intermittent error
3. transient error
a permanent fault or an incorrect design causes a permanent error, which leads to system failure
unstable or marginal components cause an intermittent error, which leads to system failure
an unstable environment causes a transient error, which leads to system failure
an operator mistake leads to system failure
what is a fault-tolerant system
in addition to a fault detection scheme, the system has redundant components and subsystems built in
on detection of a fault, these redundant components are used to replace the faulty components
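The idea of replacing a faulty component with a redundant one can be sketched in a few lines. This is only an illustration under simple assumptions: the classes `Component` and `FaultTolerantService` are hypothetical, and a fault is "detected" here simply by catching the exception raised by a failed component.

```python
class Component:
    """A hypothetical processing component that may be healthy or failed."""

    def __init__(self, name, healthy=True):
        self.name = name
        self.healthy = healthy

    def handle(self, request):
        if not self.healthy:
            raise RuntimeError(f"{self.name} has failed")
        return f"{self.name} handled {request}"


class FaultTolerantService:
    """Keeps redundant spares and swaps one in when the active component fails."""

    def __init__(self, primary, spares):
        self.active = primary
        self.spares = list(spares)

    def handle(self, request):
        while True:
            try:
                return self.active.handle(request)
            except RuntimeError:
                # Fault detected: replace the faulty component with a spare.
                if not self.spares:
                    raise  # no redundancy left, the failure becomes visible
                self.active = self.spares.pop(0)


service = FaultTolerantService(Component("primary", healthy=False),
                               [Component("spare-1")])
print(service.handle("q1"))
```

The caller still gets an answer even though the primary has failed. Once the spares run out, the failure is no longer masked and is passed on to the caller.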
full form of MTBF
mean time between failures
MTTR full form
mean time to repair
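MTBF and MTTR are commonly combined into a steady-state availability figure, MTBF / (MTBF + MTTR). The sketch below assumes MTBF is measured as the average operational time between failures, excluding repair time. The function name and the sample numbers are hypothetical.

```python
def steady_state_availability(mtbf, mttr):
    """Availability estimated as MTBF / (MTBF + MTTR).

    mtbf: mean time between failures (average time the system stays up).
    mttr: mean time to repair (average time needed to bring it back up).
    Both must use the same time unit.
    """
    if mtbf <= 0 or mttr < 0:
        raise ValueError("mtbf must be positive and mttr non-negative")
    return mtbf / (mtbf + mttr)


# Hypothetical figures: 990 hours between failures, 10 hours to repair.
print(steady_state_availability(990, 10))
```

With these made-up figures the system is operational 990 out of every 1000 hours, i.e. 99% of the time. Availability improves when MTBF goes up or MTTR goes down.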
types of failure in DBMS
- hardware failures
- software failures
- storage medium failures
- implementation of stable storage
- communication failure
- transaction failure
how are hardware failures further divided (by cause)
- design errors
- poor quality control
- over utilization and overloading
- wear out