ADBMS - unit 3 & 4: intro to DDBMS & Arch Flashcards
What is distributed database system?
A distributed database is a database that runs and stores data across multiple computers, as opposed to doing everything on a single machine.
What is Node or Instance?
Typically, distributed database systems operate on two or more interconnected servers on a computer network. Each location where a version of the database is running is often called an instance or a node.
How instance runs on centralized and on distributed?
A distributed database, for example, might have instances running in New York, Ohio, and California. Or it might have instances running on three separate machines in New York. A traditional single-instance database, in contrast, only runs in a single location on a single machine.
what is distributed in D dbms
- program logic
- functions
- data
- control
synonymous terms for D DBMS
distributed data processing
multiprocessors / multi computers
satellite processing
backend processing
dedicated / special purpose computers
timeshared systems
functionally modular systems
peer to peer systems
What is DDB system
- DDB is collection of multiple, logically interrelated databases distributes over a computer network
what is D DBM system software
D-DBMS is the s/w that manages the DDB and provides an access mechanism that makes this distribution transparent to the users
D DBMS = DB + communication
Why not D DBMS
timesharing computer system
loosely or tightly coupled multiprocessor system
database system which resides at one of the nodes of a network of computers - this is a centralized database on a network node
distributed dbms promises
transparent management of distributed, fragmented, and replicated data
improved reliability / availability through distributed transactions
improved performance
easier and more economical system expansion
what is meaning of “Promises of Distributed Databases”
Promises of distributed databases, meaning advantages of distributed databases
what is first promise of distributed database
First promise or advantage of distributed database is
1. transparency of data, fragmentation and replication
what is transparency transparency
Explain transparency of data, fragmentation and replication
- transparency refers to separation of the higher level semantics of a system from lower level implementation issues
- DDBMS hides all the added complexities of distribute allowing users to think that they all working with a single centralized systems
eg:
engineering firm that has offices in boston, mumbai, paris, and delhi
- they run projects and maintain database of these employees ex: projects, employees etc
- let us assume that the database is relational and stored in following two relations
Emp( eno, ename, title )
Proj( Pno, Pname, Budget )
- the other relation to store salary information
SAL(Title, Amt)
the 4 relation to know the assign projects with duration and responsibility indicates as
ASG( eno, pno, resp, dur )
if we want to find out the names and employees who worked on a project for more than 12 months the query that we all going to write is:
select ename amt
from ASG.dur > 12
AND Emp.eno = ASG.eno
AND sal.title = emp.title
based on queries it is going to search in different databases of boston paris etc…
in order to quick processing of query we are going to partition each of the relations and store each partition at a different siet
this is known as fragmentation
- data independence DI
- DI is a fundamental form of transparency
- it is capacity of changing the database scheme at one level of database system without efficiency the schema at the next higher level
2 types
- logical DI
- physical DI
LDI stores information about how data is managed inside
PDI deals with hiding the details of the storage structure from user applications
if network transparency / distribution transparency
- other than data the user should be protected from the operational details of the network
- allowing a user to access a resource (application program or data) without the user needing to know whether the resource is located on the local machine or on a remote machine
- replication transparency
- replication transparency ensures that replication of databases are hidden from the users
- it enables users to query upon a table as if only a single copy of the table exists
- fragmentation transparency
- dividing each database relation into smaller fragments and treat each fragment as a separate database object
- this is for reasons of performance, availability and reliability
so to provide easy and efficient access of the DBMS we need to have fully transparency
what is second advantage of promise of distributed database
reliability through distributed transactions