All Flashcards

Question

What does WSDL stand for?

Answer 1

Web services description language

Answer 2

Protocol based on XML and used to describe and locate web services

Answer 3

Universal discovery, description and integration

Answer 4

Platform independent, XML based registry protocol for businesses to list themselves on the internet

Answer 5

A logically interrelated collection of shared data, physically distributed over a computer network

Answer 6

A model for enabling ubiquitous, convenient, on-demand network access to a shared pool of configurable computing resources that can be rapidly provisioned and released with minimal management effort or service provider interaction.

Answer 7

An action or series of actions carried out by a user or applications, which reads or updates DB contents, forming a logical unit of work on the DB

Answer 8

Atomicity  Transactions treated as single unit Consistency  Transactions only bring DB from one valid state to another Isolation  Ensures concurrent execution of trans leaves DB in same state as if they were executed sequentially  Main goal of concurrency control Durability  Guarantees transaction remains committed, even after system failure

Answer 9

* All users should be able to access the same data * A users view is immune to changes made in other views * Users should not need to know physical DB storage details * DBA should be able to change DB storage structures without affecting users’ views * Internal structure of DB should be unaffected by changes to physical aspects of storage * DBA should be able to change conceptual structure of DB without affecting users’

Answer 10

* Users view of the DB | * Describes part of the DB that is relevant to a particular user

Answer 11

* Community view of the DB | * Describes what data is stored in DB and relationships among data

Answer 12

* Physical representation of the DB on a computer | * Describes how the data is stored in the DB

Answer 13

* Immunity of external schemas to changes in conceptual schema * Conceptual schema changes * Shouldn’t require changes to external schema or rewrites of application programs

Answer 14

* Immunity of conceptual schema to changes in internal schema * Internal schema changes * Shouldn’t require changes to conceptual or external schemas

Answer 15

* Entity relationship * Schematic * Functional * Object-Oriented

Answer 16

* Relational Data Model * Network Data Model * Hierarchy Data Model

Answer 17

* Names, types and sizes of data items * Constraints on the data * Names of authorised users * Data items accessible by a user and the type of access * Usage statistics

Answer 18

* One computer with a single CPU and a number of terminals * Processing performed within the same physical computer * Terminals are ‘dumb’, incapable of functioning on their own, cabled to central computer

Answer 19

* Processing distributed about the network, typically a local area network * File server connected to several workstations across a network * DB resides on the file-server * DBMS and applications run on each workstation

Answer 20

o Significant network traffic o Copy of DBMS on each workstation o Concurrency, recovery, and integrity control more complex

Answer 21

* Tier 1: Client manages UI and runs applications | * Tier 2: Server holds DB and DBMS

Answer 22

``` o Wider access to existing DBs o Increased performance o Possible reduction in hardware costs o Reduction in communication costs o Increased consistency ```

Answer 23

Much more scalable compared to the two-tier client server system

Answer 24

o ‘Thin’ client requiring less expensive hardware o Application maintenance centralised o Easier to modify or replace one tier without affecting others o Separating business logic from DB functions makes it easier to implement load balancing o Maps quite naturally to Web environment

Answer 25

Application servers host API to expose business logic and business processes for use by other applications

Answer 26

Additional tiers provide more flexibility and scalability

Answer 27

* Shares business logic, data, and processes through a programmatic interface across a network * Developers can add web service to a web page, or to an executable program, to offer specific functionality to users * Use technologies such as: XML, SOAP, WSDL, UDDI

Answer 28

* Software system that permits the management of the distributed DB and makes the distribution transparent to users * Consists of a single logical DB split into a number of fragments, each stored on one or more computers under the control of separate DBMS, with the computers connected by a network * Each site capable of independently processing user requests that require access to local data and is also capable of processing data stored on other computers in the network

Answer 29

Consumers can obtain, configure an deploy cloud services without help from the provider

Answer 30

Accessible from anywhere, from any standardised platform

Answer 31

DaaS and DBaaS, differing only in data management

Answer 32

* Offers full DB functionality to application developers * Management layer that provides continuous monitoring and configuration of the DB * Spares developers from the ongoing DB administration tasks

Answer 33

o Optimised scaling o High availability o Multi-tenancy o Effective resource allocation in the cloud

Answer 34

* Services enable data definition in the cloud and subsequent querying * Doesn’t implement typical DBMS interfaces but instead data is accessed via common APIs * Enables organisation with valuable data to offer access to others

Answer 35

Query processor • Transforms queries into a series of low-level instructions directed to the DB manager DB Manager • Interfaces with user-submitted application programs and queries • The DM examines external and conceptual schemas to determine conceptual records to satisfy the request • DM then paces call to file manager to perform request File Manager • Manipulates underlying storage files and manages allocation of storage space on disk • Establishes and maintains list of structures and indexes defined in the internal schema DML Pre-processor • Converts DML statements into standard function calls in host language. • Must interact with query processor to generate appropriate code DDL Compiler • Converts DDL statements into a set of tables containing metadata • Tables then stored in system catalog while control information is stored in data file headers Catalog Manager • Manages access to and maintains the system catalog Authorisation Control • Confirms whether the user has the necessary permission to carry out the required operation Command Processor • Control passed to this component once user authority is confirmed Integrity Checker • Ensures requested operation satisfies all necessary integrity constraints for an operation that changes the DB Query optimiser • Determines optimal strategy for query execution Transaction manager • Performs required processing of operations received from transactions. Scheduler • Ensures concurrent operations on DB proceed without conflicting with one another • It controls the relative order in which transaction operations are executed Recovery manager • Ensures the DB remains in a consistent state in the presence of failures • It is responsible for transaction commit and abort. Buffer manager • Responsible for the transfer of data between main memory and secondary storage, such as disk and tape.

Answer 36

Transforms DB from one consistent state to another, although consistency may be violated during a transaction

Answer 37

Transaction is committed to the DB, reaching a new consistent state

Answer 38

Transaction aborts, and the DB must be restored to a consistent state before it is started via roll-back

Answer 39

Yes, provided it is rolled back

Answer 40

• Process of managing simultaneous operations on the DB without having them interfere with one another• • Prevents interference when two or more users are accessing DB simultaneously and at least one is updating data

Answer 41

Objective is to schedule transactions to avoid interference

Answer 42

Limits degree of concurrency or parallelism in system

Answer 43

Successfully completed update is overwritten by another user

Answer 44

Occurs when one transaction can see intermediate results of another transaction before it is committed

Answer 45

Occurs when transaction reads several values but second transaction updates some of them during execution of first

Answer 46

Identifies those executions of transactions guaranteed to ensure consistency and aims to find non-serial schedules that allow transactions to execute concurrently without interfering with one another

Answer 47

Sequence of reads/writes by set of concurrent transactions

Answer 48

Schedule where operations of each transaction are executed consecutively without any interleaved operations from other transactions

Answer 49

No guarantee that results of all serial executions of a given set of transactions will be identical

Answer 50

Operations from set of concurrent transactions are interleaved

Answer 51

A schedule is serialisable if it is non-serial in nature and identical to some serial schedule

Answer 52

* If two transactions only read a data item, they do not conflict and order is not important * If two transactions either read or write separate data items, they do not conflict and order is none important * If one transaction writes a data item and the other reads or writes, order of execution is important

Answer 53

Orders any conflicting operations in same way as some serial execution

Answer 54

Create a precedence graph as follows: 1. Create a node for each transaction 2. Add a directed edge from Ti to Tj if Tj reads the value of an item written by Ti 3. Add a directed edge from Ti to Tj if Tj writes the value into an item after it has been read by Ti If the precedence graph contains a cycle, then the schedule is not conflict serialisable

Answer 55

* Schedule is view serialisable if it is view equivalent to a serial schedule * It is a less stringent definition of schedule equivalence than conflict serialisability * Every conflict serialisable schedule is view serialisable, although the converse is not true * Any view serialisable schedule that is not conflict serialisable contains one or more blind writes

Answer 56

Testing if a schedule is serialisable is NPC

Answer 57

* For each data item x, if Ti reads initial value of x in S1, Ti must also read initial value of x in S2. * For each read on x by Ti in S1,if value read by Ti is written by Tj, Ti must also read value of x produced by Tj in S2. * For each data item x, if last write on x performed by Ti in S1, same transaction must perform final write on x in S2.

Answer 58

* If transactions fail, atomicity requires effects of transaction to be rolled back * Durability states that once a transaction commits, its changes cannot be undone without running another compensating transaction

Answer 59

A schedule where, for each pair of transactions Ti and Tj, if Tj reads a data item previously written by Ti, then the commit operation of Ti precedes the commit operation of Tj

Answer 60

Delay transactions in case they conflict with other transactions

Answer 61

Assume a conflict is rare and only check for conflicts at commit

Answer 62

Locks are used to deny access of other transactions, preventing incorrect updates. Common approach

Answer 63

Shared lock, used for reading data items Exclusive lock, used for writing data items

Answer 64

* If a transaction has a shared lock on item, it can read but not update said item * If transaction has exclusive lock on an item, it can both read and write said item * Reads cannot conflict, so more than one transaction can hold shared locks simultaneously * Exclusive locks gives transaction exclusive access to an item * Some systems allow transactions to upgrade read locks to exclusive locks, or downgrade exclusive to shared locks

Answer 65

Transactions release locks too soon Results in the loss of total isolation and atomicity

Answer 66

Through the addition of a protocol concerning the positioning of lock and unlock operations in every transaction

Answer 67

Transaction follows 2PL if all locking operations precede first unlock operation in the transaction Growing phase: acquires all locks but cannot release any Shrinking Phase: releases locks but cannot acquire any new locks

Answer 68

The schedule is serialisable

Answer 69

Problems can occur with interpretation of when locks can be released.

Answer 70

Occurs when transaction aborts with other transactions dependent on said transaction Results in initial transaction and all subsequent dependent ones rolling back, cascading

Answer 71

2PL, leaving the release of all locks till the end of the transaction

Answer 72

* Could treat each page of index as a data item and apply 2PL * As indexes frequently accesses, particular at higher levels, this may lead to lock contention

Answer 73

* Search path starts from root and moves down to leaves but search never moves back up tree. Thus, once a lower‐level node has been accessed, higher‐level nodes in that path will not be used again. * When new index value is inserted into a leaf, if node is not full, insertion will not cause changes to higher‐level nodes. * Only have to exclusively lock leaf node in such a case, and only exclusively lock higher‐ level nodes if node is full and has to be split

Answer 74

o Obtain shared locks on nodes starting at root and proceeding downwards along required path. o Release lock on node once lock has been obtained on the child node.

Answer 75

Conservative approach would be to obtain exclusive locks on all nodes as we descend tree to the leaf node to be modified.

Answer 76

o Obtain shared locks on all nodes as we descend to leaf node to be modified, where obtain exclusive lock. o If leaf node has to split, upgrade shared lock on parent to exclusive lock. o If this node also has to split, continue to upgrade locks at next higher level.

Answer 77

An impasse that may result when two or more transactions are each waiting for locks held by the other to be released.

Answer 78

Abort one or more of the transactions

Answer 79

So that the DBMS can restart transactions

Answer 80

Because it is unaware of transaction logic, even if it was aware of the transaction history. Possible if there is no user input or the input is not a function of the DB’s state

Answer 81

Timeouts Deadlock Prevention Deadlock Detection and Recovery

Answer 82

* Transaction requesting lock will only wait for a system-defined period of time * If lock is not granted within this period, lock request times out * DBMS assumes transaction deadlocked and it aborts, automatically restarting system

Answer 83

* DBMS looks ahead to see if deadlocks will occur and never allows it to occur * Transactions ordered using transaction timestamps: wait-die or wound-wait

Answer 84

Wait-Die:  Only an older transaction can wait for younger one  Otherwise transaction is aborted and restarted with same timestamp Wound-Wait:  Only a younger transaction can wait for an older one  If older transaction requests lock held by younger one, younger aborted

Answer 85

DBMS allows deadlock to occur but recognises it and breaks it

Answer 86

A wait-for graph is used to show transaction dependencies with a deadlock existing iff the wait-for graph contains a cycle 1. Create a node for each transaction 2. Create an edge from Ti to Tj if Tj waiting to lock an item locked by Tj

Answer 87

o The choice in deadlock victim o How far to roll a transaction back o Avoiding starvation

Answer 88

A unique identifier created by a DBMS that indicates relative starting time of a transaction

Answer 89

o Using system clock at time transaction started | o Incrementing a logical counter every time a new transaction starts

Answer 90

Transactions are ordered globally so that older transactions get priority in the event of conflict. Conflicts are resolved by rolling back and restarting the transaction

Answer 91

No, no locks are implemented during timestamping

Answer 92

* Read/write proceeded only if last update on data item was carried out by an older transaction * Otherwise, transaction requesting read/write is restarted and given a new timestamp

Answer 93

Timestamp of last transaction to read a data item

Answer 94

Timestamp of last transaction to write a data item

Answer 95

* Versioning implemented to increase concurrency * Basic timestamp ordering protocol assumes only one version of data item exists * Basic timestamping only one transaction can access data item at a time * Multi-version can allow multiple transactions to read and write different versions of the same data item * Ensures each transaction sees consistent set of versions for all data items it accesses * Each write operation creates a new version of the data item whilst retaining old version * System selects the correct version of data item when read request arrives * Versions can be deleted when they are no longer required

Answer 96

Based on the assumption that conflict is rare and more efficient to let transactions proceed without delays than ensure serialisability

Answer 97

A check is made to determine whether conflict has occurred or not If there is a conflict, transaction must be rolled back and restarted

Answer 98

Potential for greater concurrency than traditional protocols

Answer 99

Read phase • Extends from start until immediately before commit • Transaction reads values from database and stores them in local variables • Updates applied to a local copy of the data Validation phase - Read only transaction: • Checks that data read still holds current values • If no interference has occurred, transaction committed, else it is aborted and restarted - Update transaction: • Checks transaction leaves DB in a consistent state with serialisability maintained Write Phase • Updates made to local copy are applied to the DB.

Answer 100

The coarser the data item, the lower the degree of concurrency achievable The finer the data item, the more reliant the system becomes on locking Optimal size therefore determined by the type of transaction

Answer 101

* Granularity of locks can be represented in a hierarchal structure * Root node represents the entire DB * When a node is locked, all of its children are also locked * DBMS should check hierarchal path before granting lock

Answer 102

Through the process of database recovery, restoring a DB to a correct state in the event of failure

Answer 103

* Volatile storage does not survive system crashes * Stable storage represents information that has been replicated in several non-volatile storage media with independent failure modes

Answer 104

* System crashes resulting in the loss of main memory * Media failures, resulting in the loss of parts of secondary storage * Application software errors * Natural physical disasters * Carelessness or unintentional destruction of data or facilities * Sabotage

Answer 105

* Recovery manager responsible for atomicity and durability * Failure occurring between commit and DB buffers being flushed to secondary storage requires recovery manager to redo transactions to ensure durability * Transaction not being committed at failure time, recovery manager has to undo any effects of the transaction for atomicity

Answer 106

Partial undo's result in a single transaction being undone, whereas global undos result in all transactions being undone.

Answer 107

Backup mechanism: makes periodic backup copies of the DB Logging facilities: keep track of current state of transactions and DB changes Checkpoint facility: enables updates to DB in progress to be made permanent Recovery manager: allows DBMS to restore DB to consistent state following failure

Answer 108

* Contains information regarding all updates to a DB * Transaction record contains transaction identifier, type of log record, log-management information etc… * Log files may be duplexed or triplexed * Sometimes split into two separate random-access files

Answer 109

Potential bottleneck arises which is critical in determining overall performance

Answer 110

* A checkpoint is a point of synchronisation between DB and log file, with all buffers forcibly written to disk * Checkpoint record is created containing identifiers of all active transactions * When failure occurs, redo all transactions that committed since the checkpoint and undo all transactions active at time of crash

Answer 111

Restore last backup of DB and reapply updates of committed transactions using log file

Answer 112

* Need to undo changes that caused inconsistency * May also need to redo some transactions to ensure updates reach secondary storage * Do not need backup, but can restore DB using before and after images in the log file

Answer 113

* Updates not written to the DB until after a transaction has reached its commit point * If transaction fails before commit, it will not have modified DB and so no undoing of changes are required * May be necessary to redo updates of committed transaction as their effect may not have reached DB

Answer 114

* Updates applied to DB as they occur * Need to redo updates of committed transactions following a failure * May need to undo effects of transactions that had not committed at time of failure * If no transaction commit record in log, transaction was active at failure and undone * Undo operations are performed in reverse order in which they were written to the log

Answer 115

Relates to immediate update recovery technique and states that it is essential that log records are written before write to DB

Answer 116

* Two page tables during life of a transaction: the current page and the shadow page tables * Both are the same at the start of a transaction * Shadow page never changed thereafter, used to restore DB in event of failure * During transaction, current page table records all updates to the DB * When transaction completes, current page table becomes the shadow page table

Answer 117

o Data that has many types, each with small number of instances. o Designs may be very large. o Design is not static but evolves through time. o Updates are far‐reaching. o Cooperative engineering.

Answer 118

More susceptible to failure  Requiring a minimisation in the amount of work lost. May access large number of data items  Concurrency limited if data inaccessible for long periods. Deadlock more likely. Cooperative use of shared data items restricted by traditional protocols.

Answer 119

``` o Nested Transaction Model o Sagas o Multi‐level Transaction Model o Dynamic Restructuring o Workflow Models ```

Answer 120

* Transaction viewed as hierarchy of sub-transactions. * Top‐level transaction can have number of child transactions. * Each child can also have nested transactions. * Transactions have to commit from bottom upwards. * Transaction abort at a level doesn’t have to affect transaction in progress at higher level. * Parent allows to perform its own recovery * Updates of committed sub-transactions at intermediate levels are visible only within scope of their immediate parents. * Commit of sub-transaction is conditionally subject to commit or abort of its superiors. * Top‐level transactions conform to traditional ACID properties of flat transaction.

Answer 121

o Retry sub-transaction. o Ignore failure, in which case sub-transaction non‐vital. o Run contingency sub-transaction. o Abort.

Answer 122

In the nested transaction model, the proposal states that only leaf‐level sub-transactions perform database operations.

Answer 123

Modularity o Transaction can be decomposed into number of sub-transactions for purposes of concurrency and recovery Finer level granularity for concurrency control and recovery Intra-transaction parallelism Intra-transaction recovery control

Answer 124

Using savepoints

Answer 125

An identifiable point in flat transaction representing some partially consistent state

Answer 126

* During execution of transaction, user can establish savepoint * User can use this to roll transaction back

Answer 127

Unlike nested transactions, savepoints don’t support intra‐transaction parallelism.

Answer 128

RA operations work on one or more relations to define another without changing the original relations

Answer 129

Expressions are nested, just as in arithmetic

Answer 130

Works on a single relation R to define a relation that contains only tuples of R that satisfy the specified condition σpredicate (R)

Answer 131

Works on a single relation R to define a relation that contains a vertical subset of R. Extracts values of specified attributes and eliminating duplicates Πcol1,…,col n (R)

Answer 132

The union of two relations R and S defines a relation containing all the tuples of R, S, or both R and S with duplicates removed R and S must be union compatible R∪S

Answer 133

Defines a relation consisting of the tuples that are in R but not in S R and S must be union compatible R-S

Answer 134

Defines a relation consisting of the set of all tuples that are in both R and S R and S must be union-compatible R∩S

Answer 135

Defines a relation that is the concatenation of every tuple in relation R with every tuple of relation S R×S

Answer 136

Selection with Cartesian Product

Answer 137

Join operations Causes RDBMS to have intrinsic performance issues

Answer 138

Defines a relation that contains tuples satisfying the predicate F from the Cartesian product of R and S R ⋈F S

Answer 139

An Equijoin of the two relations R and S over all common attributes x One occurrence of each common attribute is eliminated from the result R⋈S

Answer 140

Used to display rows in the result that do not have matching values in the join column Left join is a join in which tuples from R that do not have matching values in common columns of S are also included in result relation R⋊S Right join is a join in which tuples from S that do not have matching values in common columns of R are also included in result relation R⋉S

Answer 141

Defines a relation that contains the tuples of R that participate in the join of R with S R ⊳F S

Answer 142

Defines a relation over the attributes C that consists of set of tuples from R that match combination of every tuple in S R÷S

Answer 143

Applies aggregate function, AL, to R to define a relation over the aggregate list SQUIGGLEY-F subscript AL (R) where AL contains one or more pairs

Answer 144

* Groups tuples of R by grouping attributes, GA * Then applies aggregate function list, AL, to define a new relation * Resulting relation contains the grouping attributes, GA, along with results of each of the aggregate functions [subscript GA SQUIGGLEY-F subscript AL] (R) where AL contains one or more pairs

Answer 145

o Heuristic rules that order operations in a query | o Comparing different strategies based on relative costs, and selecting one that minimises resource usage

Answer 146

Disk accesses

Answer 147

The activities involved in retrieving data from the DB

Answer 148

Transform query written in high-level language into correct and efficient execution strategy expressed in low-level language Execute strategy to retrieve required data

Answer 149

The activity of choosing an efficient execution strategy for processing a query

Answer 150

As there are many equivalent transformations of same high-level query, aim of QO is to choose one that minimises resource usage

Answer 151

Reduce total execution time of a query Reduce response time of query

Answer 152

o Decomposition, consisting of parsing and validation o Optimisation o Code generation o Execution

Answer 153

o Dynamically every time query is run | o Statically when query is first submitted

Answer 154

Advantages arise from fact that information is up to date Disadvantages is performance of query affected, time may limit finding optimum strategy

Answer 155

Advantages are removal of runtime overhead, and more time to find optimum strategy Disadvantages arise form fact that chosen execution strategy may no longer be optimal when query is run

Answer 156

By adopting a hybrid approach

Answer 157

Transformation of a high-level query into RA query and checks syntactic and semantic correctness

Answer 158

Analysis Normalisation Semantic Analysis Simplification Query Restructuring

Answer 159

* Analyse query lexically and syntactically using compiler techniques * Verify relations and attributes exist * Verify operations are appropriate for object type * Query then transformed into some internal representation more suitable for processing * Query tree chosen

Answer 160

* Leaf node created for each base relation * Non-leaf node created for each intermediate relation produced by RA operation * Root of tree represents query result -> Sequence is directed from leaves to root

Answer 161

Query converted into a normalised form for easier manipulation

Answer 162

o Conjunctive Normal form | o Disjunctive Normal Form

Answer 163

Following normalisation, normalised queries that are incorrectly formulated or contradictory are rejected

Answer 164

Its components do not contribute to generation of result

Answer 165

Its predicate cannot be satisfied by any tuple

Answer 166

Through the construction of a Relation Connection Graph 1. Create node for each relation and node for result 2. Create edges between two nodes that represent a join 3. Add edges between nodes that represent projection If not connected, query is incorrectly formulated

Answer 167

* Detects redundant qualifications * Eliminates common sub-expressions * Transforms query to semantically equivalent but more easily/efficiently computed form

Answer 168

Typically access restrictions, view definitions and integrity constraints are considered Assumption that user has appropriate access privileges

Answer 169

Perform selection operations as early as possible: o Keep predicates on same relation together Combine Cartesian Product with subsequent selection whose predicate represents join condition into a join operation Use Associativity of binary operations to rearrange leaf nodes so leaf nodes with most restrictive selection operations are executed first Perform projection as early as possible: o Keep projection attributes on same relation together Compute common expressions once: o If common expression appears more than once and result not too large, store result and reuse it when require o Useful when querying views, as same expression is used to construct view each time

Answer 170

If stats updated every time tuple is changed, performance would be impacted DBMS may update stats on a periodic basis or whenever system idle

Answer 171

UNDERSTAND MATHEMATICS AND RA

All Flashcards

Get Smarter Yo (196 cards)