Transaction Management/transactSQL Flashcards

Question

What is consistency?

Answer 1

- correct execution of a transaction takes the database from one consistent state to another - when transactions don't violate any constraints - database accurately reflects state of real world - if we have to abort then the database is not in a consistent state and we have to recreate the consistent state (which is the state of the database before the aborted transaction started)

Answer 2

- A serializable schedule always leaves the database in a consistent state. A serial schedule is always a serializable schedule because, in a serial schedule, a transaction only starts when the previous transaction has finished execution

Answer 3

schedules that are equivalent to serial schedules (though there are multiple definitions of serialisability)

Answer 4

- a particular transaction should see itself as the only transaction in the database - the levels of isolation refer to how isolated a transaction should be from other transactions operations (such as modifying items used in both)

Answer 5

1. Read uncommitted 2. Read committed 3. Repeatable read 4. Serialisable

Answer 6

- No isolation at all, you can read data which has not been committed

Answer 7

- every item you read must have been committed before you can see it

Answer 8

- every item you read must have been committed before you can see it - if you read the same thing twice in a transaction, you must get the same return value

Answer 9

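- Provides all the guarantees of the levels above - in addition, the outcome must be equivalent to running the transactions in some serial order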

Answer 10

- A non-serial schedule is called a serializable schedule if it can be converted to an equivalent serial schedule - You have serial schedule TA then TB, and item X = 15 after both transactions have run - if you have some of TA then TB then TA then TB and item X still = 15, this schedule is not serial but it is serialisable

Answer 11

- Once a transaction commits and changes the database, then these changes cannot be lost because of failure - the effect of a transaction on the database should not be lost after the commit point - we REDO the transaction if there are any problems after the update - durability means we can deal with media failure

Answer 12

- User/application - transaction manager - logging and recovery - concurrency control - Buffers - Storage

Answer 13

- Concurrency control - Logging and recovery

Answer 14

A - via recovery control (logging and recovery) C - via scheduler - concurrency control I - via scheduler - concurrency control D - via recovery control (logging and recovery)

Answer 15

- maintains consistency - maintains isolation

Answer 16

- more efficient in multi-user environments (transactions don't have to wait for others to finish before starting)

Answer 17

Consistency or isolation

Answer 18

- Because they can accidentally overwrite values in the buffer

Answer 19

A schedule S is serializable if there is a serial schedule S’ that has the same effect as S on every initial database state.

Answer 20

- consistency and correctness

Answer 21

- because it depends on reads, writes, commits and non-database operations - non-database operations can be complex

Answer 22

By enforcing conflict serialisability

Answer 23

- a pair of operations from different transactions that cannot be swapped without changing the behavior of at least one transaction

Answer 24

A pair of operations from different transactions that access the same item and at least one of them is a write operation
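The definition above can be written as a small predicate. This is a minimal sketch, not taken from any particular DBMS; the `Op` tuple and the `"r"`/`"w"` codes are illustrative assumptions.

```python
from typing import NamedTuple


class Op(NamedTuple):
    """One operation in a schedule (hypothetical representation)."""
    txn: int   # transaction number, e.g. 1 for T1
    kind: str  # "r" for read, "w" for write
    item: str  # database item, e.g. "X"


def conflicts(a: Op, b: Op) -> bool:
    """True if a and b come from different transactions, access the
    same item, and at least one of them is a write."""
    return a.txn != b.txn and a.item == b.item and "w" in (a.kind, b.kind)
```

For example, `conflicts(Op(1, "w", "X"), Op(2, "r", "X"))` holds, while two reads of the same item never conflict.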

Answer 25

- If it is conflict equivalent to a serial schedule

Answer 26

- two schedules S and S' are conflict-equivalent if S can be obtained from S' by swapping any number of (1) consecutive (2) non conflicting operations from (3) different transactions

Answer 27

- All serial, conflict-serialisable and serialisable schedules are concurrent schedules - All serial and conflict-serialisable schedules are serialisable - All serial schedules are conflict-serialisable

Answer 28

- we create a precedence graph with each transaction as a node and each conflict as a link between nodes - if there is a cycle within this graph then the schedule is not conflict-serialisable

Answer 29

- we create a precedence graph with each transaction as a node and each conflict is a link between nodes (only if op1 appears before op2) and we set the nodes out in order T1,T2,T3 - if there is a cycle within this graph then the schedule is not conflict-serialisable
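The precedence-graph test can be sketched as follows. This is an illustrative sketch under stated assumptions: a schedule is a list of `(txn, kind, item)` tuples in execution order, with `kind` being `"r"` or `"w"`. The function names are hypothetical.

```python
def precedence_graph(schedule):
    """Return the set of edges (Ti, Tj): some operation of Ti conflicts
    with a later operation of Tj."""
    edges = set()
    for i, (t1, k1, x1) in enumerate(schedule):
        for t2, k2, x2 in schedule[i + 1:]:
            if t1 != t2 and x1 == x2 and "w" in (k1, k2):
                edges.add((t1, t2))
    return edges


def has_cycle(edges):
    """Depth-first search for a cycle in a directed graph given as edges."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set())
    state = {}  # node -> "visiting" or "done"

    def visit(node):
        state[node] = "visiting"
        for nxt in graph[node]:
            if state.get(nxt) == "visiting":
                return True  # back edge: cycle found
            if nxt not in state and visit(nxt):
                return True
        state[node] = "done"
        return False

    return any(n not in state and visit(n) for n in graph)


def is_conflict_serialisable(schedule):
    """A schedule is conflict-serialisable iff its precedence graph is acyclic."""
    return not has_cycle(precedence_graph(schedule))
```

With r1(X), w2(X), w1(X) the graph contains both T1 -> T2 and T2 -> T1, so the schedule is rejected.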

Answer 30

- They mean that a contradiction has arisen (which could cause a deadlock)

Answer 31

- Find a transaction with no incoming edges (only outgoing edges, if any) - you put that first in the schedule and remove the transaction and its edges from the graph - you repeat this process until there are no nodes left.
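The procedure above is a topological sort. A minimal sketch, assuming the graph is given as a set of transaction ids plus a set of `(from, to)` edges; ties are broken by smallest id, which is an arbitrary choice made here for determinism.

```python
def serial_order(nodes, edges):
    """Return the transactions in an equivalent serial order,
    or None if the precedence graph contains a cycle."""
    remaining = set(nodes)
    live = set(edges)
    order = []
    while remaining:
        # Transactions with no incoming edge from a remaining transaction.
        ready = sorted(n for n in remaining if not any(b == n for _, b in live))
        if not ready:
            return None  # every remaining node has an incoming edge: cycle
        first = ready[0]
        order.append(first)
        remaining.remove(first)
        # Removing the node also removes its outgoing edges.
        live = {(a, b) for a, b in live if a != first}
    return order
```

For edges T2 -> T1 and T1 -> T3, the call `serial_order({1, 2, 3}, {(2, 1), (1, 3)})` gives the order T2, T1, T3.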

Answer 32

- The scheduler gets fed operations and it can either execute them or delay them happening.

Answer 33

Using locks with a simple locking mechanism - a transaction has to lock an item before it accesses it - locks are requested from and granted by the scheduler - each item is locked by at most one transaction - each lock must eventually be released

Answer 34

l1(X) for locking u1(x) for unlocking

Answer 35

- Every lock must be followed by an unlock - An item X must be unlocked by a transaction before being locked again by a different transaction

Answer 36

There’s only one lock so other transactions have to wait to run whilst the first transaction has locked an item it wants to use

Answer 37

- Two-phase locking (2PL) is where each transaction follows the pattern: all of its locks come before any of its unlocks - schedules produced this way are serialisable

Answer 38

- Phase one is until the first unlock, phase 2 is from that unlock to the last unlock
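Whether a single transaction obeys the two-phase pattern can be checked in one pass. A minimal sketch, assuming a transaction's locking behaviour is given as a list of `("lock", item)` and `("unlock", item)` steps (a hypothetical representation).

```python
def is_two_phase(actions):
    """True if no lock request appears after the first unlock."""
    unlocked = False
    for kind, _item in actions:
        if kind == "unlock":
            unlocked = True       # phase 2 has started
        elif kind == "lock" and unlocked:
            return False          # a lock in phase 2 violates 2PL
    return True
```

So lock X, lock Y, unlock X, unlock Y is two-phase, while lock X, unlock X, lock Y, unlock Y is not.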

Answer 39

- Because T1 locks the next item it needs before it unlocks the first item that T2 needs, so T2 can only run up to the point where it needs an item that T1 still holds - this ensures that T1's conflicting operations all occur first

Answer 40

Conflict serialisability

Answer 41

Deadlocking because two transactions may be stuck waiting for the lock that the other transaction has.

Answer 42

Solution: Different lock modes - use shared and exclusive locks

Answer 43

- multiple transactions can have this at the same time - it means the transaction has access to read the item only

Answer 44

- only one transaction can have this at a time - allows transaction to read and write to an item

Answer 45

- a shared lock is granted if no other transactions hold an exclusive lock on the item - an exclusive lock is granted if no other transaction holds a lock of any kind on the item.
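The two granting rules can be sketched as a single function. This is an illustration only; the mode codes `"S"` and `"X"` and the function name are assumptions, and upgrades are not modelled.

```python
def can_grant(requested, held_by_others):
    """Decide whether a lock can be granted on an item.

    requested: "S" (shared) or "X" (exclusive).
    held_by_others: modes currently held on the item by other transactions.
    """
    if requested == "S":
        # Shared locks are compatible with other shared locks only.
        return "X" not in held_by_others
    if requested == "X":
        # Exclusive locks are compatible with nothing.
        return len(held_by_others) == 0
    raise ValueError(f"unknown lock mode: {requested!r}")
```

For example, a shared request succeeds when others hold `["S", "S"]`, but an exclusive request succeeds only when the list is empty.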

Answer 46

sl1(X) - shared lock xl1(X) - exclusive lock

Answer 47

The shared lock gets upgraded to an exclusive lock

Answer 48

- When a transaction first takes a shared lock on an item X and only later, when it needs to write, upgrades it to an exclusive lock on X, in order to be friendly to other transactions

Answer 49

- they are requested by transactions to read (not write) an item - may be upgraded later to an exclusive lock (at that point no other shared locks can be upgraded) - This is granted to at most one transaction at a time - Helps to avoid deadlocking

Answer 50

Because another transaction holds an update lock

Answer 51

- May lock relations (whole tables) - May lock disk blocks - May lock tuples (individual rows)

Answer 52

- less concurrency which may cause unnecessary delays

Answer 53

- high overhead (need to keep track of all the locked items)

Answer 54

- one transaction having a shared lock on a tuple - another transaction having an exclusive lock for the relation

Answer 55

If a transaction wants to lock something it has to put intention locks on super items (higher levels that contain item x) so conflict serialisability is ensured - to get access to an item you need to have the intention locks on the levels above

Answer 56

- IS1(x) = intention to request a shared lock on a sub item - IX1(x) = intention to request an exclusive lock on a sub item

Answer 57

- Errors whilst executing transactions - deadlocking - Explicit request

Answer 58

- Media failures: the storage medium itself is damaged, e.g. the read/write head of a hard disk drive crashes - Catastrophic events: need to store databases/copies of them in physically different locations and hope that not all of the locations get physically damaged - System failures: power failures etc., information about the active state of the database is lost

Answer 59

- writing activities into a log so that a desired database state can be recovered later

Answer 60

- starts of transactions, commits, aborts - modification of database items

Answer 61

- REDO logging - UNDO logging - Combinations of the two REDO/UNDO logging

Answer 62

On the hard disk = secondary storage

Answer 63

It means stored in RAM

Answer 64

In main memory = RAM

Answer 65

- logs activities with the goal of restoring a previous consistent database state - maintains atomicity

Answer 66

- <START T>: Transaction T has started - <COMMIT T>: Transaction T has committed - <ABORT T>: Transaction T was aborted - <T, X, v>: Transaction T has updated the value of database item X, the old value of X was v

Answer 67

1: If T1 updates database item X and the old value was v, then <T1, X, v> must be written to the log on disk before X is written to disk 2: If T1 commits, then <COMMIT T1> must be written to disk as soon as all database elements changed by T1 have been written to disk

Answer 68

- read/write/lock/unlock operations - write values to go into log in buffer - flush previous values of items to log file on disk - write changes of item values to disk - once done, write COMMIT T to buffer in main memory - flush COMMIT T to log file on disk

Answer 69

- If T has committed successfully then no recovery process is needed - if T has not committed before the failure then we must undo all the updates to the database items that were written to disk (we can do this by looking at what the old value of the item was in the log), we undo up to the last START T or COMMIT T

Answer 70

- if an error occurs, the recovery manager restores the last consistent database state - it traverses the log backward to do this.
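The backward traversal can be sketched as below. This is a simplified illustration, not a production recovery manager. It assumes log records are tuples `("START", T)`, `("COMMIT", T)`, `("ABORT", T)` and `("UPDATE", T, X, old_value)`, that the disk is a plain dict, and that previously aborted transactions have already been undone.

```python
def undo_recover(log, disk):
    """Restore old values for every transaction without a COMMIT or
    ABORT record, scanning the log from the last record to the first."""
    finished = set()   # transactions with a COMMIT or ABORT record
    to_abort = []      # unfinished transactions, in the order encountered
    for record in reversed(log):
        kind, txn = record[0], record[1]
        if kind in ("COMMIT", "ABORT"):
            finished.add(txn)
        elif txn not in finished:
            if kind == "UPDATE":
                _, _, item, old_value = record
                disk[item] = old_value  # undo the change
            if txn not in to_abort:
                to_abort.append(txn)
    # Record that the unfinished transactions have now been rolled back.
    for txn in to_abort:
        log.append(("ABORT", txn))
    return disk
```

Scanning backward matters: if an unfinished transaction updated the same item twice, the earliest logged old value is the one that ends up on disk.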

Answer 71

- We write an <ABORT T> record to the log for each uncommitted transaction T that was not previously aborted and call flush log

Answer 72

There should be 3 statements

Answer 73

- Use the UNDO log like before to recover all the changes that were made - but we only change the values for one specific transaction not all of them.

Answer 74

- logs activities with the goal of restoring committed transactions - it ignores incomplete transactions - maintains durability - log stores the same commands, except that <T, X, v> stores the new value of the item, not the old one

Answer 75

- <START T>: Transaction T has started - <COMMIT T>: Transaction T has committed - <ABORT T>: Transaction T was aborted - <T, X, v>: Transaction T has updated the value of database item X, the new value of X is v

Answer 76

To restore things for transactions that have finished, even if they haven’t been changed on the disk yet

Answer 77

1. T1 writes all log records for all updates of items to log on disk 2. T1 writes <COMMIT T1> to log on disk 3. T1 writes all committed updates to database on disk

Answer 78

- In UNDO logging we can output items values before they have been committed - In REDO logging items have to have been committed before we can output them

Answer 79

- read/write/lock/unlock operations - write new values to go into log in buffer - once done, write COMMIT T to buffer in main memory - flush new values of items stored in buffer to log file on disk - flush COMMIT T to log file on disk - write changes of item values to disk for committed items only

Answer 80

- Need to check whether <COMMIT T> has been written to the log file on disk or if it's only in the log in the buffer in main memory - if <COMMIT T> has been written to the log file on disk then we know all of T's update records have been written to the log on disk, so T can be redone - if <COMMIT T> is only in the log in the buffer then T hasn't written anything to the database on disk

Answer 81

- traverse the log from first to last item - if we see <T, X, v> and T has a <COMMIT T> log record then change the value of X on the disk to v - for each incomplete transaction T, write <ABORT T> into the log on disk
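The forward traversal can be sketched as below. This is a simplified illustration under stated assumptions: log records are tuples `("START", T)`, `("COMMIT", T)`, `("ABORT", T)` and `("UPDATE", T, X, new_value)`, and the disk is a plain dict.

```python
def redo_recover(log, disk):
    """Reapply the updates of committed transactions in log order and
    mark incomplete transactions as aborted."""
    committed = {r[1] for r in log if r[0] == "COMMIT"}
    aborted = {r[1] for r in log if r[0] == "ABORT"}
    started = [r[1] for r in log if r[0] == "START"]
    for record in list(log):
        if record[0] == "UPDATE" and record[1] in committed:
            _, _, item, new_value = record
            disk[item] = new_value  # redo the committed change
    for txn in started:
        if txn not in committed and txn not in aborted:
            log.append(("ABORT", txn))  # nothing of txn reached the disk
    return disk
```

Updates of uncommitted transactions are skipped entirely, because under redo logging none of their changes can have reached the database on disk.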

Answer 82

- So we write <ABORT T> at the end of a transaction that hasn't committed - we don't need to take any further actions since none of the changes were written to the disk yet.

Answer 83

- does a combination of undo and redo logging - this way it maintains atomicity and durability - instead of writing <T, X, v> to the log file it writes <T, X, v, w> where v is the old value and w is the new value of X

Answer 84

- Because it stores enough information that we can do undo logging and redo logging and therefore we can do both things when we need to - so it doesn't matter when you do the COMMIT T command relative to the disk writes. This means you're more free to do things in the order that you like. - For example, you can pick an order that makes it faster: it's better to make bigger updates on the disk, and because you're allowed to write whenever you want, you can batch changes into bigger updates.

Answer 85

- Redo logging, but we need to be careful only to use the last committed value instead of the last value - because we can do these updates to the disk whenever and not have to worry about - advantage of doing this later is that you can do bigger updates at the same time which is faster on a real hard disk.

Answer 86

1. write all log records for all updates of item values to buffer in main memory 2. write all updates to disk 3. <COMMIT T> can be written to disk before or after all the changes to the database have been written to the disk depending on what the DBMS prioritises

Answer 87

- It ensures atomicity (as well as durability) - it means that uncommitted data may not overwrite committed data on disk

Answer 88

- the log is periodically checkpointed - so every t mins we put a checkpoint in the log - we don't need to undo transactions before the checkpoint

Answer 89

1. Stop accepting new transactions 2. Wait until all active transactions finish and have written a <COMMIT T> or <ABORT T> record to log in buffer 3. Flush log updates to hard disk 4. Write a <CHECKPOINT> log record 5. Flush the log to disk again (with checkpoint command) 6. Resume accepting transactions

Answer 90

does undo/redo logging but transactions do not write to buffers until they are sure they want to commit

Answer 91

- write to the log in the buffer and then flush it to the hard disk

Answer 92

- process the undo/redo log as before - only redo (part of) the committed transactions in T1,T2... after - undo all of the uncommitted transactions that come before - as the ones before will have been written to the disk and the ones after won't have

Answer 93

- equivalent to serial schedules - ensures consistency and correctness - enforced by 2PL

Answer 94

- ensure consistency as we can recover database states - robust, works even after system failures - enforced using undo logging or redo logging or undo/redo logging or simple checkpointing with undo logs or ARIES checkpointing with undo/redo logs

Answer 95

- when the isolation property is not fully enforced by setting isolation level to READ UNCOMMITTED - gains more parallelism by executing some transactions that would have to wait to prevent dirty reads - however dirty reads can slow down the system when transactions have to abort

Answer 96

- if transaction T aborts, find all the transactions that have read items that were written by T - recursively abort all transactions that have read items written by an aborted transaction
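The recursive rule can be sketched as a fixed-point computation. This is a simplified illustration: a schedule is assumed to be a list of `(txn, kind, item)` tuples with `kind` being `"r"` or `"w"`, a read is taken to see the most recent write of the item, and commits are ignored.

```python
def cascading_aborts(schedule, aborted):
    """Return every transaction that must abort once `aborted` aborts."""
    to_abort = {aborted}
    changed = True
    while changed:  # repeat until no new transaction is added
        changed = False
        last_writer = {}
        for txn, kind, item in schedule:
            if kind == "w":
                last_writer[item] = txn
            elif kind == "r":
                writer = last_writer.get(item)
                # txn read a value written by a transaction that aborts.
                if writer in to_abort and txn not in to_abort:
                    to_abort.add(txn)
                    changed = True
    return to_abort
```

If T2 reads X written by T1 and T3 reads Y written by T2, aborting T1 drags both T2 and T3 down with it.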

Answer 97

- if we do not abort all the transactions that interacted with aborted transaction T, then we can risk breaking consistency and isolation - if we do abort them all we can break durability

Answer 98

- if a transaction T1 commits and has read an item X that was written before by a different transaction T2 - then T2 must commit before T1 commits (T1 must delay its commit)

Answer 99

- Only active transactions can be forced to abort - so transactions that have committed before T1 aborts won't have to abort

Answer 100

recoverable schedules have to be in a specific order - that specifies R2(X), C1, C2 meaning that this is not serialisable because it's not equivalent to a serial schedule

Answer 101

that all log records have to reach the hard disk in the order in which they were written (if they don't this could mean a recoverable schedule becomes non-recoverable)

Answer 102

- If each transaction in it reads only values that were written by transactions that have already committed
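This condition can be checked in a single pass over the schedule. A minimal sketch, assuming operations are `(txn, kind, item)` tuples where `kind` is `"r"`, `"w"` or `"c"` (commit, with `item` set to `None`); aborts are not modelled, and a transaction reading its own write is allowed.

```python
def is_cascadeless(schedule):
    """True if every read sees a value whose writer has already committed
    (or the item's initial value, or the reader's own write)."""
    committed = set()
    last_writer = {}
    for txn, kind, item in schedule:
        if kind == "c":
            committed.add(txn)
        elif kind == "w":
            last_writer[item] = txn
        elif kind == "r":
            writer = last_writer.get(item)
            if writer is not None and writer != txn and writer not in committed:
                return False  # dirty read
    return True
```

w1(X), c1, r2(X), c2 passes the check; moving r2(X) before c1 makes it fail.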

Answer 103

- No reading of dirty data - no cascading rollbacks

Answer 104

- non-serialisable, because they have to be in a certain order, the reads of other transactions have to be after others commit, so the order matters so they're not serialisable - they are recoverable because the item doing a dirty read is doing it from a transaction that has already committed, therefore it commits after the first transaction

Answer 105

A schedule where each transaction in it reads and writes only values that were written by transactions that have already committed

Answer 106

- Cascadeless - serialisability - recoverable

Answer 107

- variant of 2PL - a transaction T must not release any lock (that allows T to write data) until T has committed or aborted and the commit/abort log record has been written to disk

Answer 108

- conflict serialisability - strict schedules

Answer 109

- in 2PL the commit command comes after the unlocks - in strict 2PL the commit comes before the unlocks - this is so that no other transactions can read/write to uncommitted data

Answer 110

- All strict and 2PL strict schedules are serialisable and conflict serialisable - some cascadeless and recoverable schedules are serialisable and conflict serialisable

Answer 111

Deadlocking

Answer 112

- Detect deadlocks and fix them (rollback/ restart transactions) - Enforce deadlock-free schedules

Answer 113

- Timeouts: assume a transaction is in deadlock if it exceeds a given time limit - waits-for-graphs: shows transactions and dependencies, cycles mean deadlocks - Timestamp-based

Answer 114

- each transaction T is assigned a unique integer TS(T) upon arrival at the scheduler - if T1 arrived earlier than T2, we require the TS(T1) < TS(T2) - Time stamps do not change even after restart - if transactions arrive at the same time they still get different timestamps

Answer 115

- we use time stamps to decide which transactions can wait longer (for a lock/or to start) - we want to prevent cyclic dependencies because as we saw before this creates deadlocking

Answer 116

(“older transactions always wait for unlocks”) - If T1 is older than T2 and requests an item held by T2, then it waits for T2 to be finished with it - If T2 is younger than T1 and requests an item held by T1, it dies (rolls back and starts again) - this makes sense because if a transaction is younger it means it's done less work, so it is less of a waste to restart

Answer 117

(“older transactions never wait for unlocks”) - if T1 is older than T2 and requests an item from T2, T2 is immediately rolled back unless it has committed - if T2 is younger than T1 and requests an item from T1, it's allowed to wait for T1 to unlock the item
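The two schemes differ only in what happens on a lock conflict. A minimal sketch, assuming smaller timestamps mean older transactions; the function names and return strings are hypothetical labels.

```python
def wait_die(requester_ts, holder_ts):
    """Wait-die: an older requester waits, a younger requester dies."""
    return "wait" if requester_ts < holder_ts else "die"


def wound_wait(requester_ts, holder_ts):
    """Wound-wait: an older requester wounds (rolls back) the holder,
    a younger requester waits."""
    return "wound holder" if requester_ts < holder_ts else "wait"
```

In both functions the transaction that gets rolled back is always the younger of the two, which is why the older transaction never has to restart.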

Answer 118

In both methods, the older transaction never dies or rolls back

Answer 119

- At all times the oldest transaction keeps running - hence when that finishes the same occurs for the transaction that arrived directly after it

Answer 120

wait-to-die: dies of timeout wound-wait: dies because it's holding a lock the older one needs

Answer 121

- schedule transactions as if they are executing each transaction instantly - we should think of transactions as being completely isolated so each transaction starts and finishes before the next - if two arrive at the scheduler at the same time we just randomly pick one to start first - equivalent to serial schedules

Answer 122

- Here each transaction is assigned a new time stamp number whenever it restarts - this means transactions only go in reverse consecutive order - holds information about the last transaction to read from and write to an item

Answer 123

advantages: conflict-serialisable schedules, no deadlocks disadvantages: cascading roll backs, starvation, many restarts

Answer 124

- we delay read or write requests until the youngest transaction that wrote X has committed or aborted - Just block the transaction until the earlier one has finished - So we won't get deadlocks because the oldest one can always move forwards using this principle

Answer 125

- A more advanced version of timestamping - just have multiple versions of each item in your database - So you don't overwrite the values in an item, you just write a new item for each new timestamp. - you can just discard the old items when they stop being relevant - we only need to restart transactions if you try to write and the RT is later than the write time stamp - Before we had to restart if either the read or write stamp was too young, now we only have to restart if the read timestamp is too young.