Section Four: Exchanging data Flashcards

Question

Chapter 16 – Database concepts Foreign key

Answer 1

A foreign key is an attribute that creates a join between two tables. It is the attribute that is common to both tables: the primary key in one table is the foreign key in the table to which it is linked. Example: in the one-to-many relationship between Dentist and Patient, the entity on the ‘many’ side of the relationship (Patient) will have DentistID as an extra attribute.

Answer 2

When there is a many-to-many relationship between two entities, tables cannot be directly linked in this way. For example, consider the relationship between Student and Course. A student takes many courses, and the same course is taken by many students. In this case, an extra table is needed to link the Student and Course tables. We could call this StudentCourse, or Enrolment, for example. The three tables will now have attributes something like those shown below:

Student (StudentID, Name, Address)
Enrolment (StudentID, CourseID)
Course (CourseID, Subject, Level)

Answer 3

In this data model, the table linking Student and Course has two foreign keys, each linking to one of the two main tables. The two foreign keys also act as the primary key of this table. A primary key which consists of more than one attribute is called a composite primary key.
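
The Student, Enrolment and Course model can be tried out with a short script. This is a minimal sketch using Python's built-in sqlite3 module; the sample rows and the column types are assumptions made for illustration only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Student (StudentID INTEGER PRIMARY KEY, Name TEXT, Address TEXT);
    CREATE TABLE Course  (CourseID TEXT PRIMARY KEY, Subject TEXT, Level TEXT);
    -- The link table: two foreign keys that together form a composite primary key
    CREATE TABLE Enrolment (
        StudentID INTEGER NOT NULL,
        CourseID  TEXT NOT NULL,
        PRIMARY KEY (StudentID, CourseID),
        FOREIGN KEY (StudentID) REFERENCES Student(StudentID),
        FOREIGN KEY (CourseID)  REFERENCES Course(CourseID)
    );
    -- Hypothetical sample data
    INSERT INTO Student VALUES (1, 'Ada', '1 High Street');
    INSERT INTO Course VALUES ('CS1', 'Computing', 'A');
    INSERT INTO Course VALUES ('MA1', 'Maths', 'A');
    INSERT INTO Enrolment VALUES (1, 'CS1');
    INSERT INTO Enrolment VALUES (1, 'MA1');
""")

# The same student may take several courses, but the same
# (StudentID, CourseID) pair cannot be recorded twice.
try:
    conn.execute("INSERT INTO Enrolment VALUES (1, 'CS1')")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

enrolment_count = conn.execute("SELECT COUNT(*) FROM Enrolment").fetchone()[0]
print(duplicate_rejected, enrolment_count)
```

The composite key allows each student to appear many times and each course to appear many times, while each pairing appears only once.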

Answer 4

A database system will frequently involve many different entities linked to each other, and an entity relationship diagram can be drawn to show all the relationships.

Answer 5

When tables are linked in a relational database, it is important to ensure that, for example, a particular component is not deleted if it is used in a product in the Product table. This is known as referential integrity.
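
A small sketch of referential integrity in action, using Python's sqlite3 module. The table layout is a simplified assumption (one row per product and component pairing), component ZZ99 is hypothetical, and SQLite only enforces foreign keys once `PRAGMA foreign_keys = ON` has been issued.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves enforcement off by default
conn.executescript("""
    CREATE TABLE Component (ComponentID TEXT PRIMARY KEY);
    CREATE TABLE Product (
        ProductID   INTEGER NOT NULL,
        ComponentID TEXT NOT NULL,
        PRIMARY KEY (ProductID, ComponentID),
        FOREIGN KEY (ComponentID) REFERENCES Component(ComponentID)
    );
    INSERT INTO Component VALUES ('ST01');
    INSERT INTO Component VALUES ('ZZ99');   -- hypothetical, not used in any product
    INSERT INTO Product VALUES (123, 'ST01');
""")

def try_delete(component_id):
    """Return True if the component was deleted, False if the DBMS refused."""
    try:
        conn.execute("DELETE FROM Component WHERE ComponentID = ?", (component_id,))
        return True
    except sqlite3.IntegrityError:
        return False

used_deleted = try_delete("ST01")    # still referenced by product 123
unused_deleted = try_delete("ZZ99")  # not referenced anywhere
print(used_deleted, unused_deleted)
```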

Answer 6

In order that a record with a particular primary key can be quickly located in a database, an index of primary keys will be automatically maintained by the database software, giving the position of each record according to its primary key.

Answer 7

A relational database is a collection of tables in which relationships are modelled by shared attributes.

Answer 8

Tables may be linked through the use of a common attribute. This attribute must be a primary key of one of the tables, and is known as a foreign key in the second table.

Answer 9

Normalisation is a process used to come up with the best possible design for a relational database. Tables should be organised in such a way that:

- no data is unnecessarily duplicated (i.e. the same data item held in more than one table)
- data is consistent throughout the database (e.g. a customer is not recorded as having different addresses in different tables of the database). Consistency should be an automatic consequence of not holding any duplicated data. This means that anomalies will not arise when data is inserted, amended or deleted.
- the structure of each table is flexible enough to allow you to enter as many or as few items (for example, components making up a product) as required
- the structure should enable a user to make all kinds of complex queries relating data from different tables

There are three basic stages of normalisation known as first, second and third normal form.

Answer 10

A table is in first normal form (1NF) if it contains no repeating attribute or groups of attributes. As the first stage in normalisation, we look for repeating groups of attributes in the table. For example, in a table that lists the components of each product, ProductID 123 has three components with IDs ST01, G56 and FF77. We need to split the data into two tables to get rid of the repeating groups.
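
The split into two tables can be sketched in plain Python. The product names and the second product are hypothetical; only ProductID 123 and its three component IDs come from the example above.

```python
# Un-normalised data: each product carries a repeating group of components
unnormalised = [
    {"ProductID": 123, "ProductName": "Desk lamp", "Components": ["ST01", "G56", "FF77"]},
    {"ProductID": 124, "ProductName": "Wall light", "Components": ["G56"]},
]

# Table 1: one row per product, with no repeating group
product_rows = [(r["ProductID"], r["ProductName"]) for r in unnormalised]

# Table 2: one row per (product, component) pair
product_component_rows = [
    (r["ProductID"], component)
    for r in unnormalised
    for component in r["Components"]
]

print(product_rows)
print(product_component_rows)
```

Every row in both tables now holds a single value per attribute, so a product can have as many or as few components as required.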

Answer 11

A table is in second normal form (2NF) if it is in first normal form and contains no partial dependencies. A partial dependency would mean that one or more of the attributes depends on only part of the primary key, which can only occur if the primary key is a composite key.

Answer 12

A table is in third normal form (3NF) if it is in second normal form and contains no ‘non-key dependencies’. A non-key dependency is one where the value of an attribute is determined by the value of another attribute which is not part of the key.

Answer 13

A normalised database has major advantages over an un-normalised one in the following areas:

- No data redundancy
- Maintaining and modifying the database
- Faster sorting and searching
- Deleting records

Answer 14

One of the aims of normalising a database design is to remove the possibility of redundant data from any of the tables. Redundant data is data that is unnecessarily held in more than one database table.

Answer 15

Data integrity is maintained since there is no unnecessary duplication of data. For example, a customer with a particular customer ID will have their personal details stored only once. If the customer changes address, the update needs only to be made to a single table, so there is no possibility of inconsistencies arising with different addresses for the customer being held on different files.

Answer 16

Normalisation will produce smaller tables with fewer fields. This results in faster searching, sorting and indexing operations as there is less data involved. A further advantage is that holding data only once saves storage space.

Answer 17

A normalised database with correctly defined relationships between tables will not allow records in a table on the ‘one’ side of a one-to-many relationship to be deleted accidentally. For example, a customer who still has unresolved transactions on file, such as an unpaid invoice, cannot be deleted.

Answer 18

SQL, or Structured Query Language (pronounced either as S-Q-L or Sequel), is a declarative language used for querying and updating tables in a relational database. It can also be used to create tables. In this chapter, we will look at SQL statements used in querying a database.

Answer 19

The SELECT statement is used to extract a collection of fields from a given table. The basic syntax of this statement is:

SELECT    list the fields to be displayed
FROM      list the table or tables the data will come from
WHERE     list the search criteria
ORDER BY  list the fields that the results are to be sorted on (default is ascending order)

Answer 20

=                Equal to
>                Greater than
<                Less than
!=               Not equal to
>=               Greater than or equal to
<=               Less than or equal to
IN               Equal to a value within a set of values
LIKE             Similar to
BETWEEN ... AND  Within a range, including the two values which define the limits
IS NULL          Field does not contain a value
AND              Both expressions must be true for the entire expression to be judged true
OR               If either or both of the expressions are true, the entire expression is judged true
NOT              Inverts truth
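
A few of these operators can be tried against a small table. This is an illustrative sketch using Python's sqlite3 module; the CD table, its Price column and all of the rows are assumptions, and the helper `titles` is hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE CD (CDTitle TEXT, Price REAL, DatePublished TEXT)")
conn.executemany(
    "INSERT INTO CD VALUES (?, ?, ?)",
    [
        ("Abbey", 5.0, "2014-03-01"),
        ("Blue", 10.0, "2016-07-15"),
        ("Born", 12.5, None),          # no publication date recorded
        ("Kind", 7.5, "2013-01-20"),
    ],
)

def titles(condition):
    """Run a fixed, trusted WHERE clause and return the matching titles in order."""
    sql = "SELECT CDTitle FROM CD WHERE " + condition + " ORDER BY CDTitle"
    return [row[0] for row in conn.execute(sql)]

in_set = titles("CDTitle IN ('Blue', 'Kind')")
like_b = titles("CDTitle LIKE 'B%'")              # % matches any run of characters
in_range = titles("Price BETWEEN 5 AND 10")       # both limits are included
no_date = titles("DatePublished IS NULL")
combined = titles("NOT (Price > 7.5) AND CDTitle != 'Abbey'")
print(in_set, like_b, in_range, no_date, combined)
```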

Answer 21

ORDER BY gives you control over the order in which records appear in the Answer table. If for example you want the records to be displayed in ascending order of CDTitle and within that, descending order of DatePublished, you would write, for example:

SELECT *
FROM CD
WHERE DatePublished < #01/01/2015#
ORDER BY CDTitle, DatePublished DESC
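
The effect of sorting on two fields can be checked with a short script. This sketch uses Python's sqlite3 module with hypothetical rows; SQLite has no #...# date literal, so the dates are assumed to be stored as ISO-format text (YYYY-MM-DD), which sorts correctly as a string.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE CD (CDTitle TEXT, DatePublished TEXT)")
conn.executemany(
    "INSERT INTO CD VALUES (?, ?)",
    [
        ("Blue", "2010-05-01"),
        ("Abbey", "2012-01-01"),
        ("Blue", "2014-09-09"),
        ("Abbey", "2016-02-02"),   # excluded by the WHERE clause
    ],
)

ordered = conn.execute(
    """
    SELECT CDTitle, DatePublished
    FROM CD
    WHERE DatePublished < '2015-01-01'
    ORDER BY CDTitle, DatePublished DESC
    """
).fetchall()
print(ordered)
```

Titles appear in ascending order, and the two Blue records appear with the most recent date first.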

Answer 22

Using SQL you can combine data from two or more tables, by specifying which table the data is held in. For example, suppose you wanted SongTitle, ArtistName and MusicType for all Art Pop music. When more than one table is involved, SQL uses the syntax tablename.fieldname. (The table name is optional unless the field name appears in more than one table.)

Answer 23

JOIN provides an alternative method of combining rows from two or more tables, based on a common field between them. The query described above could be written as follows:

SELECT Song.SongTitle, Artist.ArtistName, Song.MusicType
FROM Song
JOIN Artist ON Song.ArtistID = Artist.ArtistID
WHERE Song.MusicType = "Art Pop"
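
The two ways of combining the Song and Artist tables can be compared side by side. This is a sketch using Python's sqlite3 module; the table layouts and all rows are assumptions, and the string literal is written with single quotes, which is the standard SQL form.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Artist (ArtistID INTEGER PRIMARY KEY, ArtistName TEXT);
    CREATE TABLE Song (
        SongID INTEGER PRIMARY KEY,
        SongTitle TEXT,
        ArtistID INTEGER REFERENCES Artist(ArtistID),
        MusicType TEXT
    );
    -- Hypothetical sample data
    INSERT INTO Artist VALUES (1, 'Artist A');
    INSERT INTO Artist VALUES (2, 'Artist B');
    INSERT INTO Song VALUES (1, 'Song One', 1, 'Art Pop');
    INSERT INTO Song VALUES (2, 'Song Two', 1, 'Rock');
    INSERT INTO Song VALUES (3, 'Song Three', 2, 'Art Pop');
""")

# Form 1: list both tables and match the common field in the WHERE clause
where_form = conn.execute("""
    SELECT Song.SongTitle, Artist.ArtistName, Song.MusicType
    FROM Song, Artist
    WHERE Song.ArtistID = Artist.ArtistID
      AND Song.MusicType = 'Art Pop'
    ORDER BY Song.SongTitle
""").fetchall()

# Form 2: the same result using JOIN ... ON
join_form = conn.execute("""
    SELECT Song.SongTitle, Artist.ArtistName, Song.MusicType
    FROM Song
    JOIN Artist ON Song.ArtistID = Artist.ArtistID
    WHERE Song.MusicType = 'Art Pop'
    ORDER BY Song.SongTitle
""").fetchall()
print(join_form)
```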

Answer 24

Use SQL to create a table named Employee, which has four columns: EmpID (a compulsory integer field which is the primary key), EmpName (a compulsory character field of maximum length 20), HireDate (an optional date field) and Salary (an optional currency field).

CREATE TABLE Employee
(
    EmpID INTEGER NOT NULL PRIMARY KEY,
    EmpName VARCHAR(20) NOT NULL,
    HireDate DATE,
    Salary CURRENCY
)
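
The statement can be run as a quick check in SQLite through Python's sqlite3 module. This is only a sketch: SQLite accepts type names such as VARCHAR(20) and CURRENCY but does not enforce the length or apply currency formatting, and the employee name is hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE Employee
    (
        EmpID INTEGER NOT NULL PRIMARY KEY,
        EmpName VARCHAR(20) NOT NULL,
        HireDate DATE,
        Salary CURRENCY
    )
""")

# Optional fields may be left out; they are stored as NULL
conn.execute("INSERT INTO Employee (EmpID, EmpName) VALUES (1, 'Patel')")

# A compulsory (NOT NULL) field cannot be left empty
try:
    conn.execute("INSERT INTO Employee (EmpID, EmpName) VALUES (2, NULL)")
    null_name_rejected = False
except sqlite3.IntegrityError:
    null_name_rejected = True

employees = conn.execute(
    "SELECT EmpID, EmpName, HireDate, Salary FROM Employee"
).fetchall()
print(null_name_rejected, employees)
```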

Answer 25

CHAR(n)       Character string of fixed length n
VARCHAR(n)    Character string of variable length, max. n
BOOLEAN       TRUE or FALSE
INTEGER, INT  Integer (whole number)
FLOAT         Number with a floating decimal point
DATE          Stores Day, Month, Year values
TIME          Stores Hour, Minute, Second values
CURRENCY      Formats numbers in the currency used in your region

Answer 26

The ALTER TABLE statement is used to add, delete or modify columns (i.e. fields) in an existing table.

To add a column (field):

ALTER TABLE Employee
ADD Department VARCHAR(10)

To delete a column:

ALTER TABLE Employee
DROP COLUMN HireDate

To change the data type of a column:

ALTER TABLE Employee
MODIFY COLUMN EmpName VARCHAR(30) NOT NULL
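
Adding a column can be demonstrated in SQLite through Python's sqlite3 module. This sketch covers only the ADD form: SQLite does not support MODIFY COLUMN, so that statement would need a different DBMS. The employee row is hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE Employee
    (
        EmpID INTEGER NOT NULL PRIMARY KEY,
        EmpName VARCHAR(20) NOT NULL,
        HireDate DATE,
        Salary CURRENCY
    )
""")
conn.execute("INSERT INTO Employee (EmpID, EmpName) VALUES (1, 'Patel')")

# Add the new column to the existing table
conn.execute("ALTER TABLE Employee ADD Department VARCHAR(10)")

# PRAGMA table_info returns one row per column; index 1 is the column name
columns = [row[1] for row in conn.execute("PRAGMA table_info(Employee)")]

# Existing records have no value for the new column yet
department = conn.execute(
    "SELECT Department FROM Employee WHERE EmpID = 1"
).fetchone()[0]
print(columns, department)
```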

Answer 27

Suppose that an extra table is to be added to the Employee database which lists the training courses offered by the company. A third table shows which date an employee attended a particular course.

The structure of the Employee table is:

EmpID        Integer (primary key)
EmpName      30 characters maximum
HireDate     Date
Salary       Currency
Department   30 characters maximum

The structure of the Course table is:

CourseID     6 characters, fixed length (primary key)
CourseTitle  30 characters maximum (must be entered)
OnSite       Boolean

The structure of the CourseAttendance table is:

CourseID     6 characters, fixed length (foreign key)
EmpID        Integer (foreign key)
             CourseID and EmpID form a composite primary key
CourseDate   Date (note that the same course may be run several times on different dates)

The CourseAttendance table is created using the SQL statement:

CREATE TABLE CourseAttendance
(
    CourseID CHARACTER(6) NOT NULL,
    EmpID INTEGER NOT NULL,
    CourseDate DATE,
    FOREIGN KEY (CourseID) REFERENCES Course(CourseID),
    FOREIGN KEY (EmpID) REFERENCES Employee(EmpID),
    PRIMARY KEY (CourseID, EmpID)
)

Answer 28

The INSERT INTO statement is used to insert a new record in a database table. The syntax is:

INSERT INTO tableName (column1, column2, ...)
VALUES (value1, value2, ...)

Answer 29

The UPDATE statement is used to update a record in a database table. The syntax is:

UPDATE tableName
SET column1 = value1, column2 = value2, ...
WHERE columnX = value

Answer 30

The DELETE statement is used to delete a record from a database table. The syntax is:

DELETE FROM tableName
WHERE columnX = value
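
The INSERT INTO, UPDATE and DELETE statements can be seen working together in a short script. This is a sketch using Python's sqlite3 module with a simplified Employee table and hypothetical rows; the ? placeholders are sqlite3's way of passing values safely into a statement.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Employee (EmpID INTEGER PRIMARY KEY, EmpName TEXT NOT NULL, Salary REAL)"
)

# INSERT INTO: add two new records
conn.execute(
    "INSERT INTO Employee (EmpID, EmpName, Salary) VALUES (?, ?, ?)",
    (1, "Patel", 30000.0),
)
conn.execute(
    "INSERT INTO Employee (EmpID, EmpName, Salary) VALUES (?, ?, ?)",
    (2, "Okoro", 28000.0),
)

# UPDATE: change one field of the record that matches the WHERE clause
conn.execute("UPDATE Employee SET Salary = ? WHERE EmpID = ?", (31000.0, 1))

# DELETE: remove the record that matches the WHERE clause
conn.execute("DELETE FROM Employee WHERE EmpID = ?", (2,))

rows = conn.execute("SELECT EmpID, EmpName, Salary FROM Employee").fetchall()
print(rows)
```

Leaving out the WHERE clause in UPDATE or DELETE would apply the change to every record in the table.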

Answer 33

Before data is added to a database, it has to be captured or input by some means or other. Manual methods include transcribing data from a form that has been filled in, for example by a customer ordering items from a catalogue or a market researcher filling in forms on the High Street. Cheques paid in at a bank are scanned using magnetic ink character recognition (MICR); the bank number, customer account number and cheque number are printed in special magnetic ink along the bottom of the cheque. The amount of the cheque has to be manually entered by the bank clerk.

Answer 34

Data may be selected before it is even added to a database, depending on whether or not it matches specified criteria. For example, a speed camera may automatically photograph only those vehicles which are exceeding the speed limit. Once in the database, SQL may be used to select data from different tables which match required criteria. Using the selected data, reports may be produced, letters sent out by post or email, new stock items automatically re-ordered, records added, updated or deleted.

Answer 35

A common method of transferring data between one computer system and another (usually via the Internet) without the need for human intervention is EDI (Electronic Data Interchange). Using standardised message formatting, documents can be exchanged electronically. Transaction software processes the information and the software on the receiving end looks up details of, for example, items to be purchased, price, buyer’s name and address etc. in an order processing system.

Answer 36

The database system has to ensure that it is not possible to complete only part of a transaction, for example booking the cinema ticket without paying for it. ACID (Atomicity, Consistency, Isolation, Durability) is a set of properties that guarantees that transactions are processed reliably.

Answer 37

Atomicity requires that a transaction must be processed in its entirety or not at all. Atomicity must guarantee that in any situation, including power cuts or hard disk crashes, it is not possible to process only part of a transaction.
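
Atomicity can be illustrated with a ticket booking that either completes in full or leaves no trace. This is a sketch using Python's sqlite3 module; the Screening and Payment tables and the function `book_ticket` are hypothetical. Using the connection as a context manager (`with conn:`) commits if the block succeeds and rolls back if it raises an exception.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Screening (ScreeningID INTEGER PRIMARY KEY, SeatsSold INTEGER NOT NULL)"
)
conn.execute(
    "CREATE TABLE Payment (PaymentID INTEGER PRIMARY KEY, "
    "ScreeningID INTEGER NOT NULL, Amount REAL NOT NULL)"
)
conn.execute("INSERT INTO Screening VALUES (1, 0)")
conn.commit()

def book_ticket(screening_id, amount):
    """Update the seat count and record the payment as one transaction."""
    with conn:  # commit on success, roll back on any exception
        conn.execute(
            "UPDATE Screening SET SeatsSold = SeatsSold + 1 WHERE ScreeningID = ?",
            (screening_id,),
        )
        conn.execute(
            "INSERT INTO Payment (ScreeningID, Amount) VALUES (?, ?)",
            (screening_id, amount),
        )

def seats_sold():
    return conn.execute(
        "SELECT SeatsSold FROM Screening WHERE ScreeningID = 1"
    ).fetchone()[0]

# The payment step fails (Amount may not be NULL), so the seat update is undone too
try:
    book_ticket(1, None)
except sqlite3.IntegrityError:
    pass
seats_after_failure = seats_sold()

book_ticket(1, 8.50)
seats_after_success = seats_sold()
print(seats_after_failure, seats_after_success)
```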

Answer 38

The consistency property ensures that no transaction can violate any of the defined validation rules for maintaining the integrity of the database. When a database is created, referential integrity rules will be specified between linked tables. Thus it will not be possible, for example, to record a mark in a RESULTS table for a student who is not in the STUDENT table in the database. Similarly, it will not be possible to delete a record from the STUDENT table if they have marks on the RESULTS table.

Answer 39

The isolation property ensures that concurrent execution of transactions leads to the same results as if transactions were processed one after the other.

Answer 40

The durability property ensures that once a transaction has been committed, it will remain so, even in the event of a power cut. For example, if the online sale of a cinema ticket is in the process of being completed, it should not be possible for the number of seats sold to be updated but the customer’s debit card not processed. As each part of the transaction is completed, it is held in a buffer on disk until all elements of the transaction are completed. Only then will the changes to the database tables be made.

Answer 41

Allowing multiple users to simultaneously update a database table may cause one of the updates to be lost unless measures are taken to prevent this. When an item is updated, the entire record (indeed the whole block in which the record is physically held) will be copied into the user’s own local memory area at the workstation.
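
How an update comes to be lost can be shown with a deliberately simplified simulation in plain Python. The stock record and the two users are hypothetical; each user works on a private copy of the record and then writes the whole copy back.

```python
# The shared record held in the database
database_record = {"ItemID": "ST01", "StockLevel": 10}

# Both users copy the record into their own local memory at the same time
copy_a = dict(database_record)
copy_b = dict(database_record)

# User A sells 3 items and saves the record
copy_a["StockLevel"] -= 3
database_record.update(copy_a)   # StockLevel is now 7

# User B sells 2 items, still working from the old copy, and saves
copy_b["StockLevel"] -= 2
database_record.update(copy_b)   # StockLevel is now 8: A's update has been overwritten

expected_stock = 10 - 3 - 2
actual_stock = database_record["StockLevel"]
print(expected_stock, actual_stock)
```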

Answer 42

Record locking is the technique of preventing simultaneous access to objects in a database in order to prevent updates being lost or inconsistencies in the data arising. In its simplest form, a record is locked whenever a user retrieves it for editing or updating. Anyone else attempting to retrieve the same record is denied access until the transaction is completed or cancelled.
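
The simplest form of record locking can be sketched as a table recording which user currently holds each record. The class `RecordLocks` and its method names are hypothetical and greatly simplified compared with a real DBMS.

```python
class RecordLocks:
    """Tracks which user, if any, currently holds the lock on each record."""

    def __init__(self):
        self._holders = {}  # record ID -> user holding the lock

    def try_lock(self, record_id, user):
        """Lock the record for this user; return False if someone else holds it."""
        holder = self._holders.get(record_id)
        if holder is not None and holder != user:
            return False
        self._holders[record_id] = user
        return True

    def release(self, record_id, user):
        """Release the lock when the transaction is completed or cancelled."""
        if self._holders.get(record_id) == user:
            del self._holders[record_id]


locks = RecordLocks()
a_first = locks.try_lock(42, "User A")     # A retrieves record 42 for editing
b_blocked = locks.try_lock(42, "User B")   # B is denied access while A holds the lock
locks.release(42, "User A")                # A's transaction completes
b_second = locks.try_lock(42, "User B")    # B can now retrieve the record
print(a_first, b_blocked, b_second)
```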

Answer 43

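If two users are attempting to update two records, a situation can arise in which neither can proceed, known as deadlock. Suppose a bank clerk is updating Customer A’s record with a transfer to Customer B’s account. If, at the same time, a second clerk is making a transfer from Customer B’s account to Customer A’s, each clerk may hold a lock on one record while waiting for the other record to be released. The DBMS must recognise when this situation has occurred and take action. Serialisation, timestamp ordering or commitment ordering may be used.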
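
One way a deadlock might be recognised is by recording which transaction each transaction is waiting for and looking for a cycle. This is an illustrative sketch only, not a description of how any particular DBMS does it; the function `find_deadlock` is hypothetical and assumes each transaction waits for at most one other.

```python
def find_deadlock(waits_for):
    """Return the transactions forming a waiting cycle, or None if there is none.

    waits_for maps each blocked transaction to the transaction holding
    the record it needs.
    """
    for start in waits_for:
        path = [start]
        current = waits_for.get(start)
        # Follow the chain of waits until it ends or revisits a transaction
        while current is not None and current not in path:
            path.append(current)
            current = waits_for.get(current)
        if current == start:
            return path
    return None


# Clerk 1 holds A and waits for B; clerk 2 holds B and waits for A
deadlocked = find_deadlock({"Clerk1": "Clerk2", "Clerk2": "Clerk1"})

# Clerk 1 waits for clerk 2, but clerk 2 is not waiting for anyone
no_deadlock = find_deadlock({"Clerk1": "Clerk2"})
print(deadlocked, no_deadlock)
```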

Answer 44

Serialisation is a technique which ensures that transactions do not overlap in time and therefore cannot interfere with each other or lead to updates being lost. A transaction cannot start until the previous one has finished. It can be implemented using timestamp ordering.

Answer 45

Whenever a transaction starts, it is given a timestamp, so that if two transactions affect the same object (for example record or table), the transaction with the earlier timestamp should be applied first. In order to ensure that transactions are not lost, every object in the database has a read timestamp and a write timestamp, which are updated whenever an object in a database is read or written.
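
The read and write timestamps can be sketched as follows. The rules used here are those of the commonly taught basic timestamp ordering scheme, which is an assumption about the method intended above; the class and exception names are hypothetical.

```python
class TransactionTooLate(Exception):
    """Raised when an operation arrives after a later transaction has used the object."""


class TimestampedObject:
    def __init__(self, value):
        self.value = value
        self.read_ts = 0    # timestamp of the latest transaction to read the object
        self.write_ts = 0   # timestamp of the latest transaction to write the object

    def read(self, ts):
        # A transaction may not read a value already overwritten by a later one
        if ts < self.write_ts:
            raise TransactionTooLate("read rejected")
        self.read_ts = max(self.read_ts, ts)
        return self.value

    def write(self, ts, value):
        # A transaction may not overwrite a value that a later one has read or written
        if ts < self.read_ts or ts < self.write_ts:
            raise TransactionTooLate("write rejected")
        self.value = value
        self.write_ts = ts


balance = TimestampedObject(100)
balance.read(2)                      # transaction 2 reads the balance

try:
    balance.write(1, 50)             # transaction 1 started earlier but writes too late
    early_write_rejected = False
except TransactionTooLate:
    early_write_rejected = True

balance.write(3, 70)                 # transaction 3 is later than every previous access
print(early_write_rejected, balance.value, balance.read_ts, balance.write_ts)
```

A rejected transaction would typically be restarted with a new timestamp.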

Answer 46

Commitment ordering is another serialisation technique used to ensure that transactions are not lost when two or more users are simultaneously trying to access the same database object. Transactions are ordered in terms of their dependencies on each other as well as the time they were initiated. It can be used to prevent deadlock by blocking one request until another is completed.
