Database Terms And Internet Terms Flashcards

Question

What is the trade-off with regards to the redundancy of data?

Answer 1

The trade-off here is the cost of collecting and maintaining the redundant data and the system overhead it requires to process the data. Another concern is synchronization of data updates in terms of timing and sequence. Ideally, the synchronization should be done at the system level rather than the application level.

Answer 2

Store data and to provide operations on the database. Operations usually include create, delete, update, and search of data.

Answer 3

1) Persistence - is the property wherein the state of the database survives the execution. Of a process in order to be reused later in another process. 2) Data sharing - is the property that permits simultaneous use of the database by multiple users. 3) Recovery - Refers to the capability of the DBMS to return its data to a consistent and coherent state after a hardware or software failure. 4) Database language - permits external access to the DBMS. 5) Security and integrity - security and authorisation control, integrity checking, utility programs, backup/archiving, versioning, and view definition.

Answer 4

Concurrency control (locking) mechanism that prevents users from executing inconsistent actions on the database.

Answer 5

Data definition language (DDL) - used to define database schema and subschema Data manipulation language (DML) - used to examine and manipulate contents of the database Data control language (DCL) - used to specify parameters needed to define the internal organisation of the database such as indexes and buffer size. Ad hoc query language - is provided for interactive specification of queries.

Answer 6

Semantic - declaration of semantic and structural integrity rules and the enforcement of these rules. May be automatically enforced at program run time or at compile time or may be performed only when a message is sent. Referential - No record may contain a reference to the primary key of a non existent record

Answer 7

User needs - Conceptual model or user views - logical/external model - physical/internal model - User requirements are specified to conceptual model first (user views). - When the conceptual model is presented to the DBMS, it becomes a logical model/external model/schema/subschema. - The logical model is converted to a physical model(internal model) in terms of physical storage media such as magnetic disks, tapes, disk arrays.

Answer 8

The type of DBMS is not a factor in designing a conceptual model, but the design of a logical model is dependent on the type of DBMS to be used. This means that the conceptual model is, or should be, independent of a DBMS.

Answer 9

- process of determining an information system structure that is independent of software or hardware considerations. - It produces logical data structures consisting of a number of entities connected by one-to-one or one-to-many relationships, subject to appropriate integrity checking. - The objective is to improve the effectiveness of an information system by maximizing the accuracy, consistency, integrity, security, and completeness of the database.

Answer 10

- the implementation of a logical design in a particular computer system environment. - deals with retrieval and update workloads for the system and the parameters required (i.e., average time required for random/sequential access to a track, length of a track, and disk cylinder sizes) for the hardware environment. - The objective is to improve information system performance by minimizing the data entry time, data retrieval time, data update time, data query time, and storage space and costs.

Answer 11

1) Analyze workload complexity and characteristics. 2) Translate the relationships specified in the logical data structures into physical records and hardware devices, and determine their relationships. This includes consideration of symbolic and direct pointers. Symbolic pointers contain the other’s logical identifier. Direct pointers contain the other’s physical address. Both pointers can coexist. 3) Fine-tune the design by determining the initial record loading factors, record segmentations, record and file indexes, primary and secondary access methods, file block sizes, and secondary memory management for overflow handling.

Answer 12

Physical - concerned with physical storage of data (internal schema) - concerned with entities for which data are collected - describes how data are arranged in the defined storage media (eg disk) from program and programmer viewpoints - physical in nature - describes the way data is physically located in the database Logical - concerned with user-oriented data views (external schema) - concerned with entities for which data are collected - describes how data can be viewed by the designated end user - conceptual in nature (conceptual schema) - describes overall logical view of the database

Answer 13

Describes relationships between the data elements and is used as a tool to represent the conceptual organization of data.

Answer 14

One to one - one bed assigned to one patient One to many - one hospital room, many patients Many to Many - one surgeon may attend to many patients, and a patient may be attended by more than one surgeon

Answer 15

1) data structure - basic building blocks describing the way data are organised 2) Operators - set of functions that can be used to act on the data structures 3) Integrity rules - the valid states in which the data stored in the database may exist

Answer 16

Is to provide a formal means of representing information and a formal means of manipulating the representation.

Answer 17

1) Relational 2) Hierarchical 3) Network 4) Inverted file 5) Object 6) Distributed

Answer 18

- Consists of columns = data fields, and rows = data records, represented in a table. - Data is stored in tables with keys or indexes outside the program. - Columns of tables are called attributes, rows are called tuples.

Answer 19

1) all "key" values are defined 2) duplicate rows do not exist 3) column order is not significant 4) row order is not significant

Answer 20

simplicity in use and true data independence from data storage structures and access methods.

Answer 21

low system performance and operational efficiency compared to other data models

Answer 22

Can be related to a family tree concept a number of trees or data records forma database Every branch has a number of leaves or data fields Consists of nodes and branches. Highest node is called a "root" (parent-level 1_ and its every occurrence begins a logical database record. The dependent nodes are at the lower levels (children - level 2, 3....)

Answer 23

- model always starts with a root node - a parent node must have at least one dependent node - every node except the root must be accessed through its parent node - except at level 1, the root node, the dependent node can be added horizontally as well as vertically with no limitations - there can be a number of occurrences of each node at each level - every node occurring at level 2 must be connected with one and only one node occurring at level 1, and is repeated down.

Answer 24

proven performance, simplicity, ease of use and reduction of data dependency

Answer 25

addition and deletion of parent/child nodes can become complex and deletion of parent results in deletion of children

Answer 26

Is depicted using blocks and arrows. Block represents a record type or an entity. Each record type is composed of zero, one or more data elements/fields or attributes. An arrow linking two blocks shows the relationship between 2 records types.

Answer 27

Consists of a number of areas. An area contains records, which in turn contain data elements or fields. A set (grouping of records), may reside in an area or span a number of areas. Each area can have its own unique physical attributes. Areas can be operated independently of, or in conjunction with other areas.

Answer 28

Each entity is represented by a file Each record in the file represents an occurrence of the entity Each attribute becomes a data field or element in the inverted file

Answer 29

A set is composed of related records There is only a single owner in a set There may be zero, one or many members in a set

Answer 30

Proven performance and accommodation of many-to-many relationships that occur quite frequently

Answer 31

Complexity in programming | Loss of data independence during database reorganisation and when sets are removed

Answer 32

Each entity is represented by a file Each record in the file represents an occurrence of the entity Each attribute becomes data field or element in the file Data fields are inverted to allow efficient access to individual files To accomplish this, an index file is created containing all the values taken by the inverted field and pointers to all records in the file.

Answer 33

Simplicity Data independence Ease of adding new files and fields

Answer 34

Difficulty in synchronising changes between database records/fields and index file.

Answer 35

- Developed by combining the special nature of object-oriented programming languages with DBMS. - Objects, classes and inheritance form the basis for the structural aspects of the object data model. - Objects are basic entities that have data structures and operations. - Every object has an object ID that is a unique, system-provided identifier. - Classes describe generic object types. - All objects are members of a class. - Classes are related through inheritance. - Classes can be related to each other by superclass or subclass relationships. - Class definitions are the mechanism for specifying the database schema for an application.

Answer 36

System development efficiency and handling of complex data structures

Answer 37

New technology and new risks, which requires training and learning curves.

Answer 38

Data resides in more than one physical database in the network. Location transparency, in which the user does not need to know where data are stored, is one major goal of distributed data model. Similarly, programmers do not have to rewrite applications and can move data from one location to another, depending on need.

Answer 39

A technique used to start at certain points in the execution of a program after the system fails or detects an error.

Answer 40

ADV - relatively easy to implement in batch programs Disadvantages - cumbersome to implement for online programs due to concurrent processing. They also degrade system performance.

Answer 41

- Database designer needs to balance the number of checkpoints and time interval between 2 checkpoints. - Higher the number of checkpoints, the greater the degradation of performance, even though recovery process is easier. - If time interval between 2 check points is long, however, performance degradation is reduced but recovery is more difficult.

Answer 42

1) Time interval 2) Operator action 3) No of changes to database 4) No of records written to log tape 5) No of transactions processed

Answer 43

- Common to find unused space in database due to deletion of many records - unused space widens distance between active database records, resulting in longer time for data retrieval. - compression or compaction techniques can be used to reduce the amount of storage space required for a given collection of data records

Answer 44

Adv - saves storage space and saves disk input/output operations Disadvantages - CPU activity will increase to decompress the data after it has been retrieved Trade off exists between the input/output savings and additional CPU activity.

Answer 45

A deletion of some records in the database results in a fragmentation of space or unused space - could happen during initial loading or after reloading of the database. Other reorganisation efforts could result from changing block sizes, buffer pool sizes, prime areas and overflow areas.

Answer 46

- copying the old database onto another device, such as disk or tape - reblocking the valid records - reloading the valid records - excluding the records marked “deleted” during this process. Reorganisation can arrange records in such a way that their physical sequence is the same or nearly the same as their logical sequence. Also possible to arrange the records so that the more frequently access ones are stored on disk, rarely accessed records are stored on tape.

Answer 47

Restructuring - major activity, affects existing application systems and procedures. Reorganisation - minor activity, does not affect existing application systems and procedures

Answer 48

1) Logical changes - adding or deleting data elements, combining a no of records, changing relationship between records 2) Physical changes - in terms of channels and disk configuration to minimise contention(delays) by adding or removing some pointers. 3) Procedural changes - in terms of backup and recovery procedures and access control security rules

Answer 49

Often a performance monitoring tool and/or utility program is utilised to take internal readings of the database and its components. Objective is to identify performance related problems and take corrective action.

Answer 50

Alphabetical listing that describes all the data elements in an application system and tells how and where they are used.

Answer 51

Prevent the entry of inaccurate data into the system

Answer 52

Corrective control because of its “where-used” information, which can be used to trace data backward and forward through the transaction. As an audit trail.

Answer 53

Usually automated software is used to manage and control the DD. A manual DD can become inconsistent with what is actually in the system in a very short time. Automated DD supports the objectives of minimum data redundancy, maximum data consistency and adequate data integrity and security.

Answer 54

Dependent DD uses underlying DBMS to manage and control its data and it is a part of DBMS. A stand-alone DD is a separate package from the DBMS package.

Answer 55

Active DD - requires all data descriptions for a database defined or available at one time Passive DD - may or may not require a check for currency of data descriptions before a program is executed.

Answer 56

- Provides quick access to the data in the database - Tracks database accesses and actions - Provides valuable statistics for improving system performance - Minimizes redundancy in storage of data descriptions - Facilitates system documentation - Improves data editing and validation controls - Works well with database files

Answer 57

- Less risk of commitment to a DBMS - Easier to implementation - Can describe data descriptions on a piecemeal basis - Works well with conventional data files - Serves as a documentation and communication tool

Answer 58

- access control reports - audit trail reports - cross-reference reports - data elements and their relationships with their usage frequencies - summary, change, error and ad hoc reports

Answer 59

- It provides a consistent description of data as well as consistent data names for programming and data retrieval. - - This in turn provides consistent descriptive names and meanings. - It shows where-used information, such as what programs used the data items, which files contain the data items, and which printed reports display the data items. - It provides data integrity through data editing and validation routines. - It supports elimination of data redundancy. - It supports tracing of data item’s path through several application programs. - It describes the relationships among the entities.

Database Terms And Internet Terms Flashcards

(83 cards)