Module 1 Database Systems Flashcards

1
Q

data

A

Raw facts from which the required information is derived. Data have little meaning unless they are grouped in a logical manner

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

field

A

A character or a group of characters (numeric or alphanumeric) that describes a specific characteristic. A field may define a telephone number, a date, or other specific characteristics that the end user wants to keep track of.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

record

A

A logically connected set of one or more fields that describes a person, place, event, or thing. For example, a CUSTOMER record may be composed of the fields CUST_NUMBER, CUST_LNAME, CUST_FNAME, CUST_INITIAL, CUST_ADDRESS, CUST_CITY, CUST_STATE, CUST_ZIPCODE, CUST_AREACODE, and CUST_PHONE.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

file

A

Historically, a collection of file folders, properly tagged and kept in a filing cabinet. Although such manual files still exist, we more commonly think of a (computer) file as a collection of related records that contain information of interest to the end user. For example, a sales organization is likely to keep a file containing customer data. Keep in mind that the phrase related records reflects a relationship based on function. For example, customer data are kept in a file named CUSTOMER. The records in this customer file are related by the fact that they all pertain to customers. Similarly, a file named PRODUCT would contain records that describe products – the records in this file are all related by the fact that they all pertain to products. You would not expect to find customer data in a product file, or vice versa.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is data redundancy, and which characteristics of the file system can lead to it?

A

Data redundancy exists when unnecessarily duplicated data are found in the database. For example, a customer’s telephone number may be found in the customer file, in the sales agent file, and in the invoice file. Data redundancy is symptomatic of a (computer) file system, given its inability to represent and manage data relationships. Data redundancy may also be the result of poorly-designed databases that allow the same data to be kept in different locations. (Here’s another opportunity to emphasize the need for good database design!)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is data independence, and why is it lacking in file systems?

A

What is data independence, and why is it lacking in file systems?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a DBMS, and what are its functions?

A

A DBMS is best described as a collection of programs that manage the database structure and that control shared access to the data in the database. Current DBMSes also store the relationships between the database components; they also take care of defining the required access paths to those components. The functions of a current-generation DBMS may be summarized as follows:
 The DBMS stores the definitions of data and their relationships (metadata) in a data dictionary; any changes made are automatically recorded in the data dictionary.
 The DBMS creates the complex structures required for data storage.
 The DBMS transforms entered data to conform to the data structures in item 2.
 The DBMS creates a security system and enforces security within that system.
 The DBMS creates complex structures that allow multiple-user access to the data.
 The DBMS performs backup and data recovery procedures to ensure data safety.
 The DBMS promotes and enforces integrity rules to eliminate data integrity problems.
 The DBMS provides access to the data via utility programs and from programming languages interfaces.
 The DBMS provides end-user access to data within a computer network environment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is structual independence, and why is it important?

A

Structural independence exists when data access programs are not subject to change when the file’s structural characteristics, such as the number or order of the columns in a table, change. Structural independence is important because it substantially decreases programming effort and program maintenance costs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Explain the difference between data and information.

A

Data are raw facts. Information is processed data to reveal the meaning behind the facts. Let’s summarize some key points:
 Data constitute the building bocks of information.
 Information is produced by processing data.
 Information is used to reveal the meaning of data.
 Good, relevant, and timely information is the key to good decision making.
 Good decision making is the key to organizational survival in a global environment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the role of a DBMS, and what are its advantages? What are its disadvantages?

A

A database management system (DBMS) is a collection of programs that manages the database structure and controls access to the data stored in the database. Figure 1.2 (shown in the text) illustrates that the DBMS serves as the intermediary between the user and the database. The DBMS receives all application requests and translates them into the complex operations required to fulfill those requests. The DBMS hides much of the database’s internal complexity from the application programs and users. The application program might be written by a programmer using a programming language such as COBOL, Visual Basic, or C++, or it might be created through a DBMS utility program.
Having a DBMS between the end user’s applications and the database offers some important advantages. First, the DBMS enables the data in the database to be shared among multiple applications or users. Second, the DBMS integrates the many different users’ views of the data into a single all-encompassing data repository.
Because data are the crucial raw material from which information is derived, you must have a good way of managing such data. As you will discover in this book, the DBMS helps make data management more efficient and effective. In particular, a DBMS provides advantages such as:
 Improved data sharing. The DBMS helps create an environment in which end users have better access to more and better-managed data. Such access makes it possible for end users to respond quickly to changes in their environment.
 Better data integration. Wider access to well-managed data promotes an integrated view of the organization’s operations and a clearer view of the big picture. It becomes much easier to see how actions in one segment of the company affect other segments.
 Minimized data inconsistency. Data inconsistency exists when different versions of the same data appear in different places. For example, data inconsistency exists when a company’s sales department stores a sales representative’s name as “Bill Brown” and the company’s personnel department stores that same person’s name as “William G. Brown” or when the company’s regional sales office shows the price of product “X” as $45.95 and its national sales office shows the same product’s price as $43.95. The probability of data inconsistency is greatly reduced in a properly designed database.
 Improved data access. The DBMS makes it possible to produce quick answers to ad hoc queries. From a database perspective, a query is a specific request for data manipulation (for example, to read or update the data) issued to the DBMS. Simply put, a query is a question and an ad hoc query is a spur-of-the-moment question. The DBMS sends back an answer (called the query result set) to the application. For example, end users, when dealing with large amounts of sales data, might want quick answers to questions (ad hoc queries) such as:
 What was the dollar volume of sales by product during the past six months?
 What is the sales bonus figure for each of our salespeople during the past three months?
 How many of our customers have credit balances of $3,000 or more?
 Improved decision making. Better-managed data and improved data access make it possible to generate better quality information, on which better decisions are based.
 Increased end-user productivity. The availability of data, combined with the tools that transform data into usable information, empowers end users to make quick, informed decisions that can make the difference between success and failure in the global economy.
Chapter 1 Database Systems
4
The advantages of using a DBMS are not limited to the few just listed. In fact, you will discover many more advantages as you learn more about the technical details of databases and their proper design.
Although the database system yields considerable advantages over previous data management approaches, database systems do carry significant disadvantages. For example:
 Increased costs. Database systems require sophisticated hardware and software and highly skilled personnel. The cost of maintaining the hardware, software, and personnel required to operate and manage a database system can be substantial. Training, licensing, and regulation compliance costs are often overlooked when database systems are implemented.
 Management complexity. Database systems interface with many different technologies and have a significant impact on a company’s resources and culture. The changes introduced by the adoption of a database system must be properly managed to ensure that they help advance the company’s objectives. Given the fact that databases systems hold crucial company data that are accessed from multiple sources, security issues must be assessed constantly.
 Maintaining currency. To maximize the efficiency of the database system, you must keep your system current. Therefore, you must perform frequent updates and apply the latest patches and security measures to all components. Because database technology advances rapidly, personnel training costs tend to be significant.
 Vendor dependence. Given the heavy investment in technology and personnel training, companies might be reluctant to change database vendors. As a consequence, vendors are less likely to offer pricing point advantages to existing customers, and those customers might be limited in their choice of database system components.
 Frequent upgrade/replacement cycles. DBMS vendors frequently upgrade their products by adding new functionality. Such new features often come bundled in new upgrade versions of the software. Some of these versions require hardware upgrades. Not only do the upgrades themselves cost money, but it also costs money to train database users and administrators to properly use and manage the new features.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

List and describe the different types of databases.

A
The focus is on Section 1.3.2, TYPES OF DATABASES. Organize the discussion around the number of users, database site location, and data use:
 Number of users
o Single-user
o Multiuser
o Workgroup
o Enterprise
 Database site location
o Centralized
o Distributed
 Type of data
o General-purpose
o Discipline-specific
 Database use
o Transactional (production) database (OLTP)
o Data warehouse database (OLAP)
 Degree of data structure
o Unstructured data
o Structured data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the main components of a database system?

A

The basis of this discussion is Section 1.7.1, THE DATABASE SYSTEM ENVIRONMENT. Figure 1.9 provides a good bird’s eye view of the components. Note that the system’s components are hardware, software, people, procedures, and data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are metadata?

A

Metadata is data about data. That is, metadata define the data characteristics such as the data type (such as character or numeric) and the relationships that link the data. Relationships are an important component of database design. What makes relationships especially interesting is that they are often defined by their environment. For instance, the relationship between EMPLOYEE and JOB is likely to depend on the organization’s definition of the work environment. For example, in some organizations an employee can have multiple job assignments, while in other organizations – or even in other divisions within the same organization – an employee can have only one job assignment.
The details of relationship types and the roles played by those relationships in data models are defined and described in Chapter 2, Data Models.”. Relationships will play a key role in subsequent chapters. You cannot effectively deal with database design issues unless you address relationships.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Explain why database design is important.

A

The focus is on Section 1.4, WHY DATABASE DESIGN IS IMPORTANT. Explain that modern database and applications development software is so easy to use that many people can quickly learn to implement a simple database and develop simple applications within a week or so, without giving design much thought. As data and reporting requirements become more complex, those same people will simply (and quickly!) produce the required add-ons. That’s how data redundancies and all their attendant anomalies develop, thus reducing the “database” and its applications to a status worse than useless. Stress these points:
 Good applications can’t overcome bad database designs.
 The existence of a DBMS does not guarantee good data management, nor does it ensure that the database will be able to generate correct and timely information.
 Ultimately, the end user and the designer decide what data will be stored in the database.
A database created without the benefit of a detailed blueprint is unlikely to be satisfactory. Pose this question: would you think it smart to build a house without the benefit of a blueprint? So why would you want to create a database without a blueprint? (Perhaps it would be OK to build a chicken coop without a blueprint, but would you want your house to be built the same way?)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the potential costs of implementing a database system?

A

Although the database system yields considerable advantages over previous data management approaches, database systems do impose significant costs. For example:
 Increased acquisition and operating costs. Database systems require sophisticated hardware and software and highly skilled personnel. The cost of maintaining the hardware, software, and personnel required to operate and manage a database system can be substantial.
 Management complexity. Database systems interface with many different technologies and have a significant impact on a company’s resources and culture. The changes introduced by the adoption of a database system must be properly managed to ensure that they help advance the company’s objectives. Given the fact that databases systems hold crucial company data that are accessed from multiple sources, security issues must be assessed constantly.
 Maintaining currency. To maximize the efficiency of the database system, you must keep your system current. Therefore, you must perform frequent updates and apply the latest patches and security measures to all components. Because database technology advances rapidly, personnel training costs tend to be significant.
 Vendor dependence. Given the heavy investment in technology and personnel training, companies may be reluctant to change database vendors. As a consequence, vendors are less likely to offer pricing point advantages to existing customers and those customers may be limited in their choice of database system components.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Use examples to compare and contrast unstructured and structured data. Which type is more prevalent in a typical business environment?

A

Unstructured data are data that exist in their original (raw) state, that is, in the format in which they were collected. Therefore, unstructured data exist in a format that does not lend itself to the processing that yields information. Structured data are the result of taking unstructured data and formatting (structuring) such data to facilitate storage, use, and the generation of information. You apply structure (format) based on the type of processing that you intend to perform on the data. Some data might be not ready (unstructured) for some types of processing, but they might be ready (structured) for other types of processing. For example, the data value 37890 might refer to a zip code, a sales value, or a product code. If this value represents a zip code or a product code and is stored as text, you cannot perform mathematical computations with it. On the other hand, if this value represents a sales transaction, it is necessary to format it as numeric.
Structured data are more prevalent than unstructured data in a business environment. For example, if invoices are stored as images for future retrieval and display, you can scan them and save them in a graphic format. On the other hand, if you want to derive information such as monthly totals and average sales, such graphic storage would not be useful. Instead, you could store the invoice data in a (structured) spreadsheet format so that you can perform the requisite computations.

17
Q

What are some basic database functions that a spreadsheet cannot perform.

A

Spreadsheets do not support self-documentation through metadata, enforcement of data types or domains to ensure consistency of data within a column, defined relationships among tables, or constraints to ensure consistency of data across related tables.

18
Q

What common problems do a collection of spreadsheets created by end users share with the typical file system?

A

A collection of spreadsheets shares several problems with the typical file system. First problem is that end users create their own, private, copies of the data, which creates issues of data ownership. This situation also creates islands of information where changes to one set of data are not reflected in all of the copies of the data. This leads to the second problem – lack of data consistency. Because the data in various spreadsheets may be intended to represent a view of the business environment, a lack of consistency in the data may lead to faulty decision making based on inaccurate data.

19
Q

Explain the significance of the loss of direct, hands-on access to business data that users experienced with the advent of computerized data repositories.

A

Users lost direct, hands-on access to the business data when computerized data repositories were developed because the IT skills necessary to directly access and manipulate the data were beyond the average user’s abilities, and because security precautions restricted access to the shared data. This was significant because it removed users from the direct manipulation of data and introduced significant time delays for data access. When users need answers to business questions from the data, necessity often does not give them the luxury of time to wait days, weeks, or even months for the required reports. The desire to return hands-on access to the data to the users, among other drivers, helped to propel the development of database systems. While database systems have greatly improved the ability of users to directly access data, the need to quickly manipulate data for themselves has lead to the problems of spreadsheets being used when databases are needed.