Chapter 1 Flashcards
Data
Raw facts, or facts that have not yet been processed to reveal their meaning to the end user.
Information
The result of processing raw data to reveal its meaning. Information consists of transformed data and facilitates decision making.
Knowledge
The body of information and facts about a specific subject. Knowledge implies familiarity, awareness, and understanding of information as it applies to an environment. A key characteristic is that new knowledge can be derived from old knowledge.
Data Management
A process that focuses on data collection, storage, and retrieval. Common data management functions include addition, deletion, modification, and listing.
Database
A shared, integrated computer structure that houses a collection of related data. A database contains two types of data: end-user data (raw facts) and metadata.
Metadata
Data about data; that is, data about data characteristics and relationships.
End-User Data
Raw facts of interest to the end user.
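The split between end-user data and metadata can be sketched as follows. This is a minimal illustration, assuming Python's built-in sqlite3 module; the student table and its columns are hypothetical and not part of the chapter.

```python
import sqlite3

# In-memory database; the student table and its columns are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE student (stu_id INTEGER PRIMARY KEY, stu_name TEXT NOT NULL)"
)
conn.execute("INSERT INTO student VALUES (1, 'Ada')")

# End-user data: the raw facts stored in the table.
end_user_rows = conn.execute("SELECT stu_id, stu_name FROM student").fetchall()

# Metadata: data about the data, here each column's name and declared type.
# PRAGMA table_info yields (cid, name, type, notnull, dflt_value, pk) per column.
metadata = [(row[1], row[2]) for row in conn.execute("PRAGMA table_info(student)")]

print(end_user_rows)  # [(1, 'Ada')]
print(metadata)       # [('stu_id', 'INTEGER'), ('stu_name', 'TEXT')]
```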
Database Management System (DBMS)
The collection of programs that manages the database structure and controls access to the data stored in the database.
Advantages of a DBMS
- Improved data sharing
- Improved data security
- Better data integration
- Minimized data inconsistency
- Improved data access
- Improved decision making
- Increased end-user productivity
Data Inconsistency
A condition in which different versions of the same data yield different (inconsistent) results.
Query
A question or task asked by an end user of a database in the form of SQL code. A specific request for data manipulation issued by the end user or the application to the DBMS.
Ad Hoc Query
A “spur-of-the-moment” question.
Query Result Set
The collection of data rows returned by a query.
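A query and its result set might look like the sketch below. It assumes Python's built-in sqlite3 module; the product table, its columns, and the price threshold are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE product (p_code TEXT PRIMARY KEY, p_price REAL)")
conn.executemany(
    "INSERT INTO product VALUES (?, ?)",
    [("A1", 9.5), ("B2", 20.0), ("C3", 35.0)],
)

# The query: a specific request for data, expressed in SQL.
query = "SELECT p_code, p_price FROM product WHERE p_price > ? ORDER BY p_code"

# The query result set: the rows the DBMS returns for that request.
result_set = conn.execute(query, (10,)).fetchall()
print(result_set)  # [('B2', 20.0), ('C3', 35.0)]
```

Typing a different question at the prompt, such as changing the threshold on the spot, is what makes a query ad hoc.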
Data Quality
A comprehensive approach to ensuring the accuracy, validity, and timeliness of data.
Single-User Database
A database that supports only one user at a time.
Desktop Database
A single-user database that runs on a personal computer.
Multiuser Database
A database that supports multiple concurrent users.
Workgroup Database
A multiuser database that usually supports fewer than 50 users or is used for a specific department in an organization.
Enterprise Database
The overall company data representation, which provides support for present and expected future needs.
Centralized Database
A database located at a single site.
Distributed Database
A logically related database that is stored in two or more physically independent sites.
Cloud Database
A database that is created and maintained using cloud services, such as Microsoft Azure or Amazon Web Services (AWS).
General-Purpose Database
A database that contains a wide variety of data used in multiple disciplines.
Discipline-Specific Database
A database that contains data focused on specific subject areas.
Operational Database aka Online Transaction Processing (OLTP) Database, Transactional Database, or Production Database
A database designed primarily to support a company’s day-to-day operations. Also known as a transactional database, OLTP database, or production database.
Analytical Database
A database focused primarily on storing historical data and business metrics used for tactical or strategic decision making.
Data Warehouse
A specialized database that stores historical and aggregated data in a format optimized for decision support. An integrated, subject-oriented, time-variant, nonvolatile collection of data that provides support for decision making.
Online Analytical Processing (OLAP)
Decision support system (DSS) tools that use multidimensional data analysis techniques. OLAP creates an advanced data analysis environment that supports decision making, business modeling, and operations research.
Business Intelligence
A comprehensive approach to capture and process business data with the purpose of generating information to support business decision making.
Unstructured Data
Data that exists in its original, raw state; that is, in the format in which it was collected, without conforming to a predefined data model.
Structured Data
Data that conforms to a predefined data model and has been formatted to facilitate storage, use, and information generation.
Semistructured Data
Data that has already been processed to some extent.
Extensible Markup Language (XML)
A metalanguage used to represent and manipulate data elements. Unlike other markup languages, XML permits the manipulation of a document’s data elements. XML facilitates the exchange of structured documents such as orders and invoices over the Internet.
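As a rough illustration of manipulating a document's data elements, the sketch below parses a small invoice with Python's standard xml.etree.ElementTree module. The invoice layout, tag names, and attribute names are assumptions made up for this example.

```python
import xml.etree.ElementTree as ET

# A hypothetical invoice document; the tags and attributes are illustrative only.
doc = """<invoice number="1001">
  <customer>Gigantic University</customer>
  <line sku="A1" qty="2" price="9.50"/>
  <line sku="B2" qty="1" price="20.00"/>
</invoice>"""

root = ET.fromstring(doc)

# Individual data elements can be read and computed on, not just displayed.
customer = root.findtext("customer")
total = sum(
    int(line.get("qty")) * float(line.get("price"))
    for line in root.findall("line")
)

print(customer, total)  # Gigantic University 39.0
```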
XML Database
A database system that stores and manages semistructured XML data.
Social Media
Web and mobile technologies that enable “anywhere, anytime, always on” human interactions.
NoSQL
A new generation of database management systems that is not based on the traditional relational database model.
Database Design
The process that yields the description of the database structure and determines the database components. The second phase of the database life cycle.
Data Processing (DP) Specialist
The person responsible for developing and managing a computerized file processing system.
Field
A character or group of characters (alphabetic or numeric) that has a specific meaning. A field is used to define and store data.
Record
A logically connected set of one or more fields that describes a person, place, or thing.
File
A collection of related records. For example, a file might contain data about the students currently enrolled at Gigantic University.
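The field, record, and file hierarchy can be sketched with a small comma-separated file. This assumes Python's standard csv module; the student data and field names are hypothetical.

```python
import csv
import io

# A "file": a collection of related records (hypothetical student data).
file_text = "stu_id,stu_name,stu_major\n1,Ada,CS\n2,Grace,Math\n"

# Each row is a record: a logically connected set of fields.
records = list(csv.DictReader(io.StringIO(file_text)))

# A field: one named value within a record.
first_name = records[0]["stu_name"]

print(len(records), first_name)  # 2 Ada
```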
Problems with File System Data Processing
- Lengthy development times
- Difficulty of getting quick answers
- Complex system administration
- Lack of security and limited data sharing
- Extensive programming
Structural Dependence
A data characteristic in which a change in the database schema affects data access, thus requiring changes in all access programs.
Structural Independence
A data characteristic in which changes in the database schema do not affect data access.
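One way to picture structural independence is a query that keeps working after the table structure changes. The sketch below assumes Python's built-in sqlite3 module; the customer table and its columns are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer (cus_id INTEGER PRIMARY KEY, cus_name TEXT)")
conn.execute("INSERT INTO customer VALUES (1, 'Ada')")

# The access program refers to columns by name only.
query = "SELECT cus_name FROM customer WHERE cus_id = 1"
before = conn.execute(query).fetchall()

# Change the structure by adding a column; the query text is left untouched.
conn.execute("ALTER TABLE customer ADD COLUMN cus_phone TEXT")
after = conn.execute(query).fetchall()

print(before == after)  # True
```

In a file system with structural dependence, adding the same field would typically mean rewriting every program that reads the file.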
Data Type
The kind of values that can be used or stored. Data types are also used in programming languages and database systems to determine the operations that can be applied to such data.
Data Dependence
A data condition in which data representation and manipulation are dependent on the physical data storage characteristics.
Data Independence
A condition in which data access is unaffected by changes in the physical data storage characteristics.
Logical Data Format
The way a person views data within the context of a problem domain.
Physical Data Format
The way a computer “sees” (stores) data.
Islands of Information
In the old file system environment, pools of independent, often duplicated, and inconsistent data created and managed by different departments.
Data Redundancy
Exists when the same data is stored unnecessarily at different places.
Results of Data Redundancy
- Poor data security
- Data inconsistency
- Data-entry errors
- Data integrity problems
Data Integrity
In a relational database, a condition in which the data in the database complies with all entity and referential integrity constraints.
Data integrity means two things:
1. Data is accurate
2. Data is verifiable
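A referential integrity constraint can be sketched as follows, assuming Python's built-in sqlite3 module. The department and employee tables are hypothetical, and SQLite enforces foreign keys only after the PRAGMA shown here is turned on.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves enforcement off by default
conn.execute("CREATE TABLE department (dept_id INTEGER PRIMARY KEY)")
conn.execute(
    "CREATE TABLE employee ("
    " emp_id INTEGER PRIMARY KEY,"
    " dept_id INTEGER NOT NULL REFERENCES department(dept_id))"
)
conn.execute("INSERT INTO department VALUES (10)")
conn.execute("INSERT INTO employee VALUES (1, 10)")  # valid: department 10 exists

# An employee pointing at a nonexistent department violates referential integrity.
try:
    conn.execute("INSERT INTO employee VALUES (2, 99)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

print(rejected)  # True
```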
Data Anomaly
A data abnormality in which inconsistent changes have been made to a database. For example, an employee moves, but the address change is not corrected in all files in the database.
Types of Data Anomalies
- Update anomalies
- Insertion anomalies
- Deletion anomalies
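An update anomaly caused by redundancy can be sketched in a few lines of Python. The two dictionaries stand in for separate departmental files; their names and contents are hypothetical.

```python
# Two departmental "files" that redundantly store the same employee address.
payroll_file = {"E1": {"name": "Ada", "address": "1 Old Rd"}}
benefits_file = {"E1": {"name": "Ada", "address": "1 Old Rd"}}

# The employee moves, but only one file is updated.
payroll_file["E1"]["address"] = "2 New St"

# The same fact now has two different values: an update anomaly.
inconsistent = payroll_file["E1"]["address"] != benefits_file["E1"]["address"]
print(inconsistent)  # True
```

Storing the address in one place and referencing it from both departments would remove the chance of this mismatch.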
Database System
An organization of components that defines and regulates the collection, storage, management, and use of data in a database environment.
Five Parts of a Database System
- Hardware (all of the system's physical devices)
- Software (operating system, DBMS software, application programs, and utility software)
- People (system administrators, database administrators, database designers, systems analysts and programmers, end users)
- Procedures (instructions and rules that govern the design and use of the database system)
- Data
Data Dictionary
A DBMS component that stores metadata—data about data. Thus, the data dictionary contains the data definition as well as its characteristics and relationships. A data dictionary may also include data that is external to the DBMS.
DBMS Functions
- Data dictionary management
- Data storage management
- Data transformation and presentation
- Security management
- Multiuser access control
- Backup and recovery management
- Data integrity management
- Database access languages and application programming interfaces
- Database communication interfaces
Performance Tuning
Activities that make a database perform more efficiently in terms of storage and access speed.
Query Language
A nonprocedural language that is used by a DBMS to manipulate its data. An example of a query language is SQL.
Structured Query Language (SQL)
A powerful and flexible relational database language composed of commands that enable users to create database and table structures, perform various types of data manipulation and data administration, and query the database to extract useful information.
Database System Disadvantages
- Increased costs
- Management complexity
- Maintaining currency
- Vendor dependence
- Frequent upgrade/replacement cycles