Lesson 1 Flashcards
Database
A shared, integrated computer structure that stores a collection of related data.
Data Management
A process that focuses on data collection, storage, and retrieval. Common data management functions include addition, deletion, modification, and listing
What are the two types of data in a database?
- end-user data (raw facts)
- Metadata
Metadata
Data about data; that is, data about data characteristics and relationships
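Example: a minimal sketch of the difference between end-user data and metadata, assuming Python's built-in sqlite3 module and a hypothetical customer table (the table and column names are illustrative, not from the lesson).

```python
import sqlite3

# In-memory database with a hypothetical "customer" table.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT NOT NULL, city TEXT)"
)
conn.execute("INSERT INTO customer (name, city) VALUES ('Ana', 'Lisbon')")

# End-user data: the raw facts stored in the table.
end_user_data = conn.execute("SELECT id, name, city FROM customer").fetchall()

# Metadata: data about the data, here each column's name and declared type.
# PRAGMA table_info rows are (cid, name, type, notnull, dflt_value, pk).
metadata = [(row[1], row[2]) for row in conn.execute("PRAGMA table_info(customer)")]

print(end_user_data)
print(metadata)
```

The first list holds the facts an end user cares about; the second describes how those facts are structured.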
Database Management Systems (DBMS)
The collection of programs that manages the database structure and controls access to the data stored in the database.
What are the advantages of a DBMS?
- Improved data security
- Better data integration
- Minimized data inconsistency
- Improved data access
- Improved decision making
- Increased end-user productivity
Data inconsistency
A condition in which different versions of the same data yield different (inconsistent) results.
Query
A question or task asked by an end user of a database in the form of SQL code. A specific request for data manipulation issued by the end user or the application to the DBMS.
Ad hoc query
A “spur-of-the-moment” question
Query result set
The collection of data rows returned by a query.
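Example: a minimal sketch of a query and its result set, assuming Python's built-in sqlite3 module and a hypothetical product table invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE product (name TEXT, price REAL)")
conn.executemany(
    "INSERT INTO product VALUES (?, ?)",
    [("pen", 1.5), ("notebook", 4.0), ("stapler", 9.25)],
)

# The query: a specific request sent to the DBMS, written in SQL.
query = "SELECT name, price FROM product WHERE price > ? ORDER BY price"

# The query result set: the rows the DBMS returns for that request.
result_set = conn.execute(query, (2.0,)).fetchall()
print(result_set)
```

Changing the threshold on the spot, without writing a new program, is what makes a request like this an ad hoc query.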
Data quality
A comprehensive approach to ensuring the accuracy, validity, and timeliness of data.
Single-user database
A database that supports only one user at a time
Desktop database
A single-user database that runs on a personal computer.
Multiuser database
A database that supports multiple concurrent users
Workgroup database
A multiuser database that usually supports fewer than 50 users or is used for a specific department in an organization.
Enterprise database
The overall company data representation, which provides support for present and expected future needs.
Centralized database
A database located at a single site
Distributed database
A logically related database that is stored in two or more physically independent sites.
Cloud database
A database that is created and maintained using cloud services, such as Microsoft Azure or Amazon Web Services (AWS)
General-purpose database
A database that contains a wide variety of data used in multiple disciplines.
Discipline-specific database
A database that contains data focused on specific subject areas
Operational database
A database designed primarily to support a company’s day-to-day operations.
An operational database is also known as:
- online transaction processing database (OLTP)
- Transactional database
- production database
Data warehouse
A specialized database that stores historical and aggregated data in a format optimized for decision support.
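Example: a rough sketch of how day-to-day operational rows might be aggregated into the kind of historical summary a data warehouse holds. It assumes Python's built-in sqlite3 module; the sale and monthly_sales tables are hypothetical, and a real warehouse would be a separate system loaded by a dedicated process.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Operational (transactional) data: one row per individual sale.
conn.execute("CREATE TABLE sale (sold_on TEXT, region TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sale VALUES (?, ?, ?)",
    [
        ("2023-01-05", "North", 100),
        ("2023-01-19", "North", 50),
        ("2023-02-02", "South", 70),
        ("2023-02-11", "North", 30),
    ],
)

# Warehouse-style data: historical totals aggregated by month and region.
conn.execute(
    """
    CREATE TABLE monthly_sales AS
    SELECT substr(sold_on, 1, 7) AS month, region, SUM(amount) AS total
    FROM sale
    GROUP BY substr(sold_on, 1, 7), region
    """
)

summary = conn.execute(
    "SELECT month, region, total FROM monthly_sales ORDER BY month, region"
).fetchall()
print(summary)
```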
Online analytical processing (OLAP)
A set of tools that provide advanced data analysis for retrieving, processing, and modeling data from the data warehouse
Business intelligence
A set of tools and processes used to capture, collect, integrate, store, and analyze data to support business decision making
Unstructured data
Data that exists in its original, raw state; that is, in the format in which it was collected
Structured data
Data that has been formatted to facilitate storage, use, and information generation.
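Example: a minimal sketch of turning unstructured data into structured data. The raw line and the structure function are hypothetical, and the parsing assumes every line has exactly this five-word shape.

```python
# Unstructured: the text exactly as it was collected.
raw = "2024-03-01 Ana bought 3 pens"


def structure(line):
    """Format a raw line into named, typed fields (assumes a fixed 5-word layout)."""
    date, customer, _verb, quantity, item = line.split()
    return {
        "date": date,
        "customer": customer,
        "quantity": int(quantity),  # stored as a number so it can be summed
        "item": item,
    }


# Structured: the same facts, formatted for storage and information generation.
structured = structure(raw)
print(structured)
```

Once the quantity is a number rather than part of a sentence, it can be totaled or averaged, which is the point of formatting the data.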
Digital data
Data that represents other forms of data in a machine-readable encoding that various technologies can interpret
Semistructured data
Data that has already been processed to some extent
Extensible Markup Language (XML)
A metalanguage used to represent and manipulate data elements. Unlike other markup languages, XML permits the manipulation of a document’s data elements.
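Example: a minimal sketch of representing and manipulating data elements in XML, assuming Python's built-in xml.etree.ElementTree module. The order document is made up for illustration.

```python
import xml.etree.ElementTree as ET

# A small, hypothetical XML document: tags describe the data they contain.
doc = "<order id='17'><item sku='P1'>pen</item><item sku='N2'>notebook</item></order>"
root = ET.fromstring(doc)

# Read data elements by name.
skus = [item.get("sku") for item in root.findall("item")]

# Manipulate a data element: change the text of the first <item>.
root.find("item").text = "blue pen"
updated = ET.tostring(root, encoding="unicode")

print(skus)
print(updated)
```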
XML database
A database system that stores and manages semistructured XML data.
NoSQL
A new generation of database management systems that is not based on the traditional relational database model
Database design
The process that yields the description of the database structure and determines the database components. The second phase of the Database Life Cycle
Analytical Database
A database focused primarily on storing historical data and business metrics used for tactical or strategic decision making
Data processing (DP) specialist
The person responsible for developing and managing a computerized file processing system
Record
Contains a set of fields that describes a person, place, or thing. It is a collection of data items arranged for processing by a program.
Field
A character or group of characters (alphabetic or numeric) that has a specific meaning. It is used to define and store a single piece of data.
File
Stores data, information, settings, or commands that are used in computation. It is a collection of related records.
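Example: a minimal sketch of how file, record, and field relate, assuming Python's built-in csv module. The file contents are hypothetical and held in a string so the example is self-contained.

```python
import csv
import io

# File: a collection of related records (here, CSV text standing in for a file).
file_contents = "name,city\nAna,Lisbon\nBen,Oslo\n"

# Record: one row, a set of fields describing one person.
records = list(csv.DictReader(io.StringIO(file_contents)))

# Field: a single piece of data within a record.
field = records[0]["city"]

print(len(records), "records")
print(records[0])
print(field)
```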
What are the two file systems?
- Manual
- Computerized
Manual file system
A paper-and-pencil system organized to facilitate the expected use of the data
Computerized file system
- Stores and organizes computer files so that data is easy to find and access
- Generates reports and keeps track of complex data
Structural dependence
A data characteristic in which a change in the database schema affects data access, thus requiring changes in all access programs
Structural independence
A data characteristic in which changes in the database schema do not affect data access
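Example: a minimal sketch of structural independence, assuming Python's built-in sqlite3 module. The employee table and the employee_names function are hypothetical stand-ins for a schema and an access program.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employee (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO employee (name) VALUES ('Ana')")


def employee_names(connection):
    """An 'access program' that names the columns it needs."""
    rows = connection.execute("SELECT name FROM employee ORDER BY id")
    return [row[0] for row in rows]


before = employee_names(conn)

# Change the schema: add a column the access program knows nothing about.
conn.execute("ALTER TABLE employee ADD COLUMN phone TEXT")

# The same program still works without modification.
after = employee_names(conn)
print(before, after)
```

In a file system with structural dependence, adding the phone field would have required changing every program that reads the file.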
Data dependence
A data condition in which data representation and manipulation are dependent on the physical data storage characteristics
Data independence
A condition in which data access is unaffected by changes in the physical data storage characteristics
Logical data format
The way a person views data within the context of a problem domain
Physical data format
The way a computer “sees” (stores) data.
Data redundancy
Exists when the same data is stored unnecessarily at different places
Uncontrolled data redundancy sets the stage for the following:
- Poor data security
- Data inconsistency
- Data-entry errors
- Data integrity problems
Data integrity
In a relational database, a condition in which the data in the database complies with all entity and referential integrity constraints.
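Example: a minimal sketch of a DBMS enforcing referential integrity, assuming Python's built-in sqlite3 module. The department and employee tables are hypothetical. SQLite enforces foreign keys only after PRAGMA foreign_keys = ON is issued on the connection.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves this off by default

conn.execute("CREATE TABLE department (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
conn.execute(
    """
    CREATE TABLE employee (
        id INTEGER PRIMARY KEY,
        name TEXT NOT NULL,
        department_id INTEGER NOT NULL REFERENCES department(id)
    )
    """
)
conn.execute("INSERT INTO department VALUES (1, 'Sales')")
conn.execute("INSERT INTO employee VALUES (1, 'Ana', 1)")  # valid reference

# Department 99 does not exist, so this row would break referential integrity.
try:
    conn.execute("INSERT INTO employee VALUES (2, 'Ben', 99)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

employee_count = conn.execute("SELECT COUNT(*) FROM employee").fetchone()[0]
print(rejected, employee_count)
```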
Data anomaly
A data abnormality in which inconsistent changes have been made to a database. For example, an employee moves, but the address change is not corrected in all files in the database.
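Example: a minimal sketch of the address anomaly described above, assuming Python's built-in sqlite3 module. The payroll and benefits tables are hypothetical stand-ins for two files that store the same address redundantly.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# The same address is stored in two places (uncontrolled redundancy).
conn.execute("CREATE TABLE payroll (employee TEXT, address TEXT)")
conn.execute("CREATE TABLE benefits (employee TEXT, address TEXT)")
conn.execute("INSERT INTO payroll VALUES ('Ana', '1 Old Road')")
conn.execute("INSERT INTO benefits VALUES ('Ana', '1 Old Road')")

# The employee moves, but only one copy is updated.
conn.execute("UPDATE payroll SET address = '9 New Street' WHERE employee = 'Ana'")

# The two copies now disagree: a data anomaly leading to data inconsistency.
mismatches = conn.execute(
    """
    SELECT p.employee, p.address, b.address
    FROM payroll AS p
    JOIN benefits AS b ON b.employee = p.employee
    WHERE p.address <> b.address
    """
).fetchall()
print(mismatches)
```

Storing the address once and referencing it from both places would make this kind of partial update impossible.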
Database system
An organization of components that defines and regulates the collection, storage, management, and use of data in a database environment
From a general management point of view, the database system is composed of five major parts:
- Hardware
- Software
- People
- Procedures
- Data
Data dictionary
The data dictionary contains data definitions as well as data characteristics and relationships. May also include data that is external to the DBMS
Performance tuning
Activities that make a database perform more efficiently in terms of storage and access speed
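Example: a rough sketch of one common tuning activity, adding an index, assuming Python's built-in sqlite3 module. The customer table and the index name idx_customer_city are hypothetical, and the exact wording of the query plan varies between SQLite versions, so the code only checks whether the plan mentions the index.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT, city TEXT)")
conn.executemany(
    "INSERT INTO customer (name, city) VALUES (?, ?)",
    [("Ana", "Lisbon"), ("Ben", "Oslo"), ("Cy", "Oslo")],
)


def plan(connection):
    """Return the human-readable steps SQLite would use for the lookup."""
    rows = connection.execute(
        "EXPLAIN QUERY PLAN SELECT name FROM customer WHERE city = 'Oslo'"
    )
    return [row[3] for row in rows]  # the detail text is the fourth column


plan_before = plan(conn)  # no index on city yet

# Tuning step: trade some storage for faster access by city.
conn.execute("CREATE INDEX idx_customer_city ON customer (city)")
plan_after = plan(conn)

print(plan_before)
print(plan_after)
```

The index speeds up access at the cost of extra storage and slower writes, which is why tuning weighs storage against access speed.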
Query language
A nonprocedural language that is used by a DBMS to manipulate its data. An example of a query language is SQL.
DBMS function: Security Management
Sets rules that determine which users are allowed to access the database. This function also restricts which data each user can see or manage.
DBMS function: Data transformation and presentation
Transforms entered data to conform to the required data structures. With this function, the DBMS relieves the user of having to distinguish between logical and physical data formats.
DBMS function: data dictionary management
Removes structural and data dependence and provides the user with data abstraction. The DBMS uses this function to look up the required data component structures and relationships.
DBMS software
manages the database within the database system. Some examples of DBMS software include Microsoft’s SQL Server, Oracle Corporation’s Oracle, Oracle’s MySQL, and IBM’s DB2
Application programs and utility software
used to access and manipulate data in the DBMS and to manage the computer environment in which data access and manipulation take place
System analysts and programmers
design and implement the application programs. They design and create the data-entry screens, reports, and procedures through which end users access and manipulate the database’s data
End Users
the people who use the application programs to run the organization’s daily operations
DBMS function: Multiuser access control
To provide data integrity and data consistency, the DBMS uses sophisticated algorithms to ensure that multiple users can access the database concurrently without compromising its integrity.
DBMS function: data integrity management
promotes and enforces integrity rules, thus minimizing data redundancy and maximizing data consistency.
DBMS function: Database communication interfaces
A current-generation DBMS accepts end-user requests via multiple, different network environments, for example through web browsers such as Mozilla Firefox.
____ ____ exists when different and conflicting versions of the same data appear in different places.
Data inconsistency
One disadvantage of the DBMS is that it increases the risk of data security breaches (T/F)
True
Data processing can be as simple as organizing data to reveal patterns (T/F)
True
The database structure itself is stored as a collection of files, and the only way to access the data in those files is through the DBMS. (T/F)
True
Accurate, relevant, and timely information is the key to _______.
Good decision making
Data is said to be verifiable if:
The data always yields consistent results
A(n) ______ database is used by an organization and supports many users across many departments.
Enterprise
A _____ contains at least all of the attribute names and characteristics for each table in the system.
Data dictionary
A(n) ____ _____ develops when all required changes in the redundant data are not made successfully
Data anomaly
Raw data must be properly ______ for storage, processing, and presentation
formatted
_______ are the result of formatting disorganized data in order to facilitate storage, use and generation of information.
Structured data
An XML database supports the storage and management of _____ XML data.
Semistructured
The term ______ __ _______ refers to scattered locations storing the same basic data.
Islands of information
Which of the following is true of spreadsheet applications?
They enhance the user’s ability to understand the data.
One advantage of a database system over previous data management approaches is that the database system is considerably less complex. (T/F)
False
The DBMS uses the _____ ____ to look up the required data component structures and relationships, thus relieving programmers from having to code such complex relationships in each program.
Data dictionary
Corporations use only structured data. (T/F)
False
An operational database is sometimes referred to as an enterprise database. (T/F)
False
The organization of data within folders in a manual file system is determined by _____.
its expected use
Most data that can be encountered are best classified as _____ .
Semistructured
_____ refer to a type of database that stores most of its data in RAM rather than in hard disks.
In-memory databases
To reveal meaning, information requires ______
context
____ _____ is defined as the condition in which all of the data in the database are consistent with the real-world events and conditions.
Data integrity
________ is the result of processing raw data to reveal its meaning
Information
A _____ is a logically connected set of one or more fields that describes a person, place, or thing.
Record
A data warehouse can store data derived from many sources. (T/F)
True
Metadata describe the data characteristics and the set of relationships that links the data found within the database. (T/F)
True
End-user data is ______.
Raw facts of interest to the end user
A(n) _____ database supports a relatively small number of users (usually fewer than 50) or a specific department within an organization.
Workgroup
_____ exists when different versions of the same data appear in different places.
Data inconsistency
_____ is the result of revealing the meaning of raw facts.
Information
________ ________ relates to activities that make a database operate more efficiently in terms of storage and access speed.
Performance Tuning
A ______ is a specific request issued to the DBMS for data manipulation.
Query
_____ relates to the activities that make the database execute transactions more efficiently in terms of storage and access speed.
Performance Tuning
________ is data about data through which the end-user data are integrated and managed.
Metadata
The DBMS reveals much of the database’s internal complexity to the application programs and users. (T/F)
False
Field refers to a collection of related records. (T/F)
False
Which of the following refers to the situation where the same data is stored unnecessarily at different places?
Data redundancy
________ is a special language used to represent and manipulate data elements in a textual format.
Extensible Markup Language (XML)
Database programming languages receive all application requests and translate them into the complex operations required to fulfill those requests. (T/F)
False
Data constitute the building blocks of information. (T/F)
True
The same data might be simultaneously structured and unstructured depending on the intended processing. (T/F)
True
An advantage of database systems is that you don’t need to perform frequent updates and apply the latest patches. (T/F)
False
The database structure in a DBMS is stored as a ______.
collection of files
The response of the DBMS to a query is the ______.
query result set
_____ data exist in the format in which they were collected.
Unstructured
_____ data exist in a format that does not lend itself to processing that yields information.
Unstructured
A _______ ________ ______ is a collection of programs that manages the database structure and controls access to the data stored in the database.
database management system
Structural dependence exists when it is possible to make changes in the file structure without affecting the application program’s ability to access the data. (T/F)
False
An ________ is a spur-of-the-moment question
ad hoc query
________ exists when it is possible to make changes in the data storage characteristics without affecting an application program’s ability to access data.
data independence
__________ exists when the same data are stored unnecessarily at different places.
Data redundancy
A desktop database is a _____ database
single-user
_____ serve as the intermediary between the user and the database.
DBMSs
A data warehouse contains historical data obtained from the _____.
Operational Databases
A _____ is a collection of related records.
File
A(n) _______ _______ develops when all required changes in the redundant data are not made successfully.
Data anomaly
The term _____ refers to an organization of components that define and regulate the collection, storage, management and use of data within a database environment.
Database system
Data anomaly is defined as the condition in which all of the data in the database are consistent with the real-world events and conditions. (T/F)
False
A _____ is a character or group of characters that has a specific meaning.
Field
_______ databases focus primarily on storing data used to generate information required to make tactical or strategic decisions.
Analytical
______ provide a description of the data characteristics and the set of relationships that link the data found within the database.
Metadata
A(n) _____ might be written by a programmer or it might be created through a DBMS utility program.
application
A workgroup database is a(n) _____ database.
multiuser