Lesson 1 Flashcards
Database
A shared, integrated computer structure that stores a collection of related data.
Data Management
A process that focuses on data collection, storage, and retrieval.
Common data management functions include addition, deletion, modification, and listing
What are the two types of data in a database?
- end-user data (raw facts)
- Metadata
Metadata
Data about data; that is, data about data characteristics and relationships
Database Management Systems (DBMS)
The collection of programs that manages the database structure and controls access to the data stored in the database.
What are six advantages of a DBMS?
- Improved data security
- Better data integration
- Minimized data inconsistency
- Improved data access
- Improved decision making
- Increased end-user productivity
Data inconsistency
A condition in which different versions of the same data yield different (inconsistent) results.
Query
- A question or task asked by an end user of a database in the form of SQL code.
- A specific request for data manipulation issued by the end user or the application to the DBMS.
Ad hoc query
A “spur-of-the-moment” question
Query result set
The collection of data rows returned by a query.
Data quality
A comprehensive approach to ensuring the accuracy, validity, and timeliness of data.
Single-user database
A database that supports only one user at a time
Desktop database
A single-user database that runs on a personal computer.
Multiuser database
A database that supports multiple concurrent users
Workgroup database
A multiuser database that usually supports:
- Fewer than 50 users or
- Used for a specific department in an organization.
Enterprise database
The overall company data representation, which provides support for present and expected future needs.
Centralized database
A database located at a single site
Distributed database
A logically related database that is stored in two or more physically independent sites.
Cloud database
A database that is created and maintained using cloud services, such as Microsoft Azure or Amazon AWS
General-purpose database
A database that contains a wide variety of data used in multiple disciplines.
Discipline-specific database
A database that contains data focused on specific subject areas
Operational database
A database designed primarily to support a company’s day-to-day operations.
An operational database is also known as: 3
- online transaction processing database (OLTP)
- Transactional database
- production database
Data warehouse
- A specialized database that stores historical and aggregated data.
- The data format is optimized for decision support.
Online analytical processing (OLAP)
retrieving, processing, and modeling
A set of tools that provide advanced data analysis for retrieving, processing, and modeling data from the data warehouse
Business intelligence
Cathy Can I See Aardvarks?
A set of tools and processes used to capture, collect, integrate, store, and analyze data to support business decision making
Unstructured data
Data that exists in its original, raw state; that is, in the format in which it was collected
Structured data
Data that has been formatted to facilitate:
storage, use, and information generation.
Digital data
It represents other forms of data using specific machine language systems that can be interpreted by various technologies
Semistructured data
Data that has already been processed to some extent
Extensible Markup Language (XML)
- A metalanguage used to represent and manipulate data elements.
- Unlike other markup languages, XML permits the manipulation of a document’s data elements.
XML database
A database system that stores and manages semistructured XML data.
NoSQL
A new generation of database management systems that is not based on the traditional relational database model
Database design
The second phase of the Database Life Cycle
The process that yields the description of the database structure and determines the database components.
Analytical Database
A database focused primarily on storing historical data and business metrics used for tactical or strategic decision making
Data processing (DP) specialist
The person responsible for developing and managing a computerized file processing system
Record
- Contains a set of fields that describes a person, place, or thing.
- It is a collection of data items arranged for processing by a program.
Field
- It is a data structure for a single piece of data.
- A field is a attibute (column) value
- Contains all the information within the table appropriate to a particular entity.
File
- While a file can contain tables, it can also contain other types of data, such as text, images, or rows.
- Stores data, information, settings, or commands that are used in computation.
What are the two file systems?
- Manual
- Computerized
Manual file system
Includes paper-and-pencil system to facilitate the expected use of data
Computerized file system
- Stores and organizes computer files and makes data easy to find and access them
- Generates reports and keeps track of complex data
Structural dependence
A data characteristic in which a change in the database schema affects data access, thus requiring changes in all access programs
Structural independence
A data characteristic in which changes in the database schema do not affect data access
Data dependence
A data condition in which data representation and manipulation are dependent on the physical data storage characteristics
Data independence
A condition in which data access is unaffected by changes in the physical data storage characteristics such as row order
Logical data format
The way a person views data within the context of a problem domain
Physical data format
The way a computer stores data.
How it see’s
Data redundancy
Exists when the same data is stored unnecessarily at different places
Uncontrolled data redundancy sets the stage for the following:
- Poor data security
- Data inconsistency
- Data-entry errors
- Data integrity problems
Data integrity
In a relational database, a condition in which the data in the database complies with all entity and referential integrity constraints.
Data anomaly
A data abnormality (irregularity) in which inconsistent changes have been made to a database.
For example, an employee moves, but the address change is not corrected in all files in the database.
Database system
- An organization of components in a database environment
- Defines and regulates the collection, storage, management, and use of data
From a general management point of view, the database system is composed of the five major parts:
Data dictionary
- The data dictionary contains data~ definitions, characteristics and relationships.
- May also include data that is external to the DBMS
Performance tuning
Activities that make a database perform more efficiently in terms of storage and access speed.
Query language
A nonprocedural language that is used by a DBMS to manipulate its data.
An example of a query language is SQL.
DBMS function: Security Management
- Sets rules that determine specific users that are allowed to access the database.
- This function also sets restraints on what specific data any user can see or manage.
DBMS function: Data transformation and presentation
- By using this function, the DBMS can determine the difference between logical and physical data formats.
- Exists to transform any data entered into required data structures.
DBMS function: data dictionary management
- Removes structural and data dependency and provides the user with data abstraction.
- The DBMS uses this function to look up the required data component structures and relationships.
DBMS software
Manages the database within the database system.
Some examples of DBMS software include Microsoft’s SQL Server, Oracle Corporation’s Oracle, Oracle’s MySQL, and IBM’s DB2
Application programs and utility software
used to access and manipulate data in the DBMS and to manage the computer environment in which data access and manipulation take place
System analysts and programmers
- Design and implement the application programs.
- They design and create the data-entry: screens, reports, and procedures.
- End users access- those designs and manipulate the database’s data
End Users
the people who use the application programs to run the organization’s daily operations
DBMS function: Multiuser access control
To provide data integrity and data consistency:
- The DBMS uses sophisticated algorithms to ensure that multiple users can access the database concurrently without compromising its integrity.
DBMS function: data integrity management
- Promotes and enforces integrity rules.
- Minimizing data redundancy and maximizing data consistency.
DBMS function: Database communication interfaces
- A current-generation feature
- DBMS accepts end-user requests via multiple, different network environments.
Ex: web browsers like Mozilla Firefox
____ ____ exists when different and conflicting versions of the same data appear in different places.
Data inconsistency
One disadvantage of the DBMS is that it:?
increases the risk of data security breaches
Data processing can be as simple as:
organizing data to reveal patterns
The database structure itself is stored as a collection of files, how would you access the data?
The only way to access the data in those files is through the DBMS.
Accurate, relevant, and timely information is the key to _______.
Data Management
Data is said to be verifiable if:
The data always yields consistent results
A(n) ______ database is used by an organization and supports many users across many departments.
Enterprise
A _____ contains at least all of the attribute names and characteristics for each table in the system.
Data dictionary
A(n) ____ _____ develops when all required changes in the redundant data are not made successfully
Data anormaly
Raw data must be properly ______ for storage, processing, and presentation
formatted
_______ are the result of formatting disorganized data in order to facilitate storage, use and generation of information.
Structured data
An XML database supports the storage and management of _____ XML data.
Semistructured
The term ______ __ _______ refers to scattered locations storing the same basic data.
Islands of information
Which of the following is true of spreadsheet applications?
They enhance the user’s ability to understand the data.
What makes today’s databases more complex & more powerful for organizing and using information than earlier ones?
- Connections: Linking related data together.
- Accuracy: Preventing errors and keeping data consistent.
- Sharing: Allowing many users at the same time.
- Protection: Strong security for sensitive information.
- Speed: Finding data quickly and efficiently.
The DBMS uses the _____ ____ to look up the required data component structures and relationships, thus relieving programmers from having to code such complex relationships in each program.
Data dictionary
Corporations use:
All sorts of data types including structured data.
Explain the difference between operational database and an enterprise database.
- Operational databases handle daily transactions (like orders and inventory).
- Enterprise databases are broader, storing various data types (including operational but also analytical and historical) for the entire organization.
While an operational database can be part of an enterprise database, they aren’t the same.
The organization of data within folders in a manual file system is determined by _____.
its expected use
Most data that can be encountered are best classified as _____ .
Semistructured
_____ refer to a type of database that stores most of its data in RAM rather than in hard disks.
In-memory databases
To reveal meaning, information requires ______
context
____ _____ is defined as the condition in which all of the data in the database are consistent with the real-world events and conditions.
Data integrity
________ is the result of processing raw data to reveal its meaning
Information
Raw data must be properly ______ for storage, processing and presentation.
formatted
A _____ is a logically connected set of one or more fields that describes a person, place, or thing.
Record
A data warehouse can store data derived from many sources. (T/F)
True
Metadata describe the data characteristics and the set of relationships that links the data found within the database. (T/F)
True
End-user data is ______.
Raw facts of interest to the end user
A(n) _____ database supports a relatively small number of users (usually fewer than 50) or a specific department within an organization.
Workgroup
Data processing is:
As simple as organizing data to reveal patterns.
_____ exists when different versions of the same data appear in different places.
Data inconsistency
_____ is the result of revealing the meaning of raw facts.
Information
________ ________ relates to activities that make a database operate more efficiently in terms of storage and access speed.
Performance Tuning
A ______ is a specific request issued to the DBMS for data manipulation.
Query
_____ relates to the activities that make the database execute transactions more efficiently in terms of storage and access speed.
Performance Tuning
________ is data about data through which the end-user data are integrated and managed.
Metadata
The DBMS
- A Database Management System is software that allows users to define, store, maintain, and manage data in a structured and efficient manner.
- It acts as an intermediary between data and users, allowing disparate data from different applications to be managed.
Record
- A single row of data within a table, containing all the information related to a specific entity.
- Essentially acting as a horizontal entry within the table structure
What refers to the situation where the same data is stored unnecessarily at different places?
Data redundancy
________ is the result of processing raw data to reveal its meaning
information
________ is a special language used to represent and manipulate data elements in a textual format.
Extensible Markup Language (XML)
Database programming languages receive all application requests and translate them into the complex operations required to fulfill those requests. (T/F)
False
What is Data?
Data constitutes the building blocks of information.
The same data might be structured how?
The same data might be simultaneously structured and unstructured depending on the intended processing.
Why do you database management systems need updates?
Database systems are like any other software on your computer. They need regular updates to fix problems, improve security, and add new features.
The database structure in a DBMS is stored as a ______.
collection of files
The response of the DBMS to a query is the ______.
query result set
_____ data exist in the format in which they were collected.
Unstructured
_____ data exist in a format that does not lend itself to processing that yields information.
Unstructured
A _______ ________ ______ is a collection of programs that manages the database structure and controls access to the data stored in the database.
database management system
Structural dependence
- The application program is affected by changes in the file structure.
- If you change how the data is organized (schema), the program needs to be updated to understand the new organization in order to access the data correctly.
An ________ is a spur-of-the-moment question
ad hoc query
________ exists when it is possible to make changes in the data storage characteristics without affecting an application program’s ability to access data.
data independence
__________ exists when the same data are stored unnecessarily at different places.
Data redundancy
A desktop database is a _____ database
single-user
_____ serve as the intermediary between the user and the database.
DBMSs
Data warehouse contains historical data obtained from the_____.
Operational Databases
A _____ is a collection of related records.
File
A(n) _______ _______ develops when all required changes in the redundant data are not made successfully.
Data anomaly
____ ________ exists when different and conflicting versions of the same data appear in different places.
Data inconsistency
The term _____ refers to an organization of components that define and regulate the collection, storage, management and use of data within a database environment.
Database system
Data Integrity
Defined as the condition in which all of the data in the database are consistent with the real-world events and conditions.
A _____ is a character or group of characters that has a specific meaning.
Field
_______ databases focus primarily on storing data used to generate information required to make tactical or strategic decisions.
Analytical
______ provide a description of the data characteristics and the set of relationships that link the data found within the database.
Metadata
A(n) _____ might be written by a programmer or it might be created through a DBMS utility program.
application
A workgroup database is a(n) _____ database.
multiuser