Chapter 5 Flashcards
It is a managerial activity that applies information systems technologies like database management, data warehousing, and other data management tools to the task of managing an organization’s data resources to meet the information needs of its business stakeholders.
Data Resource Management
A vital organizational resource that needs to be managed like other important business assets.
Data
store detailed data needed to support the business processes and operations of a company
Operational Databases
are also called subject area databases (SADB), transaction databases, and production databases.
Operational Databases
Examples are a customer database, human resource database, inventory database, and other databases containing data generated by business operations.
Operational Databases
Many organizations replicate and distribute copies or parts of databases to network servers at a variety of sites.
Distributed Databases
These __________ can reside on network servers on the World Wide Web, on corporate intranets or extranets, or on other company networks.
distributed databases
may be copies of operational or analytical databases, hypermedia or discussion databases, or any other type of database.
Distributed databases
Replication and distribution of databases improve database performance at end-user worksites.
Distributed databases
Ensuring that the data in an organization’s distributed databases are consistently and concurrently updated is a major challenge of distributed database management.
Distributed databases
Advantages of Distributed Databases
- Protection of valuable data
- Storage requirements
Challenges of Distributed Databases
- Maintenance of data accuracy
- Replication
- Duplication
- Extra computing power and bandwidth necessary to access multiple databases in multiple locations
A large database system may be distributed into smaller databases based on some logical relationship between the data and the location.
Storage Requirements
If a company distributes its database to multiple locations, any change to the data in one location must somehow be updated in all other locations. This updating can be accomplished in one of two ways: replication or duplication.
Maintenance of data accuracy
Involves using a specialized software application that looks at each distributed database and then finds the changes made to it.
Replication
Once these changes have been identified, the ______ makes all of the distributed databases look the same by making the appropriate changes to each one.
Replication process
The _____ is very complex and, depending on the number and size of the distributed databases, can consume a lot of time and computer resources.
Replication process
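The replication cards above can be illustrated with a minimal sketch. It assumes each record carries a version counter so the newest copy can be identified; the cards do not say how changes are detected, so that counter, the function name replicate, and the sample data are all hypothetical.

```python
def replicate(sites):
    """Make every site look the same by merging the changes found at each one.

    Each site maps a record key to a (version, value) pair. The version
    counter is an assumption of this sketch; the highest version wins.
    """
    merged = {}
    # Step 1: look at each distributed database and find its newest records.
    for site in sites:
        for key, (version, value) in site.items():
            if key not in merged or version > merged[key][0]:
                merged[key] = (version, value)
    # Step 2: apply the appropriate changes to each site.
    for site in sites:
        site.clear()
        site.update(merged)
    return merged


# Hypothetical customer records held at two locations.
site_a = {"C1": (2, "Ana Cruz"), "C2": (1, "Ben Lim")}
site_b = {"C1": (1, "Ana C."), "C3": (1, "Cara Uy")}
replicate([site_a, site_b])
```

Every record at every site has to be examined, which is one way to see why the process can consume a lot of time and computer resources as the number and size of the databases grow.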
is much less complicated.
Duplication
identifies one database as a master and then duplicates that database at a prescribed time after hours so that each distributed location has the same data.
Duplication
One drawback to the duplication process is that no changes can ever be made to any database other than the master to avoid having local changes overwritten during the duplication process.
Duplication
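For comparison, duplication can be sketched in a few lines. This is only an illustration under assumed names (duplicate, master, locations); it also shows the drawback named above, because a local change made outside the master is overwritten.

```python
import copy


def duplicate(master, locations):
    """Overwrite every distributed location with a copy of the master."""
    for name in locations:
        # Each location gets its own independent copy of the master data.
        locations[name] = copy.deepcopy(master)


# Hypothetical inventory counts. "north" holds a local change to P2.
master = {"P1": 10, "P2": 4}
locations = {"north": {"P1": 10, "P2": 99}, "south": {"P1": 10}}
duplicate(master, locations)
```

After the run, the local value 99 for P2 at "north" is gone, which is why changes should be made only to the master.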
Websites provide an endless variety of hyperlinked pages of multimedia documents in hypermedia databases for us to access.
External Databases
Data are available in the form of statistics on economic and demographic activity from statistical databanks, or we can view or download abstracts or complete copies of hundreds of newspapers, magazines, newsletters, research papers, and other published material and periodicals from bibliographic and full-text databases.
External Databases
The rapid growth of Web sites on the Internet and corporate intranets and extranets has dramatically increased the use of databases of hypertext and hypermedia documents.
Hypermedia Databases
A Website stores such information in a hypermedia database consisting of hyperlinked pages of multimedia (text, graphic and photographic images, video clips, audio segments, and so on).
Hypermedia Databases
From a database management point of view, the set of interconnected multimedia pages on a Website is a database of interrelated hypermedia page elements, rather than interrelated data records.
Hypermedia Databases
A _________ stores data that have been extracted from the various operational, external, and other databases of an organization.
Data warehouse
It is a central source of the data that have been cleaned, transformed, and cataloged so that they can be used by managers and other business professionals for data mining, online analytical processing, and other forms of business analysis, market research, and decision support.
Data Warehousing
______ may be subdivided into data marts, which hold subsets of data from the warehouse that focus on specific aspects of a company, such as a department or a business process.
Data warehouses
Data from various operational and external databases are captured, cleaned, and transformed into data that can be better used for analysis.
Data Acquisition
This ______ might include activities like consolidating data from several sources, filtering out unwanted data, correcting incorrect data, converting data to new data elements, or aggregating data into new data subsets.
acquisition process
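The acquisition activities listed above (consolidate, filter, correct, convert, aggregate) can be walked through on a tiny example. This is a sketch with made-up sales rows; the field names region and amount and the function acquire are assumptions for illustration only.

```python
def acquire(sources):
    """Consolidate, filter, correct, convert, and aggregate raw sales rows."""
    # Consolidate data from several sources into one list.
    rows = [row for source in sources for row in source]
    # Filter out unwanted data (here: rows with no amount recorded).
    rows = [r for r in rows if r.get("amount") is not None]
    cleaned = []
    for r in rows:
        # Correct incorrect data: normalize inconsistent region spellings.
        region = r["region"].strip().title()
        # Convert to a new data element: text amounts become integer cents.
        amount_cents = round(float(r["amount"]) * 100)
        cleaned.append({"region": region, "amount_cents": amount_cents})
    # Aggregate into a new data subset: total sales per region.
    totals = {}
    for r in cleaned:
        totals[r["region"]] = totals.get(r["region"], 0) + r["amount_cents"]
    return totals


# Hypothetical rows from an operational and an external database.
operational = [{"region": " north", "amount": "12.50"},
               {"region": "South", "amount": None}]
external = [{"region": "NORTH", "amount": "7.25"},
            {"region": "south", "amount": "3.00"}]
totals = acquire([operational, external])
```

With these inputs, totals holds 1975 cents for North and 300 cents for South.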
These data are then stored in the _______________, from which they can be moved into ________ or to an ___________ that holds data in a more useful form for certain types of analysis.
- enterprise data warehouse
- data marts
- analytical data store
Data that define the data in the data warehouse are stored in a metadata repository and cataloged by a metadata directory.
Metadata
A variety of ___________ can be provided to query, report, mine, and analyze the data for delivery via Internet and intranet Web systems to business end users.
analytical software tools
One important characteristic about the data in a data warehouse is that, unlike a typical database in which ___________, data in a data warehouse are _________, which means that once the data are gathered up, formatted for storage, and stored in the data warehouse, they will never change.
- changes can occur constantly
- static
The data in a data warehouse are analyzed to reveal hidden patterns and trends in historical business activity.
Data Mining
This analysis can be used to help managers make decisions about strategic changes in business operations to gain competitive advantages in the marketplace.
Data Mining
can discover new correlations, patterns, and trends in vast amounts of business data (frequently several terabytes of data) stored in data warehouses
Data mining
It uses advanced pattern recognition algorithms, as well as a variety of mathematical and statistical techniques, to sift through mountains of data to extract previously unknown strategic business information.
Data mining software
The process of reducing something’s strength or effectiveness through sustained attack or pressure.
attrition
Enumeration:
For example, many companies use data mining to:
● Perform market-basket analysis to identify new product bundles.
● Find root causes of quality or manufacturing problems.
● Prevent customer attrition and acquire new customers.
● Cross-sell to existing customers.
● Profile customers with more accuracy.
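Market-basket analysis, the first use in the list above, can be sketched as counting which pairs of items are bought together. Real data mining software uses far more advanced algorithms; this simple pair count, the function frequent_pairs, and the sample baskets are illustrative assumptions.

```python
from collections import Counter
from itertools import combinations


def frequent_pairs(baskets, min_support):
    """Return item pairs that appear together in at least min_support baskets."""
    counts = Counter()
    for basket in baskets:
        # Sort and de-duplicate so each pair is counted once per basket.
        for pair in combinations(sorted(set(basket)), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_support}


# Hypothetical purchase records.
baskets = [
    ["bread", "milk", "eggs"],
    ["bread", "milk"],
    ["milk", "eggs"],
    ["bread", "milk", "butter"],
]
pairs = frequent_pairs(baskets, min_support=2)
```

Here bread and milk appear together in three baskets and eggs and milk in two, so those pairs are candidates for a product bundle.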
consolidates data records, formerly held in separate files, into databases that can be accessed by many different application programs.
database management approach
serves as a software interface between users and databases, which helps users easily access the data in a database.
database management system (DBMS)
was conceived as the foundation of modern methods for managing organizational data.
database management approach
involves the use of database management software to control how databases are created, interrogated, and maintained to provide information that end users need.
database management
is the main software tool of the database management approach because it controls the creation, maintenance, and use of the databases of an organization and its end users
database management system (DBMS)
Enumeration:
Examples of popular DBMS software
- IBM’s DB2 Universal Database
- Oracle 10g by Oracle Corp.
- MySQL (a popular open-source DBMS)
Language and graphical tools to define entities, relationships, integrity constraints, and authorization rights.
Database Definition
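Database definition can be illustrated with SQL run through Python's built-in sqlite3 module. SQLite is used here only because it needs no setup; it is not one of the DBMS products listed above, and it has no authorization rights (GRANT), so that part of the card is not shown. The table and column names are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite enforces foreign keys only when this setting is switched on.
conn.execute("PRAGMA foreign_keys = ON")

# Entity: department.
conn.execute(
    "CREATE TABLE department ("
    " dept_id INTEGER PRIMARY KEY,"
    " name TEXT NOT NULL UNIQUE)"
)
# Entity: employee, with a relationship to department (foreign key)
# and integrity constraints (NOT NULL, CHECK).
conn.execute(
    "CREATE TABLE employee ("
    " emp_id INTEGER PRIMARY KEY,"
    " name TEXT NOT NULL,"
    " salary INTEGER CHECK (salary > 0),"
    " dept_id INTEGER NOT NULL REFERENCES department(dept_id))"
)

conn.execute("INSERT INTO department VALUES (1, 'Sales')")
conn.execute("INSERT INTO employee VALUES (1, 'Ana', 30000, 1)")

# An employee pointing at a department that does not exist is rejected.
try:
    conn.execute("INSERT INTO employee VALUES (2, 'Ben', 25000, 99)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

employee_count = conn.execute("SELECT COUNT(*) FROM employee").fetchone()[0]
conn.close()
```

The DBMS, not the application program, refuses the invalid row, which is the point of defining constraints in the database itself.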
Language and graphical tools to access data without complicated coding
Nonprocedural Access
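Nonprocedural access means stating what data is wanted and leaving the how to the DBMS. A minimal sketch with an SQL query, again using Python's built-in sqlite3 module as a stand-in for a full DBMS; the customer table and its rows are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT NOT NULL, city TEXT)"
)
conn.executemany(
    "INSERT INTO customer (name, city) VALUES (?, ?)",
    [("Ana", "Manila"), ("Ben", "Cebu"), ("Cara", "Manila")],
)

# The query says WHAT is wanted (names of customers in one city, sorted).
# It contains no loops or file handling; the DBMS decides how to find the rows.
query = "SELECT name FROM customer WHERE city = ? ORDER BY name"
names = [row[0] for row in conn.execute(query, ("Manila",))]
conn.close()
```

The result is the list of the two Manila customers, Ana and Cara.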
Graphical tools to develop menus, data entry forms, and reports
Application Development
Language that combines nonprocedural access with full capabilities of a programming language
Procedural Language Interface
Control mechanisms to prevent interference from simultaneous users and recover lost data after a failure
Transaction Processing
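One part of transaction processing, undoing a half-finished change after a failure, can be sketched with Python's built-in sqlite3 module. This does not demonstrate control of simultaneous users; the account table, the transfer function, and the amounts are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE account (id TEXT PRIMARY KEY, balance INTEGER CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO account VALUES (?, ?)", [("A", 100), ("B", 50)])
conn.commit()


def transfer(conn, src, dst, amount):
    """Move amount from src to dst as one all-or-nothing transaction."""
    try:
        with conn:  # commits if the block succeeds, rolls back on an exception
            conn.execute(
                "UPDATE account SET balance = balance + ? WHERE id = ?", (amount, dst)
            )
            conn.execute(
                "UPDATE account SET balance = balance - ? WHERE id = ?", (amount, src)
            )
        return True
    except sqlite3.IntegrityError:
        # The CHECK constraint failed, so the credit to dst was rolled back too.
        return False


ok = transfer(conn, "A", "B", 30)       # succeeds
failed = transfer(conn, "A", "B", 500)  # would overdraw A, so nothing changes
balances = dict(conn.execute("SELECT id, balance FROM account"))
conn.close()
```

After both calls, A holds 70 and B holds 80: the second transfer left no partial update behind even though its first statement had already run.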
Tools to monitor and improve database performance
Database Tuning