Chapter 5 Flashcards
what does high quality mean in terms of data
accurate, complete, timely, consistent, accessible, relevant, and concise
Why is managing data difficult (general)?
data are processed in several stages and often in multiple locations
Why is managing data difficult (specific)?
- amount of data increase exponentially with time
- data are also scattered throughout organizations (collected by different individuals using different methods, thus data is stored in many locations and servers and in different systems, databases, formats, and languages(human and computer))
- data are generated from multiple sources
- new sources of data are constantly being developed
- data are subject to data rot
- data security, quality and integrity are critical but easily jeopardized
- orgs have different ISs for specifc business processes, and this impose unqiue requirements on data
- federal gov regulation
- companies are drwoning in much unstructred data
What are some sources where data comes from?
- internal sources (ex. corporate databases and company documents)
- personal sources (ex. personal thoughts, opinions and experiences)
- external sources (ex. commercial databases, gov reports, corporate websites)
- the web (clickstream data)
def. clickstream data
Data collected about user behaviour and browsing patterns by monitoring users’ activities when they visit a website. (click on hyperlinks)
What are some examples of data degrading overtime?
customers move to new addresses/change names, companies go out of business, new products are developed, companies expand into new countries, employees are hired or fired
What is data rot? What are its two aspects?
refers primarily to problems with the media on which the data are stored
Aspects
Physical problems :Over time, temperature, humidity, and exposure to light can cause physical problems with storage media and thus make it difficult to access the data.
Difficulty finding the machines needed to access the data
What is the impact on data of having ISs develop over time?
Information systems that specifically support these processes impose unique requirements on data, which results in repetition and conflicts across the organization
-ex. he marketing function might maintain information on customers, sales territories, and markets. These data might be duplicated within the billing or customer service functions. This situation can produce inconsistent data within the enterprise
What does inconsistent data prevent a company from developing?
a unified view of core business information (data concerning customers, products, finances, etc.) across the org and its ISs
What is the most significant government regulation affecting data?
Bill 198
requires:
(1)public companies evaluate and disclose the effectiveness of their internal financial controls
(2)independent auditors for these companies agree to this disclosure
-also holds CEOs and CFOs personally responcible for these diclosures
How do gov regulations impact data?
they require companies to account for how information is being managed within their organizations
What must companies do with the amount of data to be able t profit?
companies must develop a strategy for managing these data effectively
def. data governance
An approach to managing information across an entire organization which involves a formal set of business processes and policies that are designed to ensure that data are handled in a certain, well-defined fashion
What happens in data governance (general)?
the organization follows unambiguous rules for creating, collecting, handling, and protecting its information
What is the goal of data governance?
make information available, transparent, and useful for the people who are authorized to access it, from the moment it enters an organization until it is outdated and deleted
def. master data management
A process that provides companies with the ability to store, maintain, exchange, and synchronize a consistent, accurate, and timely “single version of the truth” for a company’s core master data.
What is a strategy for implementing data governance?
master data mangement
def. master data
A set of core data, such as customer, product, employee, vendor, geographic location, and so on, that span an enterprise’s information systems.
What is the difference between transaction data and master data?
Transaction data, which are generated and captured by operational systems, describe the business’s activities or transactions. In contrast, master data are applied to multiple transactions and are used to categorize, aggregate, and evaluate the transaction data
How did businesses manage their data during the first adopted computer applications era?
file management environment
def. data file (table)
a collection of logically related records
What happens in a file management environment?
each application has a specific data file related to it, which contains all of the data record the application requires
over times, orgs evloped numerous applications, each with an associated, application-specific data file
What can the use of databases solve? (6)
minimize:
- data redundancy
- data isolation
- data inconsistency
maximize
- data security
- data integrity
- data independence
how are databases arranged?
arranged so that one set of software programs—the database management system—provides all users with access to all of the data.