pic 3 - Data and Knowledge Management Flashcards
Difficulties of managing data
- Amount of data is increasing exponentially.
- Data is scattered throughout organizations.
- Data is generated from multiple sources.
- New sources of data are constantly being developed. Data becomes let current over time.
- Data rot
- Data security, quality, and integrity are critical, yet they are easily jeopardized.
- Federal gov. regulations require companies to account for how information is being managed within their organizations. Companies are downing in data, much of which is unstructured.
- Big data
Data silo
A collection of data held by one group that is not easily accessible by other groups. They hinder the process of gaining actionable insight from organizational data, create barriers to an overall view of the enterprise and its data.
Data streams
Data that are continuously generated by point-of-sale systems, clickstream data, social media, and sensors.
Data rot
Refers primarily to problems with the media on which the data are stored.
- Temperature, humidity, and exposure to light can cause physical problems with storage media and make it difficult to access data.
- Another aspect is finding the machines needed to access the data.
Data governance
An approach to managing information across an entire organization.
- Involves a formal set of business processes and policies that are designed to ensure that data are handled in a certain, well-defined fashion.
- Objective is to make info available, transparent, and useful for the people who are authorized to access it, from the moment it enters an organization until it become outdated and is deleted.
Master data management
A process that spans all of an organization’s business processes and applications.
- Strategy for implementing data governance.
- Provides companies the ability to store, maintain, exchange, and synchronize a consistent, accurate, and timely “single version of the truth” for the company’s master data.
Master data
A set of core data (e.g., customer, product, employee, vendor, geographic location) that span the enterprise’s information systems.
Transactional data
Data generated and captured by operational systems that describe the business’s activities, or transactions.
Data file
A collection of logically related records.
Database systems minimize the following problems:
- Data redundancy – same data stored in multiple locations.
- Data isolation – applications cannot access data associated with other applications.
- Data inconsistency – various copies of the data do not agree.
Database systems also maximize the following benefits:
- Data security – Since data is put in once place in databases, there is risk of losing a lot of data at once. Databases must have extremely high security measures in place to minimize mistakes and deter attacks.
- Data integrity – Data meet certain constraints (E.g., there are no letter in a SIN).
- Data independence – Applications and data are independent of one another; that is, applications and data are not linked to each other, so all applications are able to access the same data.
A bit (binary digit)
the smallest unit of data a computer can process. Binary means it can only consist of a 0 or 1.
A byte
a group of 8 bits that represent a single character. Can be a letter, number, or symbol.
Field
a characteristic of interest that describes an entity. It is a logical grouping of characters into a word, a small group of words, or an identification number.
Record
a logical grouping of related fields.
Table
a logical grouping of related records. AKA a data file.
Database
a logical grouping of related files.
Database management system (DBMS)
A set of programs that provide users with tools to create and manage a database.
- Managing a database refers to the processes of adding, deleting, accessing, modifying, and analyzing data that are stored in a database.
Relational database model
Data model based on the simple concept of tables in order to capitalize on characteristics of rows and columns of data.