Chapter 5: Data and Knowledge Management Flashcards
What are some of the difficulties managing data
Data Rot
- refers to the gradual deterioration or corruption of digital data over time
Data Security
- Data security, quality, and integrity are critical, yet they are easily jeopardized.
Inconsistent Data
- repetition and conflicts across the organization (ex. marketing depart. having duplicate customer information as the sales depart)
Regulations
- Legal requirements relating to data also differ among countries as well as among industries, and they change frequently.
- Bill 198, the Canadian equivalent to the U.S. Sarbanes-Oxley Act—require companies to account for how information is being managed within their organizations
define clickstream data
Clickstream data: are those data that visitors and customers produce when they visit a website
- provides user behaviour and browsing patterns
define data goverance
Data governance: is an approach to managing information across an entire organization.
- involves a formal set of business processes and policies that are designed to ensure that data are handled in a certain, well-defined fashion - policies and procedures to manage data
One strategy for implementing data governance is —- data ——–
Master data management
define master data and master data management
Master data: semi-permeant, (it could change but it doesn’t change often), are a set of core data, such as customer, product, employee, vendor, geographic location, that span the enterprise’s information systems.
* Ex. A person’s last name, could be changed in case person gets married
Master data management: is a method of managing an organization’s critical data to ensure data consistency, accuracy, and reliability throughout the enterprise.
define Transactional data
Transactional data: permeant data, which are generated and captured by operational systems, describe the business’s activities, or transactions
What is the data hierarchy? List in order and try to define
5 words
Bit, Byte, Field, Record, Table
Bit (binary digit): represents the smallest unit of data a computer can process.
* The term binary means that a bit can consist only of a 0 or a 1.
Byte: A group of eight bits, represents a single character. A byte can be a letter, a number, or a symbol.
Field: A logical grouping of characters into a word, a small group of words, or an identification number
* Ex. a student’s name in a university’s computer files would appear in the “name” field
Record: A logical grouping of related fields, such as the student’s name, the courses taken, the date, and the grade
Table: A logical grouping of related records
* Ex. a grouping of the records from a particular course, consisting of course number, professor, and students’ grades, would constitute a data file for that course
Define data file and database
Data file: is a logically related record
Database: is a structured system for managing and querying organized data
Ex. the student course file could be grouped with files on students’ personal histories and financial backgrounds to create a student database
define Database management system (DBMS) and Relational database model
Database management system (DBMS): is a set of programs that provide users with tools to create and manage a database.
Relational database model made up of tables. Each of these tables contains records (listed in rows) and attributes (listed in columns).
Define a Data base model, Entity, Instance, and Attribute
Data model: is a diagram that represents entities in the database and their relationships
Entity: is a person, a place, a thing, or an event—about which an organization maintains information
- ex. : “student”
Instance: of an entity refers to each row in a relational table, which is a specific, unique representation of the entity.
- Ex. a particular person named John Smith is an instance of the “Student” entity.
Attribute: each characteristic or quality of a particular entity
- ex. For a “Student” entity, attributes could include “Name,” “Student Number,” and “Grade.”
Define primary key, secondary key, foreign key?
Primary key: Every record in the database must contain at least one field that uniquely identifies that record so that it can be* retrieved, updated, and sorted*.
* Ex. a Canadian university would use a unique student number as its primary key
Secondary key: is another field that has some identifying information but typically does not identify the record with complete accuracy
* Ex. the student’s major might be a secondary key if a user wanted to identify all of the students majoring in a particular field of study
* It should not be the primary key, however, because many students can have the same major.
Foreign key: It’s a column in one table that helps link that table to another table. It usually contains values that match with the primary key of the other table, creating a relationship.
What are some of the disavantages and advantages of relational database model?
Advantages:
Data Integrity:
* ensuring that data is accurate and consistent.
Structured Query Language (SQL):
* SQL provides a standardized way to interact with the database, making it easy to retrieve, manipulate, and manage data.
Data Relationships:
* The model allows for the creation of relationships between tables, enabling complex queries and data retrieva
Disadvantages:
Inflexible for Unstructured Data:
* Not well-suited for storing unstructured data like documents, social media content, or sensor data.
Cost:
* Licensing and maintaining can be expensive.
what are the three characteristics of Big Data?
Hint: VVV
Volume: We noted the huge volume of Big Data.
- Ex. Smart electrical meters, sensors in heavy industrial equipment, and telemetry from automobiles compound the volume problem
Velocity: The rate at which data flow into an organization is rapidly increasing.
- Ex. the Internet and mobile technology enable online retailers to compile histories not only on final sales, but on their customers’ every click and interaction.
Variety: Big Data formats change rapidly. They include satellite imagery, broadcast audio streams, digital music files, web page content, scans of government documents, and comments posted on social networks.
How can companies use Big Data to gain a competitive advantage
Data-Driven Decisions
- Use big data analytics to inform strategic decisions. Analyze customer behavior, market trends, and operational data to make more informed and timely choices.
Customer Insights
Market Analysis
Operational Efficiency
Product Development
Risk Management
Competitive Analysis
Quality Control
Supply Chain Optimization
Predictive Maintenance
Fraud Detection
Personalized Marketing
E-commerce Recommendations
Healthcare Diagnosis
Energy Efficiency
define Query by example (QBE)
Query by example (QBE): is a user-friendly way of searching for information in a database.