Exam 2 Flashcards
real-time information
immediate, up-to-date information
real-time systems
provide real-time information in response to requests
info quality
timeliness accuracy relevancy completeness consistent uniqueness
costs of low quality information
time, money, reputations and even jobs
data steward
responsible for ensuring the polices and procedures are implemented across the organization and acts as a liaison between the mis department and the business
data governance
refers to the overall management of the availability, usability, integrity and security of company data
SQL
strutted query language
QBE
query-by-example
primary key
a field that uniquely identifies a given record in a table
foreign key
a primary key of one table that appears as an attribute in another table and acts to provide a logical relationship between the two tables
data warehouse
a type fo database that integrates copies of transaction data from disparate source systems and provisions them for analytical use
ETL
extraction, transformation and loading
a process that extracts information from internal and external databases, transforms the information using a common set of enterprise definitions and loads the information into a data warehouse
data mart
contains a subset of data warehouse information
SLA
service level agreements
many db can be subject to application SLAs that require 99.99% uptime. system failure can result in chaos and lawsuits. data warehouses may not have SLAs
purposes of a DW
descriptive
diagnostic
predictive
prescriptive
some reasons for DW failure
lack of executive support lave of companion lack of vision and imagination lack of resources failure to understand magnitude and complexity biting off too much no data steward function not understanding proper business objectives
cube
common term for the representation of multidimensional information
pre defined calculations
difficult back up and recovery
more security
OLAP
on-line analytical processing
cubes ofter refereed to as OLAP cubes
dimensional model
contains the same information as normalized model, but packages the data in a format that delivers user understandability, query performance and resilience to change
dimension
a particular attribute of information
data cleansing / scrubbing
a process that weeds out and fixes or discards inconsistent, incorrect or incomplete information
data broker
a business that collects personal information about consumers and sells that information to other organizations
big data
more data than your firm can comfortably handle volume velocity variety veracity complexity
size of a terabyte
1000 gigabytes
size of a petabyte
1000 terabytes
two dominant sources
CGM - consumer generated media
IOT - internet of things
taxonomies
a scheme of classification
ontologies
a set of concepts and categories in a subject area that shows their properties and the relations between them
BLOB
binary large object
DB data type
percent of major corporation decisions by gut
40%
percent of major corporation decisions done correctly
50%