Week 2 (Ch. 5, 12) Flashcards
what is metadata
“data about data”
what is big data
large data sets that conventional data processing technologies do not have sufficient power to analyze
what is a database
organized collection of related information
what is a DBMS (database management system)
software for creating, storing, maintaining and accessing database files
what are the three categorizations of big data
volume, variety and velocity
what is volume in the context of big data
size
what is variety in the context of big data
number of data types
what is velocity in the context of big data
processing speed
what is velocity in the context of big data
processing speed
what are heursitics
mental shortcut
what are structured decisions
decision based solely on inputs
what are unstructured decisions
non programmed decision involving lots of unknowns
what are semi structured decisions
factors are known but human experience and outside factor influences decisions
what are operational decisions
decisions that employees make to make the organization run
what are strategic decisions
set the course of an organization
what are tactical/managerial decisions
decisions about how things will get done
what are decision support systems (DSS)
interactive computer system that helps managers make decisions
what are executive information systems (EIS)
similar to DSS (decision support system) but customized for executives
what are expert systems
systems that give managers advice such as a human consultant would
what is ‘what if’ analysis
outlines impact of the change in a variable in a model
what is sensitivity analysis
outlines the impact when one variable is changes repeatedly
what is goal seek analysis
finds inputs necessary to achieve goal
what is optimization analysis
finds the optimum value for a target variable by changing other variables
what is descriptive analysis
‘what happened?’
what is diagnostic analytics
‘why did it happen?’
what is predictive analytics
‘what might happen?’
what is prescriptive analytics
‘what to do next?’
what is data governance
managing availability, usability, integrity and security of data
what are legacy systems
outdated information systems
what are canned reports
reports that provide regular summaries of information
what are ad hoc reporting tools
tools that allow users to create their own reports
what is online analytical procession (OLAP)
computing method that enables users to select and extract data for analysis from numerous perspectives (used by managers, analysts)
what is a data cube
storage for OLAP
what is data mining
process of analyzing data for previously unknown trends
what are the two conditions for effective data mining
clean and consistent data, data should reflect current and future trends
what is a data warehouse
set of databases
what is a data mart
subset of data warehouse focused on specific set of information
what is a relational data model
data is organized into inter related tables
what is a flat file database
only one file/table of data
what is a record in relational database
a row
what is an attribute in relational database
a column
organize bit, field, record, file and byte by order of size
file, record, field, byte, bit
what is normalizing data
reducing data redundancy and ensuring data integrity
why do people use databases
control over redundant data, data integrity, data sharing and security/recovery
what is sql
language used to manipulate and communicate with databases
what is OLTP
OLAP for recent data, smaller and used for day to day activities