Unit 11: Behind the Scenes: Databases and Information Systems Flashcards
data warehouse
large-scale collection of data that contains and organizes in one place all the data from an organization’s multiple databases
3 sources for data warehouses
- internal sources - sales, billing, inventory, and customer databases
- external sources - vendors and suppliers
- clickstream data - data captured by software on company websites about each click a user makes while navigating the site
time-variant data?
data that doesn’t all pertain to one period of time
data staging?
an intermediate storage area used for data processing during the ETL process
ETL?
extract, transform, and load - the process of formatting/cleansing data so that data from different sources and of different types can commingle for analysis
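A minimal sketch of the three ETL steps, assuming two hypothetical sources (`billing_rows` and `sales_rows`) that store the same kind of data in different formats. The field names and the in-memory `warehouse` list are illustrative assumptions, not part of any real system.

```python
# Hypothetical raw records: same kind of data, different formats per source.
billing_rows = [{"customer": " Ada Lopez ", "amount": "19.50"}]
sales_rows = [{"cust_name": "ada lopez", "total_cents": 1200}]


def extract():
    """Extract: pull the raw rows from each source."""
    return billing_rows, sales_rows


def transform(billing, sales):
    """Transform: cleanse names and convert every amount to dollars."""
    unified = []
    for row in billing:
        unified.append({
            "customer": row["customer"].strip().title(),
            "amount": float(row["amount"]),
        })
    for row in sales:
        unified.append({
            "customer": row["cust_name"].strip().title(),
            "amount": row["total_cents"] / 100,
        })
    return unified


def load(rows, warehouse):
    """Load: append the cleaned rows to the warehouse (a list stands in here)."""
    warehouse.extend(rows)
    return warehouse


warehouse = load(transform(*extract()), [])
```

After the transform step both rows share one schema, so they can be analyzed together.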
OLAP?
online analytical processing - software that provides standardized tools for viewing and manipulating data in a data warehouse
data mart?
a related set of data that is grouped together and separated out from the main body of data in the data warehouse
data mining?
the process by which large amounts of data are analyzed and investigated to spot patterns or trends
Hadoop?
an open-source platform that makes complex unstructured data easier to manage
how Hadoop stores files and processes data
- file storage - data is broken into chunks, which are then distributed across many servers to be stored
- data processing - uses MapReduce
MapReduce?
sends code to each of the servers storing the data; each server then uses its own processors to process its set of data
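A rough illustration of the map-then-reduce idea as a word count. This runs in a single Python process; the `chunks` list only stands in for data stored on separate servers, and the function names are hypothetical, not Hadoop APIs.

```python
from collections import Counter
from functools import reduce

# Each string stands in for a chunk of a file stored on a different server.
chunks = ["big data big", "data big insight"]


def map_chunk(chunk):
    """Map step: each 'server' counts the words in its own chunk."""
    return Counter(chunk.split())


def reduce_counts(left, right):
    """Reduce step: merge two partial counts into one."""
    return left + right


partials = [map_chunk(c) for c in chunks]
totals = reduce(reduce_counts, partials, Counter())
```

The key point is that the counting code travels to the data, and only the small partial results are combined.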
6 data-mining techniques
- anomaly detection - identify outliers
- association/affinity group - determines which data goes together
- classification - defines data classes to spot trends
- clustering - organize data into smaller subgroups
- estimation/regression - assign value to data based on certain criteria
- visualization - a visual representation of data
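As one example from the list above, anomaly detection can be sketched with a simple z-score check. The sales figures and the threshold of 2 standard deviations are made-up assumptions; real data-mining tools use more robust methods.

```python
from statistics import mean, pstdev

# Hypothetical daily sales figures with one unusual day.
daily_sales = [102, 98, 105, 99, 101, 250, 97]


def find_outliers(values, threshold=2.0):
    """Return values more than `threshold` standard deviations from the mean."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return []  # all values identical, so nothing stands out
    return [v for v in values if abs(v - mu) / sigma > threshold]


outliers = find_outliers(daily_sales)
```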
information system?
a software-based solution used to gather and analyze data
TPS?
transaction-processing system - an operational-level system that keeps track of everyday business transactions/activity
real-time processing?
the database is updated while the transaction is taking place
OLTP?
online transaction processing - ensures that the data in the TPS is current
Batch processing?
transaction data is accumulated until a certain point is reached, then several transactions are processed at once
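The difference between real-time and batch processing can be sketched with a toy inventory. The `Inventory` class, its method names, and the batch size of 3 are all hypothetical choices for illustration.

```python
class Inventory:
    """Toy inventory that supports both processing styles."""

    def __init__(self, stock):
        self.stock = stock
        self.pending = []  # transactions waiting for the next batch run

    def sell_real_time(self, qty):
        """Real-time: the stock count is updated during the transaction."""
        self.stock -= qty

    def sell_batch(self, qty, batch_size=3):
        """Batch: hold the transaction until enough have accumulated."""
        self.pending.append(qty)
        if len(self.pending) >= batch_size:
            self.flush()

    def flush(self):
        """Process every pending transaction at once."""
        self.stock -= sum(self.pending)
        self.pending.clear()


inv = Inventory(100)
inv.sell_real_time(5)            # stock changes immediately
inv.sell_batch(1)
inv.sell_batch(2)
stock_before_flush = inv.stock   # the two batch sales are not applied yet
inv.sell_batch(3)                # third sale fills the batch and triggers flush
```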
What is the ACID test?
a test to ensure transactions are processed accurately
ACID?
- A - atomicity - all components of the transaction are treated as one unit: either all of them complete or the transaction is not completed
- C - consistency - at the end of a transaction a new piece of data is completed or the transaction remains incomplete
- I - isolation - during processing, each transaction is treated separately from other transactions
- D - durability - completed transactions are saved by the TPS and cannot be undone
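Atomicity can be sketched with Python's built-in `sqlite3` module: a transfer either applies both updates or neither. The `accounts` table, the account names, and the `transfer` helper are hypothetical; the CHECK constraint is an assumption added to force a failure.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    "name TEXT PRIMARY KEY, balance INTEGER CHECK (balance >= 0))"
)
conn.execute("INSERT INTO accounts VALUES ('checking', 100), ('savings', 50)")
conn.commit()


def transfer(conn, src, dst, amount):
    """Move money between accounts; both updates succeed or neither does."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        return False  # the CHECK constraint failed, so the credit was undone too


ok = transfer(conn, "checking", "savings", 30)
failed = transfer(conn, "checking", "savings", 500)  # would overdraw checking
balances = dict(conn.execute("SELECT name, balance FROM accounts"))
```

In the failed transfer the credit to savings runs first, yet it is rolled back when the debit fails, so no partial transaction is saved.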
MIS?
management information system - provides timely and accurate info that enables managers to make critical business decisions
What types of reports are generated by MISs? (3)
- detail report - list of transactions that occurred over a certain period of time
- summary report - provides a consolidated picture of detailed data
- exception report - shows conditions that are unusual or that need attention by the system users
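The three report types above can be sketched from one list of transactions. The sample records, field names, and the reorder level of 5 are invented for illustration.

```python
# Hypothetical transactions recorded by a TPS.
transactions = [
    {"item": "laptop", "qty": 2, "stock_left": 14},
    {"item": "mouse", "qty": 10, "stock_left": 3},
    {"item": "laptop", "qty": 1, "stock_left": 13},
]

# Detail report: one line per transaction.
detail_report = [(t["item"], t["qty"]) for t in transactions]

# Summary report: total quantity sold per item.
summary_report = {}
for t in transactions:
    summary_report[t["item"]] = summary_report.get(t["item"], 0) + t["qty"]

# Exception report: only items whose stock fell below the reorder level.
REORDER_LEVEL = 5
exception_report = sorted(
    {t["item"] for t in transactions if t["stock_left"] < REORDER_LEVEL}
)
```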
consolidation?
the act of unifying something
DSS?
decision support system - type of business info system designed to help a manager develop solutions for specific problems
model management system?
software that assists in building management models in a DSS
BI?
business intelligence - the ability to improve business decision making with databases and other fact-based support systems
business intelligence system?
a form of business info system that is used at the executive level
ERP system?
enterprise resource planning system - a software system that accumulates all info relevant to running a business and makes it available to whoever needs it
knowledge-based system?
a system that provides intelligence that supplements the user’s own intellect
expert system?
tries to replicate the decision making of a human expert in order to solve problems
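A very small rule-based sketch of the idea, assuming a hypothetical help-desk expert whose knowledge is written as if-then rules. The rules and advice strings are invented; real expert systems use far larger knowledge bases and inference engines.

```python
# Each rule pairs a condition on the known facts with the expert's advice.
RULES = [
    (lambda f: not f["powers_on"],
     "Check the power cable and outlet."),
    (lambda f: f["powers_on"] and not f["has_network"],
     "Restart the router and check the network cable."),
]


def advise(facts):
    """Return the advice from the first rule whose condition matches."""
    for condition, advice in RULES:
        if condition(facts):
            return advice
    return "No rule matched; escalate to a human expert."


advice_dead = advise({"powers_on": False, "has_network": False})
advice_offline = advise({"powers_on": True, "has_network": False})
advice_fine = advise({"powers_on": True, "has_network": True})
```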
NLP system?
natural language processing system - enables users to communicate with computer systems using a natural spoken/written language instead of a programming language
AI?
artificial intelligence - attempts to create computers that think like humans
data redundancy?
Unnecessary duplication of data
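A short sketch of why redundancy causes trouble, using made-up order records. In the redundant layout the address is copied into every order; in the second layout it is stored once and looked up, so one update reaches every order.

```python
# Redundant layout: the same address is repeated in every order.
orders_redundant = [
    {"order_id": 1, "customer": "Ada", "address": "1 Main St"},
    {"order_id": 2, "customer": "Ada", "address": "1 Main St"},
]

# Non-redundant layout: the address is stored once, orders refer to the customer.
customers = {"Ada": {"address": "1 Main St"}}
orders = [
    {"order_id": 1, "customer": "Ada"},
    {"order_id": 2, "customer": "Ada"},
]

# One update is enough, and no stale copy can be left behind.
customers["Ada"]["address"] = "9 Oak Ave"
addresses = [customers[o["customer"]]["address"] for o in orders]

# The redundant copies still hold the old address until each one is edited.
stale = [o["address"] for o in orders_redundant]
```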