TOpic 3 Flashcards
What are common challenges in managing data?
- data increases exponentially with time
- data is scattered throughout organizations
- multiple sources of data
- new sources of data
- data becomes outdated
- data media rots
- high volumes –> complex
- may be compromised (security, quality, integrity)
- redundancy or inconsistancy
How is data governance used to address common challenges in managing data? (4)
- provides a planned approach to data management for all types of data
- includes a formal set of business processes and policies for data handling
- requires well-defined, unambiguous rules to avoid functional inconsistency
- rules address creating, collecting, handling, and protecting data
What is Big Data?
high-variety, high-volume, high-velocity information assets that require new forms of processing to enable enhanced decision making, insight discovery, and process optimization
What are data warehouses?
a repository of historical data organized by subject to support decision makers in the organization
What are the necessary elements to successfully implement and maintain data warehouses?
- link source systems that provide data to the warehouse or mart
What are knowledge management systems?
The use of modern information technologies to systematize, enhance, and expedite intrafirm and interfirm knowledge management
What are the benefits of implementing knowledge management systems?
What are the challenges of implementing knowledge management systems?
What is the process of querying a relational database, entity relationship modelling, and normalization and joins?
What is the relational database model?
based on two dimensional tables that are related. Their records are listed in rows and attributes listed in columns. Users retrieve, analyze, and understand data from the model.
What is entity-relationship modelling?
What is normalization and joins?
a method for analyzing and reducing a relational database to its nost streanlined form
What are types of sources of data?
- internal sources
- personal sources
- external sources
- news sources
What is the main objective of data governance?
to enable available, transparent, and useful data, “a single version of the truth”
Data governance is a subset of which of the following?
1. IT governance
2. data management
3. Big Data
4. IT management
!. IT governance
What are the 5 data governance areas?
- master data management
- meta data management
- data quality
- data catalogue
- data lineage
What is transactional data?
represents activities or events, such as a payroll cheque or customer invoice; stored in transaction files or as tables as part of a database
What is master data?
a set of core data, such as employee name, address, customer name, or customer credit limit that are applied to multiple transactions; stored in a master file or as tables as part of a database
What is a database management system (DBMS)?
a set of programs with tools to create and manage databases
What do DBMSs mimimize (3) and maximize (3)?
Min:
redundancy, isolation, inconsistancy
Max:
security, integrity, independence
What are the 4 things big data generally consists of?
- traditional enterprise data
- machine-generated/sensor data
- social data
- images captured by billions of devices around the world
What’s a data mart?
a low-cost, scaled-down version of a data warehouse designed fro end-user needs
What are data lakes?
stores current and historical data in its raw form for the purpose of analyzing the data
What is knowledge management (KM)?
a process that helps manipulate important knowledge that comprises part of the organization’s memory, usually in an unstructured format
What is explicit vs. tacit knowledge?
explicit: objective, rational, technical
tacit: subjective or experiential