Module 5 Flashcards

1
Q

How do we manage data?

A

Through complex and powerful information systems
SAP Enterprise Resource Planning (ERP) system is the world’s most popular

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are databases?

A

Structuring data
Database management system - manages access and usage of all users to all data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do databases minimize?

A

Data redundancy
Data isolation
Data inconsistency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What do database management systems maximize?

A

Data security
Data integrity
Data independence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Big Data?

A

Data is being collected at an increasingly rapid pace from many sources
New types of data are being collected
Organizations and individuals must process that data
Worldwide, reports show that data is increasing by 50% per year
Big Data = data so large and complex it cannot be managed by traditional systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the most common data warehouse?

A

The central enterprise data warehouse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are independent data marts?

A

They store data for a single or a few applications, such as marketing or finance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is hub and spoke?

A

It stores data in a central data warehouse while simultaneously maintaining dependent data marts that obtain their data from the central repository.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is knowledge management?

A

It is a process that helps organizations manipulate important knowledge that is part of the organizations memory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is knowledge?

A

Information that is contextual, relevant, and useful, developed with the assistance of expertise, also known as intellectual capital (or intellectual assets)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is explicit knowledge?

A

Objective, rational, technical knowledge that has been documented

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is tacit knowledge?

A

Cumulative store of subjective or experiential learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the difficulties in managing data?

A

Amount of data increases exponentially over time
Data are scattered throughout organizations
Data obtained from multiple internal and external sources
Data degrade over time
Data subject to media rot
Data security, quality, and integrity are critical yet easily jeopardized
Information systems that do not communicate with each other can result in inconsistent data
Federal regulations require corporations to account for how their data are managed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is Data Governance?

A

It is an approach to managing information across an entire organization
Uses Master Data Management

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does Data Governance provide control over?

A

It provides control over Master Data and Transaction Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the database approach?

A

Database is arranged so that one set of software programs - the database management system - provides all users with access to all data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What does database minimize?

A

Data redundancy: the same data are stored in many places
Data isolation: applications cannot access data associated with other applications
Data inconsistency: various copies of the data do not agree

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What do database management systems maximize?

A

Data security: databases must have extremely high security measures in place to deter mistakes and attacks since data is stored in one place
Data integrity: data must meet certain constraints, such as no alphabetic characters in a social insurance number field
Data independence: applications and data are not linked to each other (kept separate or independent by the DBMS that controls access), so that all applications are able to access the same data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the data hierarchy?

A

Bit: (binary digit) represents the smallest unit of data a computer can process
Byte: represents a single character, often often composed of eight bits
Field: a logical grouping of related characters
Record: a logical grouping of related fields
File (or table): a logical grouping of related records
Database: a logical grouping of related files

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What do database management systems do?

A

They create and manage a database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the relational database model based on?

A

On the concept of two-dimensional tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is a data model?

A

It is a diagram that represents the entities in the database and their relationships

23
Q

What is an entity?

A

It is a person, place, thing, or event about which an organization maintains information

24
Q

What is an instance?

A

It is one specific, unique representation of the entity

25
Q

What is an attribute?

A

It is a characteristic or quality of a particular entity

26
Q

What is a primary key?

A

IT is a field that uniquely identifies a record

27
Q

What are secondary keys?

A

They are other identifying fields that typically do not identify the file with complete accuracy

28
Q

What are foreign key fields?

A

They are used to uniquely identify a row of another table that is linked to the current table

29
Q

What are the issues of big data?

A

Sources can include:
Untrusted sources
Data could be “dirty” i.e. inaccurate, incomplete, incorrect, duplicate or erroneous
Big data changes since quality issues can arise during collection that are temporary or permanent

30
Q

How do we manage Big Data?

A

Deciding what needs to be done is the most important task
Then, the organization needs to move from traditional processing to Big Data processing

31
Q

What is Big Data processing?

A

Integrate information silos into a database environment
Develop data warehouses to access multiple databases or long term data
Query the data warehouse using business intelligence

32
Q

How do we put Big Data to use?

A

Making data available: using public data (open data) or meta analysis
Enabling organizations to conduct experiments: ex multiple views of a web site to test consumer repsonce
Microsegmenting customers: dividing customers into smaller groups to provide tailored services
Creating new business models
Being able to analyze more data

33
Q

How does Big Data interact with functional areas of the organization?

A

Human resources: discovering health care trends to lower health care costs
Product development: text mining the Internet to explore customer preferences on a 3-blink turn signal
Operations: reducing fuel consumption on deliveries and optimizing delivery routes
Marketing: creating of 1500 tailored marketing messages by integrating data from internal systems with social media
Government operations: water management systems based on water control sensors

34
Q

What are the characteristics of data warehouses and data marts?

A

Organized by business dimension or subject
Use On-line analytical processing
Integrated
Time Variant
Nonvolatile
Multidimensional

35
Q

What does the environment for data warehouses and data marts include?

A

Source systems that provide data to the data warehouse or data mart
Data-integration technology and processes that prepare the data for use
Different architectures for storing data in an organization’s data warehouse or data marts
Different tools and applications for the variety of users
Metadata, data quality, and governance to ensure that the data warehouse or data mart meets its purposes

36
Q

What is data integration?

A

To extract data from source systems, transform them, and load them into a data mart or warehouse
Can be performed by hand-written code or by commercial data-integration software
Can be transformed to make them more useful

37
Q

How do we store data?

A

The most common architecture is one central enterprise data warehouse, without data marts, called “a single version of the truth”

38
Q

What are independent data marts?

A

They store data for a single or a few applications, such as in marketing or finance

39
Q

What is metadata?

A

Data is about data

40
Q

What is data quality?

A

The quality of the data in the warehouse must be adequate to satisfy users’ needs

41
Q

What does governance require?

A

It requires that people, committees and processes be in place to plan and control the development and use of the data warehouse

42
Q

What are users?

A

There are a large number of potential BI users, including IT developers; front-line workers; analysts; information workers; managers and executives; and suppliers, customers, and regulators

43
Q

What do knowledge management systems do?

A

They use modern information technologies to systemize, enhance, and expedite knowledge management, with the goal to make the most productive use of knowledge

44
Q

What are query languages?

A

They search for information in databases or data warehouses

45
Q

What is normalization?

A

A method for analyzing and reducing a relational database to its most simple form
Purpose is to provide minimum redundancy
Focus is to reduce non-key attributes
Improves processing efficiency of the database
It optimizes the tables in a relational database

46
Q

What are joins?

A

They link relational database tables with common attributes

47
Q

What are the key words of SQL (Structured Query Language)?

A

SELECT - what to locate
FROM - specify the source files
WHERE - provides conditions for the search

48
Q

What are Entity-Relationship Models (ERD)?

A

Database designers plan the database design in a process called entity-relationship modeling (ERD)
ER diagrams consist of entities, attributes and relationships organized using business rules
Business rules describe how the organization uses its data to run its operations
Entities are shown as rectangles, with relationships shown on the lines between them

49
Q

What are the degrees of relationships for ERD’s?

A

Unary: single entity
Binary: two related entities
Ternary: three related entities

50
Q

What are the relationship classifications of ERD’s?

A

One-to-one
One-to-many
Many-to-man

51
Q

What are some ERD terms?

A

Cardinality: the number of times a single record/instance of one entity can be associated with a single record/instance of another entity
Mandatory single: ex inventory quantity on hand with its sale price
Optional single: ex employee wage rate with pay cheque
Mandatory many: ex department details with employee details
Optional many: customer details with customer sales details/invoices

52
Q

What are functional dependencies?

A

They express how attributes are associated and need to be retained during normalization

53
Q

What does the third normalized form show?

A

There is no data redundancy
Foreign keys are used to link tables