Data Warehous, Big Data And KMS - Lecture 3-3 Flashcards

1
Q

Data warehouse

A

A repository of historical data that are organized by dimensions to support decision makers in the organization.
◦Organized toward data analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Data mart

A

◦A scaled down version of a DW that is designed for the end user needs in an individual department.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Characteristics of data warehouse

A
  1. Subject oriented
    - customer
    - location
    - product
  2. Integration
    Data warehouse
    - POS terminal
    - ERP
    - Website
    - Clickstream
  3. Time variant
    - days, weeks, months, quarters
  4. Non-vVolitile
    - can’t edit or change it
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Data warehouse framework

A

Sources
- can be in all different formats

Integration
- combining all different formats
E - extract
T - transportation (into uniform format)
L - load

Warehouse
- data warehouse
Sources, table, summary data

This creates the Data Mart
Ex/ profit loss -> financial analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Characteristics of BIG DATA

A
  1. Volume -> quantity
  2. Velocity -> rate/speed
  3. Variety -> types (video, text, audio, image)
    • structured ( TPS, ERP, is)
    • unstructured (textual - social media)
    • semi - structural
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Technical Challenges

A
  1. Volume -> storage -> media/devices -> optimize of hardware
  2. Velocity: rate <-> process -> techniques, architecture
       NO SQL, must use HADOOP, mapreduce to deal with big data 
  3. Data quality
    (Errors, missing info, noises spell check)
    -> machine learning -> decision making
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why is there no SQL for big data

A

Not only SQL; Processing unstructured data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Big data

A

data so large and complex it cannot be managed by traditional systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

HADOOP

A

a framework for storing & processing Big Data in a distributed environment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

DATA QUALITY

A

Data could be “dirty,” i.e. inaccurate, incomplete, incorrect, duplicate or erroneous (e.g.
incorrect spelling)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Management challenges

A
  1. Consumption
    - context recommendations
    - content creation
    - personalized profile
    - content classification
    (Production)
  2. Team
    - hybrid skill set
  3. Privacy and control -> GDPR
    General data protection regulation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Knowledge management systems

A

Knowledge management (KM): a process that helps organizations manipulate important knowledge that is part of the organization’s memory

Data
- organized

Information (comm213)
- processed

Knowledge
- explicit
Policies, vision, standard operating procedure
- knowledge
Tacit knowledge -> skills experience

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Data

A

are the raw bits and pieces of information with no context.
- Data can be quantitative or qualitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Information

A

data in the context. For example, “15, 23, 14, and 85′′ are the numbers of students that had registered for upcoming classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Knowledge

A

Once we have put our data into context, aggregated and analyzed it, we can use it to make decisions for our organization. This consumption of information produces knowledge

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Explicit knowledge

A

objective, rational, technical knowledge that has been documented.
Examples: policies, procedural guides, reports, products, strategies, and goals

17
Q

Tacit knowledge

A

cumulative store of subjective or experiential learning
Examples: experiences, insights, expertise, know-how, trade secrets, understanding, and skill sets

18
Q

What is knowledge management system cycle

A

Knowledge management systems (KMSs) use modern information technologies to systematize, enhance and expedite knowledge management, with the goal to make the most productive use of knowledge

19
Q

Steps in knowledge management system cycle

A
  1. Creat (gap)
  2. Capture (improvement)
  3. Refine (using tacit knowledge)
  4. Store (technology - so everyone can access)
  5. Manage (changes - past knowledge replaces with new)
  6. Disseminate
    Final KNOWLEDGE