Lecture 7: Data Warehouses, Business Intelligence and Big Data Analytics Flashcards

1
Q

What is a transaction processing system?

A

System that records data on fundamental operations occurring within the company

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is batch processing?

A

Data is stored in temporary storage and processed as a single unit at a specific time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is online transaction processing?

A

Dta is processed immediately in real-time, current state of the system is always reflected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a ERP, CRM and SCM system?

A

Enterprise Resource Planning System: Integrates core functions of the company into homogenous system

Customer Relationship Management System: Integrates customer data to be used by various departments

Supply Chain Management System: Provides a holistic overview of value chain, including flow of raw materials

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are Operational Systems and Business Intelligence tools?

A

Operational systems: Represent the input side of databases, data warehouses and data marts

Business intelligence tools: More sophisticated analytics systems, represent the output side

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is online analytical processing?

A
  • Transaction-level data stored in relational databases is aggregated and summarized
  • Results of analysis are steroid in data cubes
  • Data cubes structure results across multiple dimensions (Space, products, time)
  • Running queries on data cubes enables substantially quicker response times than running them on original database
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is data mining?

A
  • Data mining refers to the use of algorithms to identify hidden patterns in larger data sets
  • Some basic types of patterns include: Associations, clusters and sequential relationships
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are association rules?

A
  • Associations are certain attribute values that frequently occur together within a data set
  • Association rule mining seeks to identify the most frequent affinities amongst items
  • Support: is the fraction of transactions that contain a certain set of items X
  • Confidence: is the fraction of transactions that contain Y among those transactions that contain X
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the four Vs of Big Data?

A
  1. Volume
  2. Velocity
  3. Variety
  4. Veracity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are neural networks?

A

They replicate the basic functionality of the human brain to support decision making by predicting future outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is hadoop?

A

Open-source software framework used for (distributed) storage and analysis of big data sets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the four primary advantages of hadoop?

A
  1. Flexibility - can handle any type of data from any source
  2. Scalability - Works on single low-end PC that can be scaled to combine hundreds of computers
  3. Cost effectiveness (open source)
  4. Fault tolerance (designed to avoid singe point failure, such as computer crashing)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly