Session 5 Flashcards

1
Q

What are the main problems with the traditional file environment?

A
  • Files maintained separately by different departments
  • Data redundancy
  • Data inconsistency
  • Program-data dependence
  • Lack of flexibility
  • Poor security
  • Lack of data sharing and availability
2
Q

What is a relational DBMS?

A
  • Represents data as two-dimensional tables

- Each table contains data on an entity and its attributes
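
To make the table idea concrete, here is a minimal sketch using Python's built-in sqlite3 module. The customer entity, its columns, and the sample rows are hypothetical, chosen only for illustration.

```python
import sqlite3

# Hypothetical entity: CUSTOMER. Its attributes become the table's columns.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, name TEXT, city TEXT)"
)
conn.executemany(
    "INSERT INTO customer (customer_id, name, city) VALUES (?, ?, ?)",
    [(1, "Alice", "Paris"), (2, "Bob", "Lyon")],
)

# Each row is one instance of the entity; each column is one attribute.
rows = conn.execute(
    "SELECT customer_id, name, city FROM customer ORDER BY customer_id"
).fetchall()
```

Here the rows and columns form the two-dimensional table: one row per customer, one column per attribute.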

3
Q

What is the new type of database created by social media and GAFA?

A

“NoSQL”
Non-relational databases that scale out more easily than relational databases.

No strict schema

4
Q

What are the characteristics of non-relational (“NoSQL”) databases?

A

– More flexible data model
– Data sets stored across distributed machines
– Easier to scale
– Handle large volumes of unstructured and structured data
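
The "more flexible data model" point can be sketched in plain Python. This is not the API of any NoSQL product; the posts collection and the find helper are hypothetical names used only to show documents with differing fields living side by side.

```python
import json

# Hypothetical "collection" of documents; no fixed schema is enforced,
# so the second document carries fields the first one lacks.
posts = [
    {"id": 1, "user": "alice", "text": "Hello"},
    {"id": 2, "user": "bob", "text": "Photo", "tags": ["travel"], "likes": 12},
]

def find(collection, **criteria):
    """Return the documents whose fields match all given criteria."""
    return [
        doc for doc in collection
        if all(doc.get(key) == value for key, value in criteria.items())
    ]

bob_posts = find(posts, user="bob")
serialized = json.dumps(posts)  # documents are commonly stored as JSON-like text
```

Adding a new field to one document requires no change to the others, which is the flexibility the card refers to.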

5
Q

What are the characteristics of databases in the cloud?

A

– Appeal to start-ups, smaller businesses
– Amazon Relational Database Service, Microsoft SQL Azure
– Private clouds

6
Q

What is big data?

A

Massive sets of unstructured/semi-structured data
from web traffic, social media, sensors, and so on

See the YouTube videos (World Economic Forum and SAP)

7
Q

What is the name of the set of tools used to analyze big data?

A

We use Business Intelligence Infrastructure to analyze big data.

8
Q

What is a data warehouse used for?

A

A data warehouse:
– Stores current and historical data from many core operational transaction systems
– Consolidates and standardizes information for use across the enterprise, but data cannot be altered
– Provides analysis and reporting tools

9
Q

What is a data mart?

A
  • Subset of a data warehouse

- Typically focuses on a single subject or line of business
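
One way to picture a data mart is as a filtered view over a warehouse table. The sketch below uses Python's sqlite3 module; the warehouse_sales table, the appliances_mart view, and the figures are all hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical warehouse table covering several lines of business.
conn.execute(
    "CREATE TABLE warehouse_sales (line_of_business TEXT, region TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO warehouse_sales VALUES (?, ?, ?)",
    [
        ("appliances", "East", 100.0),
        ("appliances", "West", 50.0),
        ("furniture", "East", 70.0),
    ],
)

# The "mart" exposes only the subset relevant to one line of business.
conn.execute(
    "CREATE VIEW appliances_mart AS "
    "SELECT region, amount FROM warehouse_sales "
    "WHERE line_of_business = 'appliances'"
)
mart_rows = conn.execute(
    "SELECT region, amount FROM appliances_mart ORDER BY region"
).fetchall()
```

In practice a data mart is often a separate, physically stored database rather than a view; the view is used here only to keep the sketch short.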

10
Q

What is Hadoop?

A

Hadoop is an open-source software framework that lets us analyze very large data sets by distributing the processing across clusters of inexpensive computers.
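
Hadoop's best-known processing model is MapReduce. The sketch below imitates that pattern in plain Python on a single machine; it does not use Hadoop itself, and the chunks input and function names are hypothetical.

```python
from collections import Counter
from functools import reduce

# Hypothetical input, already split into chunks as if spread over several machines.
chunks = ["big data big", "data sensors", "big sensors sensors"]

def map_chunk(chunk):
    """Map step: count the words of one chunk, independently of the others."""
    return Counter(chunk.split())

def merge(left, right):
    """Reduce step: combine two partial counts into one."""
    return left + right

# On a real cluster each chunk could be processed on a different machine.
partial_counts = [map_chunk(chunk) for chunk in chunks]
totals = reduce(merge, partial_counts, Counter())
```

Because each chunk is counted independently, the map step can be spread over many inexpensive machines, and only the small partial results need to be merged.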

11
Q

With new technology, we can process all the data at the same time

A

BII 3

12
Q

What is IoT?

A

IoT = Internet of Things

13
Q

uii

A

Operational data can be put directly into the data warehouse because it is clearly structured

14
Q

What is OLAP?

A

Online Analytical Processing
Supports multidimensional data analysis
- Viewing data using multiple dimensions
- Each aspect of information (product, pricing, cost, region, time period) is a different dimension
- Example: How many washers were sold in the East in June compared with other regions?
OLAP enables rapid, online answers to ad hoc queries.
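
The washer example can be sketched as a slice-and-aggregate over a small fact list. This is plain Python rather than an OLAP engine, and the sales figures and the units_by_region helper are hypothetical.

```python
from collections import defaultdict

# Hypothetical sales facts: (product, region, month, units_sold).
sales = [
    ("washer", "East", "June", 120),
    ("washer", "West", "June", 90),
    ("washer", "East", "May", 80),
    ("dryer", "East", "June", 40),
    ("washer", "North", "June", 60),
]

def units_by_region(facts, product, month):
    """Slice on product and month, then aggregate along the region dimension."""
    totals = defaultdict(int)
    for fact_product, region, fact_month, units in facts:
        if fact_product == product and fact_month == month:
            totals[region] += units
    return dict(totals)

june_washers = units_by_region(sales, "washer", "June")
```

Product, region, and month act as the dimensions; fixing two of them and grouping by the third answers the ad hoc question in the card.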

15
Q

What is data mining?

A

YouTube video: Data mining creative learning
Finds hidden patterns and relationships in datasets
- Example: customer buying patterns
Infers rules to predict future behavior
Types of information obtainable from data mining:
- Associations
- Sequences
- Classifications
- Clustering
- Forecasting
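
As an illustration of the "associations" type, the sketch below counts which items are bought together. It is a deliberately simple co-occurrence count, not a full association-rule algorithm, and the baskets and the support helper are hypothetical.

```python
from collections import Counter
from itertools import combinations

# Hypothetical purchase baskets, one set of items per customer visit.
baskets = [
    {"chips", "cola"},
    {"chips", "cola", "salsa"},
    {"chips", "salsa"},
    {"cola", "bread"},
]

# Count how often each pair of items appears in the same basket.
pair_counts = Counter()
for basket in baskets:
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

def support(pair):
    """Share of all baskets that contain both items of the pair."""
    return pair_counts[tuple(sorted(pair))] / len(baskets)
```

Pairs with high support, such as chips and cola here, are candidates for the customer buying patterns mentioned above.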

16
Q

What is data mining used for?

A
  • Fraud detection
  • Card Marketing
  • Cardholder & Profitability
17
Q

What must a firm do before a new database is in place?

A

Before a new database is in place, a firm must:
– Identify and correct faulty data
– Establish better routines for editing data once the database is in operation

18
Q

What is a data quality audit?

A

A data quality audit is a structured survey of the accuracy and level of completeness of the data in an information system.
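
The completeness part of such an audit can be sketched as a per-field fill rate. The records and the completeness helper below are hypothetical, and a real audit would also check accuracy against a trusted source.

```python
# Hypothetical customer records; None marks a missing value.
records = [
    {"name": "Alice", "email": "alice@example.com", "zip": "75001"},
    {"name": "Bob", "email": None, "zip": "69001"},
    {"name": "Carol", "email": "carol@example.com", "zip": None},
    {"name": "Dan", "email": None, "zip": "13001"},
]

def completeness(rows):
    """Return, for each field, the fraction of rows with a non-missing value."""
    fields = rows[0].keys()
    return {
        field: sum(row[field] is not None for row in rows) / len(rows)
        for field in fields
    }

report = completeness(records)
```

A low fill rate for a field (email in this sample) points to where data entry routines need attention.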

19
Q

What is data cleansing?

A

Data cleansing consists of activities for detecting and correcting data in a database that are incorrect, incomplete, improperly formatted or redundant.
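
A minimal sketch of those activities, assuming hypothetical records keyed by email: formatting is normalized, incomplete rows are dropped, and duplicates are removed. Dropping incomplete rows is one possible policy; a real process might instead try to fill in the missing values.

```python
# Hypothetical raw records with formatting problems, a gap, and a duplicate.
raw = [
    {"name": " alice ", "email": "ALICE@EXAMPLE.COM"},
    {"name": "Alice", "email": "alice@example.com"},
    {"name": "Bob", "email": ""},
    {"name": "carol", "email": "carol@example.com "},
]

def cleanse(rows):
    """Normalize formatting, drop incomplete rows, and remove duplicates."""
    seen_emails = set()
    cleaned = []
    for row in rows:
        name = row["name"].strip().title()    # fix improper formatting
        email = row["email"].strip().lower()
        if not name or not email:             # skip incomplete records
            continue
        if email in seen_emails:              # skip redundant records
            continue
        seen_emails.add(email)
        cleaned.append({"name": name, "email": email})
    return cleaned

clean = cleanse(raw)
```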