Session 5 Flashcards
What are the main problems with the traditional file environment?
- Files maintained separately by different departments
- Data redundancy
- Data inconsistency
- Program-data dependence
- Lack of flexibility
- Poor security
- Lack of data sharing and availability
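To make redundancy and inconsistency concrete, here is a minimal sketch in which two departments keep their own copy of the same customer. The file contents and the helper name find_inconsistencies are invented for illustration.

```python
# Hypothetical department files, each keyed by customer ID.
# Customer 101 is stored twice (redundancy) with different cities (inconsistency).
sales_file = {
    101: {"name": "J. Martin", "city": "Lyon"},
    102: {"name": "A. Dupont", "city": "Nice"},
}
billing_file = {
    101: {"name": "J. Martin", "city": "Paris"},
    103: {"name": "C. Bernard", "city": "Lille"},
}


def find_inconsistencies(file_a, file_b):
    """Return records that exist in both files but do not agree."""
    conflicts = {}
    for key in file_a.keys() & file_b.keys():
        if file_a[key] != file_b[key]:
            conflicts[key] = (file_a[key], file_b[key])
    return conflicts


conflicts = find_inconsistencies(sales_file, billing_file)
for customer_id, (sales_copy, billing_copy) in conflicts.items():
    print(customer_id, sales_copy["city"], "vs", billing_copy["city"])
```

A single shared database would store the customer once, so there would be nothing to reconcile.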
What is a relational DBMS?
- Represents data as two-dimensional tables (relations)
- Each table contains data on an entity and its attributes
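A small sketch of the idea using Python's built-in sqlite3 module: one table describes one entity (a supplier), and each column is an attribute. The table name, column names, and rows are assumptions made up for this example.

```python
import sqlite3

# In-memory database so the example leaves no file behind.
conn = sqlite3.connect(":memory:")

# One table = one entity (supplier); columns = its attributes.
conn.execute(
    "CREATE TABLE supplier ("
    " supplier_number INTEGER PRIMARY KEY,"
    " name TEXT NOT NULL,"
    " city TEXT NOT NULL)"
)
conn.executemany(
    "INSERT INTO supplier VALUES (?, ?, ?)",
    [(1, "Acme Parts", "Lyon"), (2, "Bolt Supply", "Paris")],
)

# Each row returned is one supplier; each position is one attribute.
rows = conn.execute(
    "SELECT supplier_number, name, city FROM supplier ORDER BY supplier_number"
).fetchall()
for row in rows:
    print(row)
```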
What new type of database emerged from social media and GAFA (Google, Apple, Facebook, Amazon)?
NoSQL
Non-relational databases that scale out better than relational databases.
No strict schema
What are the characteristics of non-relational (NoSQL) databases?
– More flexible data model
– Data sets stored across distributed machines
– Easier to scale
– Handle large volumes of unstructured and structured data
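The sketch below imitates two of these traits in plain Python: records with no fixed schema, and records spread across several machines. It is a toy model, not the API of any real NoSQL product; the documents and the helpers find and shard_for are hypothetical.

```python
# Hypothetical documents: no strict schema, so each record may carry different fields.
documents = [
    {"id": 1, "user": "ana", "text": "Hello"},
    {"id": 2, "user": "ben", "photo_url": "img/42.jpg", "tags": ["travel"]},
    {"id": 3, "user": "ana", "video_seconds": 12},
]


def find(docs, **criteria):
    """Return documents whose fields match every given criterion."""
    return [d for d in docs if all(d.get(k) == v for k, v in criteria.items())]


def shard_for(doc_id, n_machines):
    """Pick a machine for a document (simple modulo placement, assumed for illustration)."""
    return doc_id % n_machines


ana_docs = find(documents, user="ana")
placement = {d["id"]: shard_for(d["id"], 2) for d in documents}
print([d["id"] for d in ana_docs])
print(placement)
```

Adding a machine only changes the placement rule, which is the sense in which such systems are easier to scale out.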
What are the characteristics of databases in the cloud?
– Appeal to start-ups, smaller businesses
– Amazon Relational Database Service, Microsoft SQL Azure
– Private clouds
What is Big Data?
Massive sets of unstructured/semi-structured data from web traffic, social media, sensors, and so on
Videos: YouTube (World Economic Forum and SAP)
What is the name of the tool used to analyze big data?
We use Business Intelligence Infrastructure to analyze big data.
What is the use of a data warehouse?
A data warehouse:
– Stores current and historical data from many core operational transaction systems
– Consolidates and standardizes information for use across the enterprise, but data cannot be altered
– Provides analysis and reporting tools
What is a data mart?
- Subset of data warehouse
- Typically focus on single subject or line of business
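As a rough illustration, a data mart can be pictured as a filtered view over a warehouse table. The sketch uses Python's built-in sqlite3 module; the table warehouse_sales, the view appliance_mart, and all rows are assumptions invented for this example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical warehouse table consolidating sales from several lines of business.
conn.execute(
    "CREATE TABLE warehouse_sales ("
    " sale_date TEXT, business_line TEXT, product TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO warehouse_sales VALUES (?, ?, ?, ?)",
    [
        ("2024-06-01", "appliances", "washer", 500.0),
        ("2024-06-02", "furniture", "sofa", 900.0),
        ("2024-06-03", "appliances", "dryer", 450.0),
    ],
)

# The data mart: a subset of the warehouse focused on one line of business.
conn.execute(
    "CREATE VIEW appliance_mart AS "
    "SELECT sale_date, product, amount FROM warehouse_sales "
    "WHERE business_line = 'appliances'"
)

mart_rows = conn.execute(
    "SELECT product, amount FROM appliance_mart ORDER BY sale_date"
).fetchall()
warehouse_count = conn.execute("SELECT COUNT(*) FROM warehouse_sales").fetchone()[0]
print(mart_rows, "out of", warehouse_count, "warehouse rows")
```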
What is Hadoop?
Hadoop is an open-source software framework that lets us analyze big data using clusters of ordinary, inexpensive computers.
It splits the work into smaller tasks that are processed in parallel (at the same time) and then combines the results.
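The following toy sketch shows the general idea of processing pieces of the data at the same time and merging the partial results. It runs on one machine with threads and does not use Hadoop or its API; the sample lines and function names are hypothetical.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def count_words(chunk):
    """Process one chunk of lines independently of the others."""
    return Counter(word for line in chunk for word in line.lower().split())


def parallel_word_count(lines, n_chunks=2):
    """Split the lines into chunks, count each chunk concurrently, then merge."""
    chunks = [lines[i::n_chunks] for i in range(n_chunks)]
    with ThreadPoolExecutor(max_workers=n_chunks) as pool:
        partials = list(pool.map(count_words, chunks))
    total = Counter()
    for partial in partials:
        total.update(partial)  # Counter.update adds the counts together
    return total


# Hypothetical log lines standing in for a much larger data set.
log_lines = ["error disk full", "login ok", "error timeout", "login ok"]
totals = parallel_word_count(log_lines)
print(totals.most_common(3))
```

In a real cluster the chunks would live on different machines, but the split, process, and combine steps follow the same pattern.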
What is IoT?
IoT = Internet of Things
Operational data can be put directly into the data warehouse because it is clearly structured.
What is OLAP?
Online Analytical Processing
Supports multidimensional data analysis
- Viewing data using multiple dimensions
- Each aspect of information (product, pricing, cost, region, time period) is a different dimension
- Example: How many washers were sold in the East in June compared with other regions?
OLAP enables rapid, online answers to ad hoc queries.
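The washer example can be sketched in a few lines of Python: each sales fact carries several dimensions, and a query fixes some of them while totaling along another. The sales figures and the helper total_by are invented for illustration; real OLAP tools precompute such totals so that answers come back quickly.

```python
from collections import defaultdict

# Hypothetical sales facts; product, region, and month are the dimensions.
sales = [
    {"product": "washer", "region": "East", "month": "June", "units": 120},
    {"product": "washer", "region": "East", "month": "June", "units": 30},
    {"product": "washer", "region": "West", "month": "June", "units": 95},
    {"product": "washer", "region": "North", "month": "June", "units": 60},
    {"product": "washer", "region": "East", "month": "May", "units": 100},
    {"product": "dryer", "region": "East", "month": "June", "units": 40},
]


def total_by(facts, dimension, **filters):
    """Total units along one dimension, keeping only facts that match the filters."""
    totals = defaultdict(int)
    for fact in facts:
        if all(fact[key] == value for key, value in filters.items()):
            totals[fact[dimension]] += fact["units"]
    return dict(totals)


# Washers sold in June, compared across regions.
june_washers = total_by(sales, "region", product="washer", month="June")
# Same data viewed along another dimension: washers in the East, by month.
east_by_month = total_by(sales, "month", product="washer", region="East")
print(june_washers)
print(east_by_month)
```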
What is data mining?
Video: YouTube, Data mining creative learning
Finds hidden patterns and relationships in datasets
- Example: customer buying patterns
Infers rules to predict future behavior
Types of information obtainable from data mining:
- Associations
- Sequences
- Classifications
- Clustering
- Forecasting
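As one example of the first type, associations, the sketch below counts how often two items appear in the same purchase and reports that share as "support". The baskets are made up, and this is a bare-bones count rather than a full association-rule algorithm.

```python
from collections import Counter
from itertools import combinations

# Hypothetical purchase baskets (one set of items per customer visit).
baskets = [
    {"chips", "cola"},
    {"chips", "cola", "salsa"},
    {"bread", "butter"},
    {"chips", "salsa"},
]


def pair_support(transactions):
    """Return the fraction of transactions in which each item pair occurs together."""
    counts = Counter()
    for basket in transactions:
        # sorted() gives each pair one canonical order, e.g. ("chips", "cola").
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    n = len(transactions)
    return {pair: count / n for pair, count in counts.items()}


support = pair_support(baskets)
for pair, share in sorted(support.items(), key=lambda item: (-item[1], item[0])):
    print(pair, share)
```

Pairs with high support, such as chips and cola here, are candidates for rules like "customers who buy chips often buy cola".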