Business Intelligence & Big Data Flashcards
What is Business Intelligence?
Includes a wide range of applications, practices and technologies for the extraction, transformation, integration, analysis, interpretation and presentation of data to support improved decision making
What is a Data Warehouse?
A database that stores large amounts of historical data in a form that readily supports analysis and management decision making
What is ETL?
A process used to pull data from different data sources to populate and maintain the data warehouse
Explain the Extract step
Designed to access the various data sources and pull from each source the data desired to update the data warehouse
Explain the Transform step
The data that will be used to update the data warehouse is edited and, if necessary, converted to a different format
Explain the Load step
Updates the existing data warehouse with the data that have passed through the extract and transform steps. This creates a new, updated version of the data warehouse.
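The three ETL steps can be sketched as a small pipeline. This is only an illustration under stated assumptions: the two source lists (`crm_rows`, `pos_rows`), the `sales` table, and the date formats are hypothetical, and an in-memory SQLite database stands in for the data warehouse.

```python
import sqlite3

# Hypothetical source systems with different formats (assumed for illustration).
crm_rows = [{"customer": "Ada", "amount": "120.50", "date": "2024-01-15"}]
pos_rows = [("Bob", 80, "15/02/2024"), ("Eve", None, "01/03/2024")]

def extract():
    """Extract: pull the desired records from each source."""
    for r in crm_rows:
        yield ("crm", r["customer"], r["amount"], r["date"])
    for name, amount, date in pos_rows:
        yield ("pos", name, amount, date)

def transform(records):
    """Transform: reject incomplete rows and convert to one common format."""
    for source, name, amount, date in records:
        if amount is None:
            continue  # incomplete record, not loaded
        if source == "pos":
            day, month, year = date.split("/")  # DD/MM/YYYY -> YYYY-MM-DD
            date = f"{year}-{month}-{day}"
        yield (name, float(amount), date)

def load(conn, rows):
    """Load: update the warehouse table with the transformed rows."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales "
        "(customer TEXT, amount REAL, sale_date TEXT)"
    )
    with conn:  # commit on success, roll back on error
        conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)

warehouse = sqlite3.connect(":memory:")  # stand-in for the data warehouse
load(warehouse, transform(extract()))
loaded = warehouse.execute(
    "SELECT customer, amount, sale_date FROM sales ORDER BY customer"
).fetchall()
print(loaded)
```

The row for "Eve" is dropped in the transform step because its amount is missing, so only two rows reach the warehouse.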
What is a Data Mart?
A smaller version of a data warehouse, scaled down to meet the needs of a specific business unit.
What is Big Data?
Data collections so enormous and complex that traditional relational database management software, hardware, and analysis processes are incapable of dealing with them
ACID
Atomicity, Consistency, Isolation, Durability
- Properties that guarantee database transactions are processed reliably and ensure the integrity of data in the database.
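A minimal sketch of atomicity and consistency, using Python's built-in `sqlite3` module. The `accounts` table, the `transfer` helper, and the balances are hypothetical. Isolation and durability are not demonstrated here, since the example uses a single connection to an in-memory database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# The CHECK constraint is the consistency rule: no negative balances.
conn.execute(
    "CREATE TABLE accounts "
    "(name TEXT PRIMARY KEY, balance INTEGER CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO accounts VALUES (?, ?)",
                 [("alice", 100), ("bob", 50)])
conn.commit()

def transfer(conn, src, dst, amount):
    """Move money in one transaction: both updates apply, or neither does."""
    try:
        with conn:  # commits on success, rolls back on exception
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, dst))
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, src))
        return True
    except sqlite3.IntegrityError:
        return False  # constraint violated, transaction rolled back

ok = transfer(conn, "alice", "bob", 30)       # succeeds
failed = transfer(conn, "alice", "bob", 500)  # would overdraw alice
balances = dict(conn.execute("SELECT name, balance FROM accounts"))
print(ok, failed, balances)
```

In the failed transfer, the credit to "bob" is executed first and then undone by the rollback, which is the all-or-nothing behavior that atomicity describes.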
What is an In-Memory Database?
A database management system that stores an entire database in random access memory (RAM)
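As a small illustration, SQLite can hold a whole database in RAM when opened with the special name `:memory:`. SQLite is an embedded engine rather than a full in-memory DBMS product, so treat this only as a sketch of the idea. The `readings` table and its values are hypothetical.

```python
import sqlite3

# ":memory:" tells SQLite to keep the entire database in RAM, with no file.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE readings (sensor TEXT, value REAL)")
db.executemany("INSERT INTO readings VALUES (?, ?)",
               [("t1", 20.5), ("t1", 21.5), ("t2", 19.0)])
avg_t1 = db.execute(
    "SELECT AVG(value) FROM readings WHERE sensor = 't1'"
).fetchone()[0]

# A second ":memory:" connection is a separate, empty database:
# nothing was written to disk to be shared.
other = sqlite3.connect(":memory:")
tables_in_other = other.execute(
    "SELECT name FROM sqlite_master WHERE type = 'table'"
).fetchall()
print(avg_t1, tables_in_other)
```

The trade-off shown by the second connection is that the data disappears when the connection closes, unless it is separately persisted.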
Business Intelligence Tools
Spreadsheets
Dashboards
OLAP (Online Analytical Processing)
Drill-Down Analysis
Data Mining
Reporting and Querying Tools
What is OLAP?
A method used to analyze multidimensional data from many different perspectives
- Example dimensions: time, metric, location
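Viewing the same data from different perspectives can be sketched as aggregating one measure over different combinations of dimensions. The `facts` rows, the dimension names, and the `rollup` helper are hypothetical. Real OLAP tools typically precompute and index such aggregates, which this sketch does not attempt.

```python
from collections import defaultdict

# Hypothetical fact rows: three dimensions (year, region, product), one measure.
facts = [
    {"year": 2023, "region": "East", "product": "Widget", "sales": 100},
    {"year": 2023, "region": "West", "product": "Widget", "sales": 150},
    {"year": 2024, "region": "East", "product": "Gadget", "sales": 200},
    {"year": 2024, "region": "West", "product": "Widget", "sales": 50},
]

def rollup(rows, dims, measure="sales"):
    """Sum the measure for every combination of the chosen dimensions."""
    totals = defaultdict(int)
    for row in rows:
        key = tuple(row[d] for d in dims)
        totals[key] += row[measure]
    return dict(totals)

by_year = rollup(facts, ["year"])                   # time perspective
by_region = rollup(facts, ["region"])               # location perspective
by_year_region = rollup(facts, ["year", "region"])  # two dimensions at once
print(by_year, by_region, by_year_region, sep="\n")
```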
What is Drill-Down Analysis?
Involves the interactive examination of high-level summary data in increasing detail to gain insight into certain elements
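Drill-down can be sketched as re-summarizing the same data one level lower in a hierarchy, restricted to the element of interest. The year, quarter, and month hierarchy, the `sales` rows, and the `summarize` helper are hypothetical.

```python
from collections import Counter

# Hypothetical rows: (year, quarter, month, sales amount).
sales = [
    ("2024", "Q1", "Jan", 40),
    ("2024", "Q1", "Feb", 10),
    ("2024", "Q1", "Mar", 50),
    ("2024", "Q2", "Apr", 70),
    ("2024", "Q2", "May", 30),
]
LEVELS = ("year", "quarter", "month")

def summarize(rows, level, **filters):
    """Total sales at one hierarchy level, limited to rows matching filters."""
    idx = LEVELS.index(level)
    totals = Counter()
    for row in rows:
        labels = dict(zip(LEVELS, row[:3]))
        if all(labels[k] == v for k, v in filters.items()):
            totals[row[idx]] += row[3]
    return dict(totals)

top = summarize(sales, "year")                                   # high-level summary
quarters = summarize(sales, "quarter", year="2024")              # drill into 2024
q1_months = summarize(sales, "month", year="2024", quarter="Q1") # drill into Q1
print(top, quarters, q1_months, sep="\n")
```

Each call answers a narrower question than the one before: the yearly total, then the quarters within that year, then the months within one quarter.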
Explain Data Mining
A BI tool used to explore large amounts of data for hidden patterns to predict future trends and behaviors for use in decision making
Process of Data Mining
- Selection of the data to be used
- Preprocessing of the data to remove erroneous, incomplete or inconsistent data
- Transformation of the data to create an appropriate set of data to support the data mining technique to be used
- The actual data mining process
- Evaluation of the results
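The five steps above can be sketched end to end with a toy market-basket example. Frequent item-pair counting is used here only as one possible mining technique. The `raw` transactions, the choice of store "A", and the 0.5 support threshold are all hypothetical.

```python
from collections import Counter
from itertools import combinations

# Hypothetical raw transactions from two stores.
raw = [
    {"store": "A", "items": ["Bread", "Milk"]},
    {"store": "A", "items": ["bread", "milk", "Eggs"]},
    {"store": "A", "items": None},       # incomplete record
    {"store": "B", "items": ["Beer"]},   # different store
    {"store": "A", "items": ["Milk", "Eggs"]},
    {"store": "A", "items": ["bread", "milk"]},
]

# 1. Selection: keep only the data relevant to the question (store A).
selected = [r for r in raw if r["store"] == "A"]

# 2. Preprocessing: remove incomplete records.
clean = [r for r in selected if r["items"]]

# 3. Transformation: normalize item names and turn each basket into a set.
baskets = [frozenset(item.lower() for item in r["items"]) for r in clean]

# 4. Data mining: count how often each pair of items is bought together.
pair_counts = Counter(
    pair for basket in baskets for pair in combinations(sorted(basket), 2)
)

# 5. Evaluation: keep pairs whose support meets an assumed threshold.
MIN_SUPPORT = 0.5
frequent = {
    pair: count / len(baskets)
    for pair, count in pair_counts.items()
    if count / len(baskets) >= MIN_SUPPORT
}
print(frequent)
```

Support here is the fraction of cleaned baskets that contain the pair, so a pair seen in 3 of 4 baskets has support 0.75.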