Big Data & Big Business Intelligence Flashcards
What are the four characteristics of big data?
Volume - How much data is there? The size or amount of data to be analyzed.
Variety - How is the data structured? The form of the data, whether structured or unstructured.
Veracity - How reliable and accurate is the data? The quality, trustworthiness, accuracy, and precision of the data.
Velocity - How fast is the data generated? The speed at which data is generated, processed, and analyzed.
What are the storage techniques used to support data analytics?
Data warehouse - a large store of historical data
Data lake - an architecture used to store massive amounts of data in its original format until it’s needed
Data mart - a smaller version of a data warehouse
What is full refresh?
A way to load data in which the process erases the entire database and reloads it from scratch. It is used only when an entity can’t determine which data has been modified, added, or deleted.
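A full refresh can be sketched with Python's built-in sqlite3 module. This is only an illustration under stated assumptions: the `customers` table, its columns, and the sample rows are hypothetical, and a real load process would target the entity's own database.

```python
import sqlite3


def full_refresh(conn, rows):
    """Erase the target table and reload every row from the source."""
    # The "with" block commits both steps together, or rolls back on error.
    with conn:
        conn.execute("DELETE FROM customers")
        conn.executemany(
            "INSERT INTO customers (id, name) VALUES (?, ?)", rows
        )


# Hypothetical target table that already holds stale data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO customers VALUES (1, 'Old Name')")
conn.commit()

# Hypothetical source extract: we cannot tell which rows changed,
# so everything is replaced.
source_rows = [(1, "New Name"), (2, "Second Customer")]
full_refresh(conn, source_rows)

loaded = conn.execute("SELECT id, name FROM customers ORDER BY id").fetchall()
print(loaded)
```

The trade-off is simplicity against cost: nothing needs to track changes, but every row is rewritten on every load.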
What are the four types of data analytics?
Descriptive - focuses on the past; identifies previously unknown patterns
Diagnostic - focuses on the past; looks for the source or cause of the patterns identified
Predictive - focuses on the future; forecasts trends and patterns
Prescriptive - focuses on the future; predicts outcomes and recommends the best course of action going forward
What are the Business Intelligence technologies for cloud computing?
Cloud Computing
- Storage
- Processing on third-party infrastructure
- Web based applications
What are the Business Intelligence technologies for storage?
Data warehouse
Data lake
Data mart
What are the Business Intelligence technologies for software applications?
- Data analysis and visualization tools
- Dashboards
- Machine Learning
What is data mining useful in figuring out?
Patterns and trends
What are the three main roles for data governance described by ISACA?
Data Owners - responsible for strategic decisions that provide overall direction for data governance.
Data Stewards - responsible for the business perspective. Data stewards are data experts from different functional domains who focus on correct data usage.
Data Custodians - IT employees responsible for managing data from a technical perspective.
What is ETL and how is it used?
Accountants use ETL (extract, transform, load) to prepare data for analysis
Extract - when extracting data, accountants need to consider its relevance, accuracy, completeness, validity, timeliness, and consistency
Transform - extracted data can be inconsistent and have formatting issues. One transformation method is to aggregate the data.
Load - the transformed data is loaded into the target system, such as a data warehouse
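The three ETL phases can be sketched in a few lines of Python using only the standard library. This is a minimal illustration, not a production pipeline: the CSV content, the `sales_by_region` table, and the function names are all hypothetical.

```python
import csv
import io
import sqlite3
from collections import defaultdict

# Hypothetical raw extract with inconsistent casing, stray spaces,
# and one incomplete row (missing region).
RAW_CSV = """region,amount
east, 100.50
East,200
west,50
,75
"""


def extract(text):
    """Extract: read the raw records from the source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(records):
    """Transform: fix formatting issues, drop incomplete rows, aggregate."""
    totals = defaultdict(float)
    for rec in records:
        region = rec["region"].strip().lower()
        if not region:
            continue  # completeness check: skip rows with no region
        totals[region] += float(rec["amount"])
    return sorted(totals.items())


def load(conn, rows):
    """Load: write the transformed rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales_by_region "
        "(region TEXT PRIMARY KEY, total REAL)"
    )
    with conn:
        conn.executemany("INSERT INTO sales_by_region VALUES (?, ?)", rows)


conn = sqlite3.connect(":memory:")
load(conn, transform(extract(RAW_CSV)))
result = conn.execute(
    "SELECT region, total FROM sales_by_region ORDER BY region"
).fetchall()
print(result)
```

Keeping each phase in its own function makes it easier to test the data-quality checks separately from the load step.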
When would an accountant use probability analysis?
When there is a high level of uncertainty for a critical variable
An example would be when revenue is correlated with an economic indicator
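One simple form of probability analysis is a probability-weighted (expected value) estimate across scenarios. The sketch below assumes three hypothetical economic scenarios; the probabilities and revenue figures are made up for illustration only.

```python
# Hypothetical scenarios: (economic condition, probability, projected revenue)
scenarios = [
    ("recession", 0.25, 800_000),
    ("stable", 0.50, 1_000_000),
    ("expansion", 0.25, 1_300_000),
]


def expected_value(scenarios):
    """Return the probability-weighted average outcome."""
    total_probability = sum(p for _, p, _ in scenarios)
    if abs(total_probability - 1.0) > 1e-9:
        raise ValueError("scenario probabilities must sum to 1")
    return sum(p * value for _, p, value in scenarios)


expected_revenue = expected_value(scenarios)
print(expected_revenue)
```

The result is a single planning figure that reflects the uncertainty in the critical variable instead of relying on one point estimate.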
What type of data analytics technique would management avoid?
They would avoid primary analytics
and only consider:
Diagnostic analytics
Predictive analytics
Which data governance principle is achieved by being forthcoming about the constraints, options, and potential shortcomings of data-related decisions?
Integrity. Remember that the term “shortcoming” points to “Integrity”.
What isn’t a characteristic of a relational database (RDB)?
Storing information sequentially
Remember that an RDB uses tables, rows, and columns to store data. It also creates relationships between tables through primary and foreign keys.
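The role of primary and foreign keys can be illustrated with Python's built-in sqlite3 module. The `customers` and `invoices` tables and their columns are hypothetical names chosen for this sketch.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite only enforces foreign keys when this pragma is turned on.
conn.execute("PRAGMA foreign_keys = ON")

# Each table has a primary key that uniquely identifies a row.
conn.execute(
    "CREATE TABLE customers ("
    "customer_id INTEGER PRIMARY KEY, name TEXT NOT NULL)"
)
# The foreign key links every invoice to an existing customer.
conn.execute(
    "CREATE TABLE invoices ("
    "invoice_id INTEGER PRIMARY KEY, "
    "customer_id INTEGER NOT NULL REFERENCES customers(customer_id), "
    "amount REAL NOT NULL)"
)

conn.execute("INSERT INTO customers VALUES (1, 'Acme')")
conn.execute("INSERT INTO invoices VALUES (10, 1, 250.0)")

# The relationship lets the two tables be queried together.
rows = conn.execute(
    "SELECT c.name, i.amount FROM invoices i "
    "JOIN customers c ON c.customer_id = i.customer_id"
).fetchall()
print(rows)

# An invoice pointing at a customer that does not exist is rejected.
try:
    conn.execute("INSERT INTO invoices VALUES (11, 99, 10.0)")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True
print(orphan_rejected)
```

Rows are located through keys and relationships, not through their physical position, which is why sequential storage is not a characteristic of an RDB.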
What is not an advantage of database normalization?
Improved query speed