INFS1000 Lec 11 Business Intelligence Flashcards
What is BUSINESS INTELLIGENCE?
Methodologies / technologies that transform raw data into useful info for business purposes.
Why do organizations need business intelligence? (2)
- Business collect massive amounts of data: so need to provide right info to right person at right time.
- Data can show patterns and relationships: info that can help (1) better decision making and (2) forecast changes (e.g. drop in site views).
What are the 2 ways BI can help businesses?
- Data warehouses: storing and organizing large amounts of data.
- BI Tools / Systems: making sense of data from DWs.
How does BI achieve this? (3)
- DW: Integrating data: from multiple internal and external sources (e.g. departments, customers, suppliers).
- DW: Preparing data: data clensing.
- BIT: Analysing data: (e.g. summing, grouping, averaging).
How does a DATA WAREHOUSE work? (3)
- Regularly copies data from OPERATIONAL DATABASE to create massive stock of HISTORICAL data for analysis purposes. (e.g. sales from WW every night).
- Collates data from EXTERNAL RESOURCES (e.g. demographic, weather data).
- Creates meta-data for these large pools of data.
What problem does OPERATIONAL DATA pose for BI systems?
e.g. sales transactions, customer records, employees data etc.
- Operational data is “RAW”, collected to meet IMMEDIATE BUSINESS OBJECTIVE (e.g. generate a report), usually “DIRTY”, hence unsuitable for sophisticated / large-scale BI analysis.
- Doing analyses on operational databases might slow them down and affect day-to-day business.
What are the 4 types of DIRTY DATA?
- Incorrect data
- Values missing (e.g. postcode missing from address).
- Data INCONSISTENT: product data changes over time (i.e. diff colours / versions), time-zone specific data in international sales.
- GRANULARITY inappropriate: too much / little detail.
What are the 3 types of BI Methods / Tools?
- Reporting: (1) filtering (2) sorting (3) grouping (4) simple calculations.
- OLAP - Online Analytical Processing: MULTIDIMENSIONAL aggregation / visualization of historical data.
- Data Mining: sophisticated statistical tehcniques to discover patterns / relationships used to predict future outcomes.
What are the advantages of OLAP?
- Very quick (i.e. in real time), thus enables executives to make timely decisions.
- Can view information in different configurations.
Define the 7 main applications of data mining:
- Consumer clustering
- Customer churn
- Fraud detection
- Direct marketing
- Interactive marketing
- Market basket analysis
- Trend analysis
- Consumer clustering: identify common characteristics of customers who tend to buy same products / services.
- Customer churn: identify reason customer switch to competitors, predict why customer are likely to do it. (e.g. complaint history, number of days to contract expiry, phone model).
- Fraud detection: identify characteristics of transactions that are most likely to be fraudulent.
- Direct marketing: identify which prosepctive clients should be included in mailing lists to obatin highest response rate.
- Interactive marketing: predict what each individual accessing a website is most likely to be interested in seeing.
- Market basket analysis: understand what products / services are commonly purchased togehter, and on what days of the week.
- Trend analysis: reveal difference between typical customer this month and last month.
What are the 6 things that drive data mining?
- Statistics / maths.
- AI / Machine learning.
- Huge databases.
- Cheap computer processing and storage.
- Sophisticated marketing, finance, and other business professionals.
- Data management tech.
What are does MARKET BASKET ANALYSIS help with?
Identify CROSS-SELLING OPPORTUNITIES.
Define probability, support, confidence, lift.
P(Mask)
Support P(Mask AND Fin)
Confidence P(Mask / Fin) = P(Mask AND Fin) / P(Fin)
Lift: ratio of confidence to the base probability of just buying items P(Mask / Fin) / P(Fin).
What are the 7 main applications of data mining?
What are some extra ones?
- Consumer clustering
- Customer churn
- Fraud detection
- Direct marketing
- Interactive marketing
- Market basket analysis
- Trend analysis
Other: inferring demographics (e.g. Amazon), loyalty programs (e.g. casinos and hotels).