INFS1000 Lec 11 Business Intelligence Flashcards

1
Q

What is BUSINESS INTELLIGENCE?

A

Methodologies / technologies that transform raw data into useful info for business purposes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why do organizations need business intelligence? (2)

A
  1. Business collect massive amounts of data: so need to provide right info to right person at right time.
  2. Data can show patterns and relationships: info that can help (1) better decision making and (2) forecast changes (e.g. drop in site views).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 2 ways BI can help businesses?

A
  1. Data warehouses: storing and organizing large amounts of data.
  2. BI Tools / Systems: making sense of data from DWs.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does BI achieve this? (3)

A
  1. DW: Integrating data: from multiple internal and external sources (e.g. departments, customers, suppliers).
  2. DW: Preparing data: data clensing.
  3. BIT: Analysing data: (e.g. summing, grouping, averaging).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How does a DATA WAREHOUSE work? (3)

A
  • Regularly copies data from OPERATIONAL DATABASE to create massive stock of HISTORICAL data for analysis purposes. (e.g. sales from WW every night).
  • Collates data from EXTERNAL RESOURCES (e.g. demographic, weather data).
  • Creates meta-data for these large pools of data.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What problem does OPERATIONAL DATA pose for BI systems?

e.g. sales transactions, customer records, employees data etc.

A
  • Operational data is “RAW”, collected to meet IMMEDIATE BUSINESS OBJECTIVE (e.g. generate a report), usually “DIRTY”, hence unsuitable for sophisticated / large-scale BI analysis.
  • Doing analyses on operational databases might slow them down and affect day-to-day business.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the 4 types of DIRTY DATA?

A
  1. Incorrect data
  2. Values missing (e.g. postcode missing from address).
  3. Data INCONSISTENT: product data changes over time (i.e. diff colours / versions), time-zone specific data in international sales.
  4. GRANULARITY inappropriate: too much / little detail.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the 3 types of BI Methods / Tools?

A
  1. Reporting: (1) filtering (2) sorting (3) grouping (4) simple calculations.
  2. OLAP - Online Analytical Processing: MULTIDIMENSIONAL aggregation / visualization of historical data.
  3. Data Mining: sophisticated statistical tehcniques to discover patterns / relationships used to predict future outcomes.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the advantages of OLAP?

A
  • Very quick (i.e. in real time), thus enables executives to make timely decisions.
  • Can view information in different configurations.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define the 7 main applications of data mining:

  1. Consumer clustering
  2. Customer churn
  3. Fraud detection
  4. Direct marketing
  5. Interactive marketing
  6. Market basket analysis
  7. Trend analysis
A
  1. Consumer clustering: identify common characteristics of customers who tend to buy same products / services.
  2. Customer churn: identify reason customer switch to competitors, predict why customer are likely to do it. (e.g. complaint history, number of days to contract expiry, phone model).
  3. Fraud detection: identify characteristics of transactions that are most likely to be fraudulent.
  4. Direct marketing: identify which prosepctive clients should be included in mailing lists to obatin highest response rate.
  5. Interactive marketing: predict what each individual accessing a website is most likely to be interested in seeing.
  6. Market basket analysis: understand what products / services are commonly purchased togehter, and on what days of the week.
  7. Trend analysis: reveal difference between typical customer this month and last month.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the 6 things that drive data mining?

A
  1. Statistics / maths.
  2. AI / Machine learning.
  3. Huge databases.
  4. Cheap computer processing and storage.
  5. Sophisticated marketing, finance, and other business professionals.
  6. Data management tech.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are does MARKET BASKET ANALYSIS help with?

A

Identify CROSS-SELLING OPPORTUNITIES.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Define probability, support, confidence, lift.

A

P(Mask)

Support P(Mask AND Fin)

Confidence P(Mask / Fin) = P(Mask AND Fin) / P(Fin)

Lift: ratio of confidence to the base probability of just buying items P(Mask / Fin) / P(Fin).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the 7 main applications of data mining?

What are some extra ones?

A
  1. Consumer clustering
  2. Customer churn
  3. Fraud detection
  4. Direct marketing
  5. Interactive marketing
  6. Market basket analysis
  7. Trend analysis

Other: inferring demographics (e.g. Amazon), loyalty programs (e.g. casinos and hotels).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly