Lecture 1 - Business Analytics Trends Flashcards

1
Q

5V’s of Big Data

A
  1. Volume
  2. Variety
  3. Velocity
  4. Value
  5. Veracity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the technical challenge of data? (2)

A
  1. How to deal with volume of the data

2. How to deal with real time data processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the managerial challenge with data?

A

To manage the data that comes in from everywhere and integrate these data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Degenerative tendency?

A

To focus inward on coasts and efforts, rather than outward on opportunities, changes and threats.

Pro of big data (comes from external sources) is that it force companies to use external data instead of only internal data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the most valuable thing you can do with big data?

A

Developing new products and services

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Caveat 1: Big data and thick data

A

Beware of quant bias or quant addiction; don’t trust too much only on quantitative data, also look at what is not measurable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Thick Data is?

A

Data in which emotions are visible and the context where the data is collected is taken into account.

  • Big data needs to be supported with Thick data.
  • Context can relativise or give meaning to the data.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Caveat 2: Relate to business challenges

A

The challenge is not to just start exploring the data but to be aware of your business goals and focus your data analytic efforts on solving these
problems and improving these decisions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why Big Data now? combination of different causes (4)

A
  1. Availability of massive amounts of digital data
  2. Combination of technical developments and societal needs
  3. A philosophical view: Rationalism vs empiricism
  4. The discovery, in science itself, of the power of data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why Big Data now: Technological developments (6)

A

Radical changes in: (now vs. how it was)
1. The way elementary data are captured
o Sensors (automated) vs keyboard (human)
2. The way data is stored → business intelligence (data warehouses, data analytics etc.)
o Main memory and cloud vs hard disk
3. The way data is analyzed
o Data-driven methods vs sampling
4. The way data is provided to users
o Data logistics (keeping data together) vs data integration
5. The way data is presented → visualization
o Graphical interactive visualizations vs management reports
6. The way knowledge (business rules, models) is created → by means of machine learning/
data mining
o Learning/mining vs (labor-intensive) knowledge acquisition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Rationalism vs empiricism

A

Rationalism: theory, thinking and abstract knowledge
Empiricism: rely on data, empirical data

Now we are in the empirical time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Researchers view

A

More data is more important than better algorithms.

The more data you use for the training of the algorithm, the better the performance of the algorithm.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How did we go from Big data to AI? (3)

A
  1. The gap between the data and what you can do with it becomes bigger.
  2. Data became the problem, but the problem empowered the solution.
  3. Big data enabled AI machine learning techniques, which become a solution of solving these gaps.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How are AI technologies driven by data and analytics? (3)

A
  1. We use historical data for training
  2. New data for re-training
  3. With the models, make predictions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Three types of learning:

A
  1. Supervised: try to predict the variable you want to learn
  2. Unsupervised: don’t have any initial variable to learn
  3. Reinforcement: learn by trying out
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

From action to a decision (4):

A
  1. Monitor (action) > data
  2. Contextualize the data > information
  3. Learning from information > knowledge
  4. Information + knowledge = decision
17
Q

How does Provosts (2012) position data science? (3)

A
  1. Data science is somewhere between data engineering as part of computer science, the technology part and data driven decision making.
  2. Automated data driven decision making is in the area of business analytics.
  3. Data science uses the techniques from AI, data management to support this automated data drive decision making (DDD).
18
Q

Data science

A

A set of fundamental principles that support and guide the principled extraction of information and knowledge from data.

19
Q

Data mining

A

The actual extraction of knowledge from data, via technologies that incorporate these principles.

20
Q

Data-driven decision making (DDD)

A

The practice of basing decisions on the analysis of data, rather than purely on intuition.

21
Q

Data sciences principles (Provost, 2012): (2)

A
  1. Entities that are similar with respect to known features or attributes are similar with respect to unknown features or attributes
  2. Extracting useful knowledge from data to solve business problems can be treated systematically by following a process with reasonably well-defined stages.
22
Q

More data science principles: (4)

A
  1. If you look too hard you will find something but it might not generalise beyond the data.
  2. To draw causal conclusions, look at confounding factors, possibly unseen ones.
  3. Look if you found a local optimum or the right optimum.
  4. decompose the problem into sub-problems
23
Q

Process mining

A

Learn about the process model from the data