Big Data Flashcards

1
Q

What is the definition of Big Data?

A

Large volumes of extensively varying data that is generated and processed at high velocity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How is Big Data becoming increasingly troublesome in IS/IT?

A

It cannot be easily stored on a single computer or sent over a firm’s network. How do we analyse?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Give some examples of Big Data sources:

A

Loyalty cards, online transactions, tracking systems, healthcare, social media, image metadata.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the three Vs of Big Data with examples?

A

Volume: Large Hadron Collider generated 13 PetaBytes of data in 1 year.
Velocity: Users demand real-time services - on-demand video generates 100 billion personalised suggestions every week.
Variety: text, images audio, 3D on social media.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Data Analytics?

A

Using software to analyse data to answer: Why are there differences between groups?, Why is this?, How can I improve in this respect?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How might business analytics be used in a business context?

A

Gain information to make more profit and lower overheads (Can give generic examples)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is multi-level analysis?

A

Looking from small specific trends to large trends. i.e. From an individual to whole market.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Give some examples of questions Big Data Analytics can answer:

A

What are we doing well and how do we maintain this?
What are we doing badly and how can we fix this?
How are competitors doing?
How did competitors get to where they are?
Forecasts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What must you think about when performing analytics?

A

Who are the observers and what do they want?
How often do analytics need to be carried out (Frequency)?
What exactly do we want to know?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Expand on observers and what they want:

A

Which stakeholder is the analysis for? What would they like to know (e.g. how to increase sales)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Expand on frequency:

A

For example should the frequency be seasonal (clothing industry). Or minute by minute as it might be when food shopping. How can more be sold.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Expand on what we want to know:

A

Sales? Profit? Waste? Costs? etc.

How do you want to implement the results? e.g. marketing, new products, changing prices etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is ‘actionable insight’?

A

It is results from big data analysis that provides what was required (insight) and how to achieve it (actionable).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Give the basic systematic analytic strategy:

A

Observer needs -> Focal ‘thing’ -> Obtain data -> analysis and action -> Observer needs…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Give a scenario where actionable insight is typically used in the consumer industry:

A

Reactivation or re-engagement of customers. Using big data analytics to find a way to make customers return to your business. e.g. personalised ads or amazons repeat purchase option for things that run out e.g. glue.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the 6 debates for big data?

A
  1. Inductive vs deductive.
  2. Algorithms vs humans.
  3. Centralised vs decentralised.
  4. Model improvement vs Innovation.
  5. Controlled access vs open access.
  6. Ethics.
17
Q

What are the 6 debates for big data?

A
  1. Inductive vs deductive.
  2. Algorithms vs humans.
  3. Centralised vs decentralised.
  4. Model improvement vs Innovation.
  5. Controlled access vs open access.
  6. Ethics.
18
Q

Expand on centralised vs decentralised:

A

To what extent will your company carry out analytics? select groups/ whole business or individual departments. How much will management be involved?

19
Q

Expand on centralised vs decentralised:

A

To what extent will your company carry out analytics? select groups/ whole business or individual departments. How much will management be involved?

20
Q

Expand on model improvement vs Innovation:

A

Use information provided to improve a business model or create a whole new one? e.g. Netflix disc to streaming.

21
Q

Expand on controlled access vs open access:

A

Organisations distributing their data through formal and secure channels so only specific groups can analyse or make data public to allow for things like crowdsourcing.

22
Q

Think about the adv vs disadv of all:

A

Check powerpoint.