Lecture 1 Flashcards

1
Q

Tall Data

A

many observation, few variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Wide data

A

few observations, many variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Advantages of Big Data

A

1.Big
2.Always on
3.Nonreactive ( people are not aware they have been recorded)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Incomplete data

A

Some info is missing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Inaccessible data
ex.PPD

A

Outside of the organization:
business and ethical barriers to access the data

Inside of the organization:
databases are not integrated within the system

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Unrepresentative data

A

Invalid data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Dirty data

A

Loaded with junk or spam

Ex. Twitter bots; Fake reviews

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Sensitive data

A

releasing privacy or confidential details

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Big data

A

a collection of complex data sets, which uses tools and models to extract insights from it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

primary data

A

data collected to answer a research data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

secondary data

A

data collected for non-research purposes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Uses of Big data

A

1.Personalization
(recommendation algorithms)

2.Boosting engagement
(Facebook likes)

3.New product development

  1. Reducing churn
    ( when a customer quits)
  2. Public for economy
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Customer churn

A

customer quits some service

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Is Big Data biased or unbiased?

A

Biased

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Insights of Big Data

A

Big is relative
Data quality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Calculation of the relative size

A

f = n/N
(sample size/ population size)

17
Q

Disadvantages of Big Data

A

1.Inaccessible
2.Incomplete
3.Non-representative
4.Drifting
5. Algorithmically confounded
6.Dirty
7.Sensitive

18
Q

Algorithmically confounded

A

the design of the platform can influence user’s behavior, introducing biases

ex. Recommended searches, will increase the magnitude of certain searches.

19
Q

Drifting

“If you want to measure change, don’t change the measure”

A

the measure or method changes the system over the time

ex. Google changes the data-generating process, to improve its customer service.