Week 7 & 8 Flashcards

1
Q

Data linkage

A

The merging of datasets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Data integration & augmentation

A

Complementing survey data with additional information, usually from non-traditional data sources.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Data forms

A

Designed data (experiments, surveys) and organic data (aspirational, transactional)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Forms of data in big data

A

Transactional data, social media data, Internet of Things (IoT) data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Charasteristics of big data (7 V’s)

A
  1. Volume - big data is massive
  2. Variety - (no) structure
  3. Velocity - the speed of new data generation
  4. Veracity - accuracy
  5. Variability - the difference in meaning
  6. Value - increases when linked
  7. Visualization
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Three ethical principles

A
  1. Beneficence - minimizing harm, while maximizing benefit
  2. Justice - burdens of research should not be unequally shared among groups of subjects, with some baring burdens and sme reaping benefits
  3. Autonomy - obtaining informed consent
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Differences surveys and big data

A

Surveys use ‘designed data’, big data is ‘organic data’
For surveys researchers have control over the content, for big data there is no control
Surveys have a detailed documentation on the data generating process, big data don’t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Data curation

A

The management of data, to preserve the data to make analysis and reuse possible.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Data provenance

A

The process of tracing and recoding the origins of data and its movement between data bases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Privacy-utility trade-off

A

Balancing the risk to the participants while still getting the most out of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The big data paradox

A

The bigger the dataset, the bigger the chance of bias. As with every added N the confidence interval becomes narrower, the chance the real value is inside becomes smaller.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly