Module 1 - Types of Data in Big Data Environments Flashcards

1
Q

Types of data processed by Big Data solutions

A
  1. structured data
  2. unstructured data
  3. semi-structured data
  4. metadata (technically not a data type)
2
Q

Synonym for "type of data"?

A

data format

3
Q

Structured Data characteristics are:

A
  1. conforms to a data model
  2. is stored in tabular form
  3. can be relational
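
As a minimal sketch of these three characteristics, the snippet below stores rows in a table whose schema acts as the data model and then queries it with SQL. The `customer` table, its columns, and the sample rows are hypothetical and chosen only for illustration.

```python
import sqlite3

def build_customer_table():
    """Create an in-memory table with a fixed schema (the data model)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE customer ("
        "id INTEGER PRIMARY KEY, name TEXT NOT NULL, city TEXT NOT NULL)"
    )
    # Every row must fit the same columns: this is the tabular form.
    conn.executemany(
        "INSERT INTO customer (id, name, city) VALUES (?, ?, ?)",
        [(1, "Ada", "London"), (2, "Linus", "Helsinki")],
    )
    conn.commit()
    return conn

conn = build_customer_table()
# Because the structure is known in advance, plain SQL is enough to query it.
rows = conn.execute(
    "SELECT name FROM customer WHERE city = ?", ("London",)
).fetchall()
print(rows)
```

Because the schema is declared up front, no custom parsing logic is needed before querying.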
4
Q

Unstructured Data Characteristics are:

A
  1. does not conform to a data model or data schema
  2. is generally inconsistent and non-relational

5
Q

Differences between structured and unstructured data?

A
  1. Unstructured data requires special or customized logic for pre-processing and storage, while structured data does not
  2. Unstructured data cannot be processed or queried directly with SQL
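
To illustrate what "customized logic" can look like, here is a small sketch that pulls order numbers out of free text with a regular expression. The note text, the `ORDER_ID` pattern, and the `extract_order_ids` helper are hypothetical; real unstructured data usually needs far more elaborate pre-processing.

```python
import re

# Hypothetical pattern: the word "order", optional "#", then digits.
ORDER_ID = re.compile(r"order\s+#?(\d+)", re.IGNORECASE)

def extract_order_ids(text):
    """Return the order numbers mentioned in a free-text note."""
    return [int(match) for match in ORDER_ID.findall(text)]

note = "Customer called about order #1042 and later asked to cancel Order 1077."
ids = extract_order_ids(note)
print(ids)
```

Only after a step like this do the extracted values fit a table that SQL can query.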
6
Q

Semi-Structured data is?

A

Data that has a defined level of structure but is not relational in nature
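
A short sketch of the idea, assuming JSON as the carrier format (JSON is not named on this card; it is used here only as a familiar illustration). Both records are well-formed and self-describing, yet they do not share one fixed set of columns, which is why they do not map cleanly onto a relational table. The sample records and the `field_paths` helper are hypothetical.

```python
import json

raw = """
[
  {"id": 1, "name": "Ada", "phones": ["555-0100", "555-0101"]},
  {"id": 2, "name": "Linus", "address": {"city": "Helsinki"}}
]
"""
records = json.loads(raw)

def field_paths(value, prefix=""):
    """Return the set of dotted key paths found in a nested JSON value."""
    paths = set()
    if isinstance(value, dict):
        for key, child in value.items():
            path = f"{prefix}.{key}" if prefix else key
            paths.add(path)
            paths |= field_paths(child, path)
    elif isinstance(value, list):
        for child in value:
            paths |= field_paths(child, prefix)
    return paths

# Each record has structure, but the structure differs from record to record.
shapes = [sorted(field_paths(record)) for record in records]
print(shapes)
```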

7
Q

Examples of semi-structured data?

A
  1. Excel spreadsheets
  2. EDI
  3. emails
  4. RSS feeds
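
Taking email as one example from the list, the sketch below shows why it counts as semi-structured: the headers follow a defined key/value layout, while the body is free text. The message content and addresses are made up for illustration; parsing uses Python's standard `email` package.

```python
from email import message_from_string

raw_email = (
    "From: alice@example.com\n"
    "To: bob@example.com\n"
    "Subject: Delivery delay\n"
    "\n"
    "Hi Bob, the shipment will arrive two days late. Sorry!\n"
)
msg = message_from_string(raw_email)

# Structured part: named header fields that can be looked up directly.
headers = {name: msg[name] for name in ("From", "To", "Subject")}
# Unstructured part: the body is plain prose with no predefined fields.
body = msg.get_payload()
print(headers)
print(body)
```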
8
Q

Examples of metadata?

A
  1. XML tags
  2. attributes providing the file size and resolution of a digital photograph
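
Both examples can be seen in one small sketch: an XML element whose tag and attributes describe the data it wraps. The `photo` element, its attribute names, and the values are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical document: the tag and attributes are metadata,
# the element text is the data being described.
doc = '<photo file_size_kb="2048" resolution="4000x3000">sunset.jpg</photo>'
element = ET.fromstring(doc)

metadata = {"tag": element.tag, **element.attrib}
content = element.text
print(metadata)
print(content)
```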

9
Q

Which has a higher noise-to-signal ratio: structured, unstructured, or semi-structured data?

A

Semi-structured and unstructured data
