Module 1 - Types of Data in Big Data Environments Flashcards
1
Q
Types of data processed by Big Data solutions
A
- structured data
- unstructured data
- semi-structured data
- metadata (technically not a data type)
2
Q
Synonym for "type of data"?
A
data format
3
Q
Structured Data characteristics are:
A
- conforms to a data model
- is stored in tabular form
- can be relational
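
A minimal sketch of the characteristics above, assuming Python's built-in sqlite3 module as a stand-in for a relational store. The customers table and its rows are hypothetical and exist only for illustration.

```python
import sqlite3

# Hypothetical table and rows, used only for illustration.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, city TEXT)"
)
conn.executemany(
    "INSERT INTO customers (id, name, city) VALUES (?, ?, ?)",
    [(1, "Ana", "Lisbon"), (2, "Ben", "Oslo"), (3, "Cho", "Lisbon")],
)

# Every row conforms to the same model, so plain SQL can query it.
lisbon_names = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM customers WHERE city = ? ORDER BY id", ("Lisbon",)
    )
]
conn.close()
print(lisbon_names)  # ['Ana', 'Cho']
```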
4
Q
Unstructured Data Characteristics are:
A
- does not conform to a data model or data schema
- is generally inconsistent and non-relational
5
Q
Differences between structured and unstructured data?
A
- Unstructured data requires special or customized logic for pre-processing and storage, while structured data does not
- Unstructured data cannot be processed directly with SQL
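
A rough illustration of "customized logic", assuming a free-text note as the unstructured input. The note text and the date pattern are made up for this sketch; a regular expression is only one of many possible pre-processing approaches.

```python
import re

# Hypothetical free-text note: no columns, no schema, nothing SQL can select from.
note = "Customer Ana called on 2021-03-04 about a late parcel; Ben emailed 2021-03-09."

# Custom logic is needed to pull out anything usable, here ISO-style dates.
dates = re.findall(r"\d{4}-\d{2}-\d{2}", note)
print(dates)  # ['2021-03-04', '2021-03-09']
```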
6
Q
What is semi-structured data?
A
Data that has a defined level of structure but is not relational in nature
7
Q
Examples of semi-structured data?
A
- Excel spreadsheets
- EDI
- emails
- RSS feeds
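
A small sketch of why a feed counts as semi-structured, assuming a hand-written RSS snippet (the feed content and URL are hypothetical). The tags give it structure, but items need not carry the same fields, so it does not map cleanly onto a fixed table.

```python
import xml.etree.ElementTree as ET

# Hypothetical RSS snippet; the second item has no <link> element.
rss = """<rss version="2.0">
  <channel>
    <title>Example feed</title>
    <item><title>First post</title><link>https://example.com/1</link></item>
    <item><title>Second post</title></item>
  </channel>
</rss>"""

root = ET.fromstring(rss)

# findtext returns None when an element is missing, exposing the loose structure.
items = [
    {"title": item.findtext("title"), "link": item.findtext("link")}
    for item in root.iter("item")
]
print(items)
```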
8
Q
Examples of metadata?
A
- XML tags
- attributes providing the file size and resolution of a digital photograph
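
A minimal sketch of reading file metadata, assuming Python's standard os and tempfile modules. A temporary file with placeholder bytes stands in for a real photograph; reading the resolution would need an image library and is not shown.

```python
import os
import tempfile

# Placeholder file standing in for a photograph (contents are arbitrary).
with tempfile.NamedTemporaryFile(delete=False) as f:
    f.write(b"hello big data")
    path = f.name

# The metadata describes the file without being part of its contents.
info = os.stat(path)
metadata = {"size_bytes": info.st_size, "modified": info.st_mtime}
os.remove(path)

print(metadata["size_bytes"])  # 14
```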
9
Q
Which has a higher noise-to-signal ratio: structured, unstructured, or semi-structured data?
A
semi-structured and unstructured data