4.11.1 Big Data Flashcards

1
Q

What is Big Data?

A

A catch-all term for data that won’t fit the usual containers, typically because of its volume, variety or velocity.

2
Q

Why is the lack of structure of big data such an issue?

A

Analysing the data is made significantly more difficult.

Relational databases are not appropriate, because they require the data to fit into a row-and-column format.

3
Q

What is big data in terms of volume?

A

Volume: the sheer amount of data is on a very large scale, too big to fit onto a single server. This matters because relational databases don’t scale well across multiple machines.

4
Q

What is big data in terms of variety?

A

Variety: the data being collected is wide-ranging, varied and may be difficult to classify. It comes in many forms, such as structured, unstructured, text and multimedia.

5
Q

What is big data in terms of velocity?

A

Velocity: the data changes quickly and may come from constantly changing data sources. There may be a large degree of latency in processing it.

6
Q

What happens when data is too big to fit on one server?

A

The processing must be distributed across more than one machine.

Functional programming is one solution: it makes it easier to write correct and efficient distributed code, because functions with no side effects and no shared mutable state can run safely across multiple servers.
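As a rough illustration of why the functional style suits distributed processing, here is a minimal Python sketch of a map-then-reduce word count. The chunk strings and the function names `count_words` and `merge_counts` are made up for this example; each chunk stands in for the part of a dataset held on one server. Everything runs on one machine here, so it only shows the shape of the idea, not a real distributed system.

```python
from collections import Counter
from functools import reduce


def count_words(chunk: str) -> Counter:
    """Pure function: the result depends only on the input chunk."""
    return Counter(chunk.lower().split())


def merge_counts(a: Counter, b: Counter) -> Counter:
    """Combine two partial results into a new Counter; neither input is changed."""
    return a + b


# Hypothetical data: each string represents the data stored on one server.
chunks = ["big data big", "data is varied", "big volume"]

# Map step: each chunk is processed independently, so this could run in parallel.
partial_counts = list(map(count_words, chunks))

# Reduce step: the partial results are combined into one answer.
total = reduce(merge_counts, partial_counts, Counter())
```

Because `count_words` never touches shared state, it does not matter which machine handles which chunk or in what order the chunks finish.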

7
Q

What is structured / unstructured data?

A

Structured data - data that fits into a standard database structure of columns and rows (fields and records).

Unstructured data - data that doesn’t fit into a standard database structure of columns and rows (fields and records).
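To make the difference concrete, here is a small Python sketch with invented sample data. The structured text has named fields, so a value can be looked up by column; the unstructured sentence carries similar facts but has no fields to query.

```python
import csv
import io

# Structured: every record has the same fields (hypothetical sample data).
structured = "id,name,age\n1,Ada,36\n2,Alan,41\n"
rows = list(csv.DictReader(io.StringIO(structured)))

# Because the data has columns, a field can be read directly by name.
ages = [int(row["age"]) for row in rows]

# Unstructured: similar information as free text, with no rows or columns.
unstructured = "Ada, who is 36, emailed Alan (41) a photo yesterday."
```

Extracting the ages from `unstructured` would need text analysis tailored to that sentence, which is why unstructured data is harder to analyse.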
