A215 - Big Data Flashcards

1
Q

Big Data can be described in terms of 3 Vs. List all 3 and define each one.

A
  • Volume - Capacity to store it exceeds a single server.
  • Velocity - Data is produced/processed at high speeds.
  • Variety - Data can appear in many different forms.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

What is Big Data?

A

A large quantity of unstructured data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

In a fact-based model what is the definition of a fact?

A

A fact is a piece of data that cannot be deconstructed further and is timestamped.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 3 components of graph schema?

A
  • Nodes - Represent core entities
  • Edges - Represent connections between entities
  • Properties - Represent information about entities
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Describe the 3 characteristic of functional programming that make it good for Big Data.

A
  • Immutable data structures - Elements cannot be changed, removed, or added. Adding items creates a new list, leaving the original unchanged.
  • Statelessness - Program has no subroutines or variables so execution is very deterministic and can be replicated on many different systems with identical results.
  • Higher-order functions - Functions can be run in parallel systems safely because the process doesn’t disturb other parts of the dataset.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly