A215 - Big Data Flashcards
1
Q
Big Data can be described in terms of 3 Vs. List all 3 and define each one.
A
- Volume - Capacity to store it exceeds a single server.
- Velocity - Data is produced/processed at high speeds.
- Variety - Data can appear in many different forms.
1
Q
What is Big Data?
A
A large quantity of unstructured data
2
Q
In a fact-based model what is the definition of a fact?
A
A fact is a piece of data that cannot be deconstructed further and is timestamped.
3
Q
What are the 3 components of graph schema?
A
- Nodes - Represent core entities
- Edges - Represent connections between entities
- Properties - Represent information about entities
4
Q
Describe the 3 characteristic of functional programming that make it good for Big Data.
A
- Immutable data structures - Elements cannot be changed, removed, or added. Adding items creates a new list, leaving the original unchanged.
- Statelessness - Program has no subroutines or variables so execution is very deterministic and can be replicated on many different systems with identical results.
- Higher-order functions - Functions can be run in parallel systems safely because the process doesn’t disturb other parts of the dataset.