Big Data Flashcards
1
Q
Big data
A
A term used to describe data that doesn’t fit the usual containers
2
Q
Features of big data
A
- Volume
- Velocity
- Variety
3
Q
Volume
A
- There is too much data to fit on a single server
- Data must be stored over multiple servers
4
Q
Velocity
A
- Data on the servers is created and modified rapidly
- Response times must be in milliseconds
5
Q
Vareity
A
Data held on the servers consist of many different types of data
6
Q
Analyzing big data
A
There is a lack of structure, requiring machine learning techniques to discern patterns
7
Q
Functional programming in big data
A
Functional programming is used as:
- The programs are stateless (no side effects)
- Data structures are immutable
- Supports higher order functions
8
Q
Face-based model
A
- Each individual piece of information is stored as a fact
- Facts are immutable, overwritten and stored with a timestamp
- Reduces the risk of losing data
- Does not require an index, new data is simply appended to the data set
9
Q
Graph schema for big data
A
Uses graphs consisting of nodes (entities) and edges (relationships) to represent a dataset
10
Q
A