Big Data Flashcards
Big Data definition
Data too large or complex to store and process with conventional tools, such as a single relational database
Three defining features of BD
Volume
Velocity
Variety
Volume
Too much data for a single server
Velocity
Data is created and modified rapidly, so servers need to be updated frequently
Variety
Many different types of data, e.g. text, multimedia
Hardest thing about Big Data and why?
Lack of structure
Hard to analyse: unstructured data doesn't conform to the row-column layout of a relational database
Hence ML methods are needed to extract useful information
Functional Programming
Functional programs use immutable data structures
Makes it easier to write efficient code that runs across multiple servers, because data that never changes can be shared without locking
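Example (a minimal sketch, not taken from the course material): the tuple `readings` is immutable and `chunk_sum` is a pure function, so the chunks can be handed to separate workers without any risk of one worker changing another's data. The names `readings`, `chunk_sum` and `parallel_sum` are made up for illustration, and local threads stand in for multiple servers.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Tuple

# Immutable input: a tuple cannot be modified after it is created.
readings: Tuple[int, ...] = (3, 1, 4, 1, 5, 9, 2, 6)


def chunk_sum(chunk: Tuple[int, ...]) -> int:
    """Pure function: the result depends only on the argument."""
    return sum(chunk)


def parallel_sum(data: Tuple[int, ...], n_chunks: int = 2) -> int:
    """Split the data into chunks and sum each chunk in its own worker."""
    size = max(1, -(-len(data) // n_chunks))  # ceiling division, at least 1
    chunks = [data[i:i + size] for i in range(0, len(data), size)]
    with ThreadPoolExecutor() as pool:
        # Workers only read their chunk, so no locks are needed.
        return sum(pool.map(chunk_sum, chunks))


total = parallel_sum(readings)
```

Because nothing is mutated, the answer is the same however the data is split.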
A fact-based model contains…
…a piece of information and a timestamp
Why include a timestamp in a fact-based model?
So the most recent fact can be identified and retrieved
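Example (a hedged sketch under assumed names): each fact is an immutable record holding one piece of information plus a timestamp, and a lookup returns the fact with the largest timestamp. The `Fact` fields, the sample data and the `latest` helper are hypothetical, chosen only to illustrate the idea.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)  # frozen=True makes each fact immutable
class Fact:
    entity: str      # what the fact is about (hypothetical field)
    attribute: str   # which property it describes (hypothetical field)
    value: str       # the piece of information
    timestamp: int   # when the fact was recorded


# New facts are appended; old facts are never edited.
facts = (
    Fact("user:1", "city", "Leeds", 100),
    Fact("user:1", "city", "York", 250),
    Fact("user:2", "city", "Bath", 180),
)


def latest(all_facts: Iterable[Fact], entity: str, attribute: str) -> Optional[Fact]:
    """Return the most recent matching fact, or None if there is none."""
    matching = [
        f for f in all_facts
        if f.entity == entity and f.attribute == attribute
    ]
    return max(matching, key=lambda f: f.timestamp, default=None)


current = latest(facts, "user:1", "city")
```

Here `current` is the "York" fact, because its timestamp (250) is later than the "Leeds" fact's (100).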