Big Data Flashcards
What are the three Vs of Big Data.
Define them
Volume - There is too much data to fit on a single server
Velocity - The Data on the servers is created and modified rapidly, needing to respond to changing data in milliseconds
Variety - The data held on the server consists of many different types of media e.g. multimedia files, unstructured data, text
How is Big Data stored and why can’t it be stored on a conventional database?
Big Data is stored over multiple servers, across multiple machines .
Conventional databases require data to conform to a row and column structure and do not scale well across multiple servers.
Why is functional programming used for Big Data across multiple machines?
Functional programs are stateless so they have no side effects and they make use of immutable data structures. This makes it easier to write correct, efficient distributed code.