Topic 11 - Big data - Complete Flashcards
Define volume?
Organisations gather and store large amounts of data, the amount of data collected determines whether or not it is big data.
Define velocity?
Data streams are collected in a near-to-real-time fashion, making the processing of this data a challenge.
Define variety in terms of data type?
Data comes in a wide range of formats, such as text, audio, video, images and unstructured and structured data.
Define structured data?
Data that is appropriate to be entered into a relational database in a row and column format.
Define unstructured data?
Data that is difficult to organise and is not appropriate to store in a database in row and column format.
Define distributed processing?
When data is too big to be processed on a single machine, the processing is distributed across several machines.
Define functional programming?
Is a paradigm based on the use of functions to determine variables, where functions use expressions similar to mathematical expressions to calculate the value of a variable.
Define a function?
Is a subroutine that carries out a set of instructions and returns a value.
Define statelessness?
Each function is calculated with no reliance on any data other than which was passed as functions input.
Define higher-order functions?
Are functions that either take other functions as an input or return a function as an output.