Chapter 22 - Big Data Flashcards
What is big data?
Big data are extremely large collections of data that may be analysed to reveal patterns and trends, especially relating to human behaviour
What characteristics does big data have?
Volume
Large amount of data, more than can be easily handled by a single computer
Velocity
Data arrives continuously and often has to be processed very quickly to yield useful results
Variety
Disparate, non-uniform data of different sizes, sources and shapes arrives irregularly: some from internal sources and some from external sources, some structured but much of it unstructured
What is the fourth V?
Veracity
Is the data true? Can it be relied upon?
What is data mining?
Analysing data to identify patterns and establish relationships such as associations, sequences and correlations
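As a minimal sketch of finding associations, the code below counts how often pairs of items appear together in a set of shopping baskets. The basket data and the function name pair_support are invented for illustration; real data mining tools use more sophisticated measures than a raw count.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets, invented for illustration.
baskets = [
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "tea"},
    {"bread", "butter", "tea"},
]

def pair_support(baskets):
    """Count how many baskets contain each pair of items."""
    counts = Counter()
    for basket in baskets:
        # Sort so that each pair is always stored in the same order.
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return counts

support = pair_support(baskets)
most_common_pair, occurrences = support.most_common(1)[0]
print(most_common_pair, occurrences)
```

A pair that appears together in many baskets suggests an association worth investigating, although co-occurrence alone does not show that one purchase causes the other.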
What is predictive analytics?
A type of data mining which aims to predict future events, such as the chance of someone being persuaded to upgrade a flight
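One very simple way to picture this is to estimate the chance of a customer accepting an offer from how often similar customers accepted it in the past. The sketch below assumes a hypothetical list of past records grouped by customer segment; it is a frequency estimate for illustration only, not a full predictive model.

```python
from collections import defaultdict

# Hypothetical past records: (customer segment, accepted the offer?)
history = [
    ("frequent", True), ("frequent", True), ("frequent", False),
    ("occasional", False), ("occasional", False),
    ("occasional", True), ("occasional", False),
]

def acceptance_rates(records):
    """Return the fraction of accepted offers for each segment."""
    totals = defaultdict(int)
    accepted = defaultdict(int)
    for segment, said_yes in records:
        totals[segment] += 1
        if said_yes:
            accepted[segment] += 1
    return {segment: accepted[segment] / totals[segment] for segment in totals}

rates = acceptance_rates(history)
print(rates)
```

The rate for a segment can then be read as a rough estimate of how likely a new customer in that segment is to accept the same offer.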
What is text analytics?
Scanning text such as emails and Word documents to extract useful information
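A minimal sketch of this idea is to count the most frequent keywords across a set of messages. The sample messages, the stop-word list and the function name keyword_counts are assumptions made for illustration; real text analytics usually goes further, for example by handling word stems and sentiment.

```python
import re
from collections import Counter

# Hypothetical customer emails, invented for illustration.
emails = [
    "The delivery was late and the box was damaged.",
    "Great service, delivery arrived early!",
    "Refund requested: item damaged on delivery.",
]

# Common words that carry little meaning on their own.
STOP_WORDS = {"the", "and", "was", "on", "a"}

def keyword_counts(documents):
    """Count how often each non-stop word appears across all documents."""
    counts = Counter()
    for text in documents:
        words = re.findall(r"[a-z']+", text.lower())
        counts.update(word for word in words if word not in STOP_WORDS)
    return counts

counts = keyword_counts(emails)
print(counts.most_common(2))
```

Here the most frequent keywords hint at what customers are writing about, which is the kind of useful information text analytics aims to extract.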
What is statistical analytics?
Used to identify trends, correlations and changes in behaviour
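As an illustration of measuring a correlation, the sketch below computes the Pearson correlation coefficient between two series. The advertising and sales figures are invented, and the helper name pearson is an assumption; a value close to 1 indicates that the two series tend to rise together.

```python
from math import sqrt
from statistics import mean

# Hypothetical monthly figures, invented for illustration.
ad_spend = [10, 20, 30, 40, 50]
sales = [12, 25, 31, 48, 55]

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length series."""
    mean_x, mean_y = mean(xs), mean(ys)
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return covariance / (spread_x * spread_y)

r = pearson(ad_spend, sales)
print(round(r, 3))
```

A strong correlation like this shows that the two quantities move together, but it does not by itself prove that one causes the other.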
What are the dangers of big data?
Cost
Regulation
Loss and theft of data
Incorrect data (veracity)
Employee monitoring - snooping on employees