Big Data Flashcards
What are the 5 Vs of big data?
Volume, Velocity, Variety, Veracity and Value
What is Big Data?
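Big Data refers to datasets so large, fast-growing or varied that traditional tools cannot store or process them efficiently; much of it is collected from users by companies and organisations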
What is Big Data used for?
Big Data is used by companies and organisations to improve their services and generate more profit
What are the dangers of Big Data?
It may contain errors, and it may be stolen by hackers and used for malicious purposes
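How much data is generated per month, according to a commonly quoted estimate?
About 40 exabytes. This figure is often attributed to a single user, but a total of that size can only describe many users combined, since 1 exabyte is a billion gigabytes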
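To get a feel for the size of an exabyte, the short sketch below converts 40 exabytes into gigabytes and into 1 TB drives. It assumes decimal (SI) units, and the helper name `exabytes_to` is made up for this illustration.

```python
# Decimal (SI) unit sizes in bytes; an assumption, binary units would differ slightly.
BYTES_PER_GB = 10**9
BYTES_PER_TB = 10**12
BYTES_PER_EB = 10**18

def exabytes_to(unit_bytes, exabytes):
    """Convert a number of exabytes into a smaller unit (hypothetical helper)."""
    return exabytes * BYTES_PER_EB // unit_bytes

gigabytes = exabytes_to(BYTES_PER_GB, 40)        # 40 billion gigabytes
terabyte_drives = exabytes_to(BYTES_PER_TB, 40)  # 40 million 1 TB drives

print(f"40 EB = {gigabytes:,} GB")
print(f"40 EB would fill {terabyte_drives:,} drives of 1 TB each")
```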
What is volume?
The amount of data being generated
What is velocity?
The speed at which data is generated
What is veracity?
The trustworthiness of data being generated
What is variety?
The range of different types and formats of data, such as text, images, video and sensor readings
What is value?
How useful the data is to the company collecting it
What is a Big Data framework?
A set of software tools used to store, manage and analyse Big Data quickly, usually by spreading the work across many computers
What is an example of a Big Data framework?
Apache Hadoop
What are some of the difficulties of analysing Big Data?
It is very large in size and requires lots of computing power to analyse
What is parallel processing?
Parallel processing is where many computers work on different parts of one task at the same time
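The idea of splitting one task into parts that run at the same time can be sketched on a single machine. In the minimal example below, threads stand in for the separate computers, and the function names (`split_into_chunks`, `count_words`, `parallel_word_count`) and the sample lines are invented for illustration; a real Big Data framework would spread the chunks over many machines.

```python
from concurrent.futures import ThreadPoolExecutor

def split_into_chunks(items, n_chunks):
    """Split a list into at most n_chunks roughly equal parts."""
    size = max(1, -(-len(items) // n_chunks))  # ceiling division, at least 1
    return [items[i:i + size] for i in range(0, len(items), size)]

def count_words(lines):
    """The work done on one part of the task: count words in a chunk."""
    return sum(len(line.split()) for line in lines)

def parallel_word_count(lines, n_workers=4):
    """Count words by giving each worker a different chunk, then combining."""
    chunks = split_into_chunks(lines, n_workers)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partial_counts = list(pool.map(count_words, chunks))
    return sum(partial_counts)  # combine the partial results into one answer

lines = [
    "big data is big",
    "data moves fast",
    "hadoop stores data",
    "nodes share work",
]
total = parallel_word_count(lines, n_workers=2)
print(total)
```

Each worker only ever sees its own chunk, so the final step of adding the partial counts together is what turns the separate pieces back into one result.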
What is distributed data?
It is where data is split up and stored across multiple computers, known as nodes
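One simple way to decide which node holds which record is to hash each record's key. The sketch below is an assumption for illustration only and is not how any particular framework places data; the names `node_for_key` and `distribute` and the sample records are hypothetical, and each "node" is just a list in memory.

```python
import hashlib

def node_for_key(key, n_nodes):
    """Pick a node for a key using a stable hash (same key, same node)."""
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    return int(digest, 16) % n_nodes

def distribute(records, n_nodes):
    """Place each (key, value) record on one of n_nodes simulated nodes."""
    nodes = {i: [] for i in range(n_nodes)}
    for key, value in records:
        nodes[node_for_key(key, n_nodes)].append((key, value))
    return nodes

records = [("alice", 3), ("bob", 5), ("carol", 2), ("dave", 7)]
nodes = distribute(records, n_nodes=3)

for node_id, stored in nodes.items():
    print(node_id, stored)
```

Because the hash is deterministic, any computer can work out where a record lives without asking a central index, which is one reason hashing is a common choice for this kind of placement.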