what is 'Big Data' and outline its three basic characteristics Flashcards
what is big data?
extremely large and complex datasets that cannot be easily managed or analyzed using traditional data processing techniques
what are the data sets of ‘Big Data’ characterised by?
- volume
- velocity
- variety
what does big data involve? (Volume)
vast amounts of data, often ranging from terabytes to petabytes and beyond.
what can this abundance of data come from? (volume)
from various sources such as social media interactions, sensors, transaction records and multimedia content e.g Facebook generates petabytes of data each day
what does the big data that is generated and collected at high speeds require? (velocity)
real-time or near-real-time processing and analysis
what does the velocity aspect refer to? (velocity)
the rapid rate at which data is produced, streamed, or transmitted
what does big data encompass? (variety)
diverse types of data, including structured, semi-structured and unstructured data.
what does structured data follow? (variety)
a predefined format and is typically stored in databases
what does semi-structured data may have? (variety)
some organizational properties but lacks a strict schema
what does unstructured data lack?
a predefined data model and includes text documents, images, audio, and video files