Book - Chapter 1 intro to big data analytics Flashcards
What are the vs of big data
Volume. Velocity. Variety.
What is meta data
The minimum you should know about the data
What is paraders
How has the data been processed. What are the artefacts left in the data
What is velocity
It is speed
What are the three attributes that stand out of defining big data characteristics
Huge volume of data
Complexity of data types and structures
Speed of new date of creation and growth
What is huge volume of data
Rather than thousands of rows, big data can be billions of rows and millions of columns
What is complexity of data types and structures
It reflects the variety of new data sources, formats and structures, including digital traces been left on the web and other digital repositories for subsequent analysis
What is speed of new data creation and growth
If you describe high velocity data, the rapid data ingestion in near real-time analysis
What way is big data sometimes described as having
The big free v’s
What are the big three Vs
Volume, variety and velocity
Can big data be Efficiently analysed using only traditional database or methods
No it requires new tools and technologies to store, manage and realise the business benefits
What main two forms can big data come from
Structured and nonstructured
How is most of the big data formed
Usually unstructured or semistructured in nature Which requires different techniques and tools to process and analyse
Where does 80 to 90% of future data growth come from
Non-structured data types
What sort of data in addition could the RDBMS have
Quasi-or semistructured data, such as three form cell log information taking from an email ticket of the problem, customer chat history