T8. Data Mining Flashcards
What is Data Mining
Data mining involves analysing large data sets or big data to identify patterns and relationships to
predict future trends.
What is Big Data
Data sets that are so complex that traditional database and other applications are unable to capture,
manage and process them within an acceptable time frame.
What is the 3 V’s
Big data challenges can be defined as the 3V’s:
* Volume – the amount of data to be processed
* Variety – the number of types of data to be analysed
* Velocity – the speed of data processing
How can Big Data be used?
To predict health trends and forecast demand for services/resources
To optimise efficiency/effectiveness of treatments by analysing demographic
data/social indicators/lifestyle habits
To identify people suitable for screening by analysing medical records
Describe the threats to privacy of using Data Mining
Data mining analyses large data sets to discover patterns. The shopping habits and
preferences which is stored about a customer could be used for unauthorised purposes or
purposes for which the customer has not given permission or for purposes unknown to the
customer.