1 Flashcards
Is a term that describes the large amount of data-both structure and unstructured- that inundates a business on a day to day basis
Big data
Its refers to the amount of data being generated and in the age of big data more data is being generated every minute that ever before
Volume
Its refers to the speed of the data being generated and the rate at which it ‘s being processed in term ma of both collection analysis
Velocity
Its refers to the quality reliability or uncertainty of the data.
Veracity
Its refers to this broad range of different types of data can come form many different sources
Variety
Data mining phases
Data cleaning
Data integration
Data selection
Data transformation
Data mining
Pattern evaluation
Knowledge presentation
To remove noise and inconsistent data
Data cleaning
Write down at least five most common data mining tasks
Description task
Estimation task
Clustesting task
Classification task
Prediction task
A collection of interrelated data
Databases
It refers to extracting or mining knowledge from large amounts of data
Data mining
This refers to data that is so large fast or complex that its difficult or impossible to process using tradition method
Big data