Big Data Flashcards
Speed of big data generation
very fast (real time)
Big data formats
various (text, images, video, etc.)
Big data is normally ______ data
unstructured
What is big data used for?
- predict behaviours
- provide insights
- make better choices (informed decisions)
Abate information
Data -> information -> knowledge -> insight
(add context, add business context, understand the question)
How is big data stored?
data lake
How is big data transformed?
ELT (extract, load, transform)
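Illustration: a minimal ELT sketch, assuming an in-memory SQLite database stands in for the data store. The table names (raw_sales, sales_by_customer), the function run_elt and the sample rows are hypothetical. The point it shows is the ordering: raw data is loaded first, unchanged, and the transform runs afterwards inside the store.

```python
import sqlite3

# Hypothetical raw rows as extracted from a source system; amounts are still text.
RAW_ROWS = [("a", "10.5"), ("b", "4"), ("a", "2.5")]


def run_elt(raw_rows):
    """Load raw rows unchanged, then transform them inside the store."""
    conn = sqlite3.connect(":memory:")

    # Extract + Load: store the data as-is, with no cleaning or typing.
    conn.execute("CREATE TABLE raw_sales (customer TEXT, amount TEXT)")
    conn.executemany("INSERT INTO raw_sales VALUES (?, ?)", raw_rows)

    # Transform: performed after loading, using the store's own SQL engine.
    conn.execute(
        "CREATE TABLE sales_by_customer AS "
        "SELECT customer, SUM(CAST(amount AS REAL)) AS total "
        "FROM raw_sales GROUP BY customer"
    )

    rows = conn.execute(
        "SELECT customer, total FROM sales_by_customer ORDER BY customer"
    ).fetchall()
    conn.close()
    return rows


result = run_elt(RAW_ROWS)
print(result)
```

In ETL, by contrast, the casting and aggregation would happen before anything is written to the store.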
Prescriptive Analytics
Predicts and suggests what to do to make the most of an opportunity or avoid a risk
Predictive Analytics
Uses statistical models and historical data to forecast what is likely to happen in the future (helps react quickly to changes)
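Illustration: a minimal sketch of the idea, assuming a simple least-squares straight line is an adequate model. The function fit_line and the monthly sales figures are made up for the example. Real predictive models are usually richer, but the pattern is the same: fit on past data, then extrapolate.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical past data: sales for months 1 to 4.
months = [1, 2, 3, 4]
sales = [10.0, 12.0, 14.0, 16.0]

slope, intercept = fit_line(months, sales)

# Predict month 5 by extending the fitted trend.
forecast = slope * 5 + intercept
print(forecast)
```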
If you are building statistical models that find correlations and trends among data elements and data sets, which activity are you performing?
Develop Hypotheses & Methods
When Securing Big Data, what does recombination measure?
Measures the ability to reconstitute private and sensitive data
Common techniques used in data and text mining
Data reduction
Association
Profiling
Clustering
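Illustration: of the techniques listed, clustering is the easiest to show in a few lines. This is a minimal one-dimensional k-means sketch, assuming fixed starting centers and a fixed number of iterations. The function kmeans_1d and the sample values are hypothetical.

```python
def kmeans_1d(values, centers, iterations=10):
    """Group numbers around the nearest center, then move each center
    to the mean of its group; repeat a fixed number of times."""
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for v in values:
            # Assign the value to the closest current center.
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Recompute each center; keep the old one if its group is empty.
        centers = [
            sum(c) / len(c) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return centers, clusters


# Hypothetical data with two obvious groups.
values = [1, 2, 3, 10, 11, 12]
centers, clusters = kmeans_1d(values, centers=[0.0, 5.0])
print(centers, clusters)
```

No labels are supplied; the groups emerge from the data alone, which is what makes clustering an exploratory technique.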
You need to discover possible relationships or to show data patterns in an exploratory fashion when you do not necessarily have a specific question to ask. What kind of data tool would you use to identify patterns of data using various algorithms?
Data Mining