Big data Flashcards
What is big data
In the narrow sense, it refers to sample size
In the broad sense, it refers to both sample size and dimensionality
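A minimal sketch of the two terms, assuming the data is stored as a table with one observation per row and one variable per column. The values and the names `data`, `n`, and `p` are made up for illustration:

```python
# Each inner list is one observation (row); each position is one variable (column).
# The numbers are hypothetical and only illustrate the shape of the data.
data = [
    [34, 72.5, 1],
    [29, 65.0, 0],
    [41, 80.2, 1],
    [52, 77.1, 0],
]

n = len(data)      # sample size: number of observations
p = len(data[0])   # dimensionality: number of variables per observation

print(f"sample size n = {n}, dimensionality p = {p}")
```

Under this reading, data can be "big" because n is large, because p is large, or both.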
Structured data
Organized in rows and columns, as in a relational database
20% of enterprise data
requires less storage
easier to manage and protect
Unstructured data
80% of enterprise data
requires more storage
more difficult to manage and protect
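The contrast between the two kinds of data can be sketched as follows. The order records, the message text, and all variable names are hypothetical. The regular expression is just one simple way to pull numbers out of free text, not a general extraction method:

```python
import re

# Structured: fixed columns, every row has the same fields (like a relational table).
orders = [
    {"order_id": 1, "customer": "Ana", "amount": 40.0},
    {"order_id": 2, "customer": "Ben", "amount": 15.5},
]
# Fields can be addressed directly by name.
total_structured = sum(row["amount"] for row in orders)

# Unstructured: free text with no schema; values must be extracted before use.
message = "Hi, this is Ana. I paid 40.00 for order 1 and Ben paid 15.50 for order 2."
# Assumption: amounts are the only numbers written with a decimal point.
amounts = [float(m) for m in re.findall(r"\d+\.\d+", message)]
total_unstructured = sum(amounts)

print(total_structured, total_unstructured)
```

The extra parsing step, and the assumption it depends on, is one reason unstructured data is harder to manage.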
4 characteristics of big data
Volume: data at rest
Velocity: data in motion
Variety: data in many forms
Veracity: data in doubt
Data extraction
Separating the signal from the noise
4 big data benefits for business
Diverse use cases for data sets
Data-driven insights
Better customer insights
Increased market intelligence
Big data barriers
Data policies
Skills
Tools and technologies
Data management
Organizational change
Big data myths
Only for internet-age companies
No need for RDBMS
All about data science
Machine learning (Turing's proofs)
All rely on the development of typewriter-like computing machines that must obey a set of rules, and on his subsequent development of a universal computing machine
Machine learning in short
Supervised machine learning: classification
Unsupervised machine learning: clustering
Reinforcement machine learning: robotics
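The difference between the first two types can be sketched in plain Python. This is an illustration under assumptions, not a reference implementation: the points and labels are made up, the classifier is a simple nearest-centroid rule, and the clustering is a basic k-means with a naive initialization. Reinforcement learning is not shown.

```python
from statistics import mean

def centroid(points):
    """Mean position of a list of 2-D points."""
    return (mean(p[0] for p in points), mean(p[1] for p in points))

def dist2(a, b):
    """Squared Euclidean distance between two 2-D points."""
    return (a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2

# Supervised (classification): labels are given; learn one centroid per label.
def fit_classifier(labeled):
    by_label = {}
    for point, label in labeled:
        by_label.setdefault(label, []).append(point)
    return {label: centroid(pts) for label, pts in by_label.items()}

def classify(model, point):
    """Return the label whose centroid is closest to the point."""
    return min(model, key=lambda label: dist2(model[label], point))

# Unsupervised (clustering): no labels; group the points with k-means.
def kmeans(points, k, iterations=10):
    centers = list(points[:k])  # naive initialization: the first k points
    clusters = []
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: dist2(centers[i], p))
            clusters[nearest].append(p)
        # Keep the old center if a cluster ends up empty.
        centers = [centroid(c) if c else centers[i] for i, c in enumerate(clusters)]
    return clusters

# Hypothetical training data: (point, label) pairs.
labeled = [
    ((1.0, 1.0), "small"),
    ((1.5, 2.0), "small"),
    ((8.0, 8.0), "large"),
    ((9.0, 9.5), "large"),
]
model = fit_classifier(labeled)
prediction = classify(model, (2.0, 1.0))

# The same points without their labels.
unlabeled = [point for point, _ in labeled]
clusters = kmeans(unlabeled, k=2)

print(prediction)
print(clusters)
```

The classifier needs the labels to learn from, while k-means finds the two groups from the positions alone.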
Data science
Enables businesses to process huge amounts of structured and unstructured data to detect patterns.
Widely used in sport, government, healthcare, and the financial sector.
Data analysis
Data explosion: unstructured data doubling every three months
Data-led innovation: decoupling data from apps
Technology: commodity priced storage
Monetization: Growth of enterprise data
Social media: Emergence of companies that scrub data from social media
Data mobilization: Novel approaches to analyze unstructured data
Transformation journey
- Turning business needs into VC
- Designing big data architecture
- Integrating big data with existing apps
- Working on data quality
- Turning design into code
- Training users