Big Data and Data Analytics Flashcards
1
Q
What is the definition of Big Data?
A
- Extremely large data sets
- too complex for traditional tools
- analyzed computationally to reveal patterns and trends
2
Q
Who coined the term “Big Data”?
A
John Mashey in the 1990s
3
Q
What are the 4 V’s of Big Data?
A
- Volume: large amount of data generated and stored
- Velocity: high speed and continuous flow
- Variety: structured, semi-structured, unstructured data
- Veracity: accuracy and reliability of data
4
Q
What is structured data?
A
- conforms to a data model or schema
- stored in tabular form
5
Q
Give an example of semi-structured data.
A
- Data with tags or markers
- JSON or XML
- don’t fit rigid structures.
6
Q
What percentage of data is estimated to be unstructured?
A
90%
7
Q
What is Data Analytics?
A
Technologies that turn raw data into insights for decision-making.
8
Q
What are the four types of data analytics?
A
- Descriptive: what has happened
- Diagnostic: why it happened
- Predictive: what will happen in the future
- Prescriptive: what is recommended
8
Q
How does the Project Green Light initiative use data?
A
- uses data from google maps
- measures traffic flow
- analyzes patterns to improve urban traffic management