Introduction to Data Ecosystem Flashcards
What is data?
A collection of facts, figures, and statistics that can exist in various forms like numbers, text, sound, and images.
What is the difference between structured and unstructured data?
Structured Data: Organized format (e.g., spreadsheets, databases).
Unstructured Data: No predefined format (e.g., videos, emails, social media posts).
What are the three versions of data?
Raw Data – Unprocessed facts and figures.
Information – Processed and organized data.
Insights – Actionable conclusions derived from information.
What are the four Vs of data?
Volume – The amount of data.
Velocity – Speed of data generation.
Variety – Different forms of data.
Value – The usefulness of data.
What is data science?
The study of data to extract meaningful insights using mathematics, statistics, AI, and computer science.
What are the four key questions that data scientists answer?
What happened?
Why did it happen?
What will happen in the future?
What actions can be taken?
Name at least five industries that use data science.
Ecommerce, Healthcare, Finance, Social Media, Manufacturing, Agriculture, Transportation.
How does Uber use data science?
Uber uses demand forecasting to predict ride demand based on time, location, and historical data.
How does Amazon’s recommendation system work?
Uses AI and machine learning to analyze customer behavior and suggest relevant products.
Why is data visualization important?
Helps interpret and present data efficiently using graphs, charts, and dashboards.
What are the types of data and define?
- Qualitative: non-numeric in nature. For example, gender, product category, characteristics of bird
- Quantitative: numerical values that can be measurable. We can perform mathematical operations to this type of data. For example, product cost, weather temperature etc.
What are the type of qualitative data?
Nominal: Classifies data on basis of characteristics, no inherent sequence or hierarchy, e.g. hair colour
Ordinal: Classifies data containing order/rand, e.g. Bachelors, Masters, Doctarate
What are the type of quantitative data?
Discrete: Takes specified distinct value, e.g. pencil in box
Continous: Takes any value within specified range, e.g. Temperature.