Domain 3: Processing Flashcards
1
Q
What is the main goal of the Processing phase of Data Analytics?
A
- To transform the data and make it more consumable by the analytics and visualization tools.
- Choosing the right ETL tool based on workload requirements, performance, cost, and orchestration needs
2
Q
What are the goals of ETL?
A
ETL helps you transform, flatten, and enrich your data from source to target. The goal is to:
- Handle terabytes and petabytes of data
- Simplify querying disparate data sets
- Combine existing and legacy data with modern data sets
- Prepare data for machine learning
3
Q
What are some of the issues you face with ETL?
A
- Data volumes may grow
- Additional data sources may be added
- A large proportion of ETL is hand-coded
- Data formats may change over time
- Target schemas may change