Lecture 5: Big Data Architecture Flashcards
1
Q
Core Principles of Big Data Architecture
A
- Scalability
- Fault-tolerance
- Optimization
- Modularity
2
Q
Lambda Architecture
A
- Batch layer: processes data in batches
- Speed layer: processes data in real-time
- Serving layer: merges outputs from batch and speed layer
3
Q
Kappa Architecture
A
- Stream processing layer: handles all data processing in real-time without a separate batch layer
- Storage layer: stores processed data for querying and analytics
4
Q
Data Lakehouse
A
- Combines the flexibility of data lakes with the performance and structure of data warehouses
- Unified storage layer: structured and unstructured data
- Query engine: allows for querying raw data
5
Q
Data Mesh
A
- Domain-oriented decentralization: each domain team is responsible for its data
- Self-service data infrastructure: provides tools and frameworks that allow teams to manage and use data independently
6
Q
Medallion Architecture
A
- Bronze layer: raw data
- Silver layer: filtered, cleaned, augmented data
- Gold: business-level data