Lecture 5: Big Data Architecture Flashcards

1
Q

Core Principles of Big Data Architecture

A
  • Scalability
  • Fault-tolerance
  • Optimization
  • Modularity
2
Q

Lambda Architecture

A
  • Batch layer: processes data in batches
  • Speed layer: processes data in real-time
  • Serving layer: merges outputs from the batch and speed layers
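
A minimal sketch of how the three layers fit together, assuming a page-view counting job. The function names (`batch_view`, `speed_view`, `serve`) and the event shape are hypothetical and chosen only for illustration; real systems would use a batch engine and a stream processor instead of in-memory lists.

```python
from collections import Counter


def batch_view(events):
    """Batch layer: recompute counts from the full historical dataset."""
    return Counter(e["page"] for e in events)


def speed_view(recent_events):
    """Speed layer: count only events that arrived after the last batch run."""
    return Counter(e["page"] for e in recent_events)


def serve(batch, speed):
    """Serving layer: merge both views to answer a query."""
    return batch + speed


# Hypothetical data: three events already covered by the batch run,
# one event that arrived afterwards.
historical = [{"page": "home"}, {"page": "home"}, {"page": "docs"}]
recent = [{"page": "home"}]

merged = serve(batch_view(historical), speed_view(recent))
```

The merged view reflects both the older batch results and the newer events that the batch layer has not yet processed.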
3
Q

Kappa Architecture

A
  • Stream processing layer: handles all data processing in real-time without a separate batch layer
  • Storage layer: stores processed data for querying and analytics
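
A minimal sketch of the idea, assuming an append-only event log and a running total per user. The `process` function and the event fields are hypothetical. The point it illustrates is that one code path handles both live events and a full replay of the log, so no separate batch job is needed.

```python
def process(log, state=None):
    """Stream processing layer: fold events into state one at a time."""
    state = {} if state is None else state
    for event in log:
        user = event["user"]
        state[user] = state.get(user, 0) + event["amount"]
    return state


# Hypothetical append-only log of events.
log = [
    {"user": "a", "amount": 5},
    {"user": "b", "amount": 2},
    {"user": "a", "amount": 1},
]

# Live operation: events are processed as they arrive.
store = process(log[:2])
store = process(log[2:], store)

# Reprocessing: replay the whole log through the same code path.
rebuilt = process(log)
```

Here `store` plays the role of the storage layer, and replaying the log rebuilds the same state that incremental processing produced.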
4
Q

Data Lakehouse

A
  • Combines the flexibility of data lakes with the performance and structure of data warehouses
  • Unified storage layer: holds both structured and unstructured data
  • Query engine: allows for querying raw data
5
Q

Data Mesh

A
  • Domain-oriented decentralization: each domain team is responsible for its data
  • Self-service data infrastructure: provides tools and frameworks that allow teams to manage and use data independently
6
Q

Medallion Architecture

A
  • Bronze layer: raw data
  • Silver layer: filtered, cleaned, augmented data
  • Gold layer: business-level data
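
A minimal sketch of the three layers, assuming order records ingested as raw strings. The field names and the `to_silver` / `to_gold` helpers are hypothetical; a real pipeline would typically run these steps as table-to-table transformations rather than on Python lists.

```python
# Bronze layer: raw data exactly as ingested (all strings, some bad values).
bronze = [
    {"order_id": "1", "region": " EU ", "amount": "10.5"},
    {"order_id": "2", "region": "us", "amount": "n/a"},
    {"order_id": "3", "region": "eu", "amount": "4.5"},
]


def to_silver(rows):
    """Silver layer: drop unparseable rows, fix types, normalize values."""
    silver = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # filter out rows whose amount cannot be parsed
        silver.append(
            {
                "order_id": int(row["order_id"]),
                "region": row["region"].strip().upper(),
                "amount": amount,
            }
        )
    return silver


def to_gold(rows):
    """Gold layer: business-level aggregate, here revenue per region."""
    totals = {}
    for row in rows:
        totals[row["region"]] = totals.get(row["region"], 0.0) + row["amount"]
    return totals


silver = to_silver(bronze)
gold = to_gold(silver)
```

Each layer reads only from the one before it, so the raw bronze records stay available if the cleaning or aggregation rules change later.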