Lecture 3/4: Big Data Fundamentals Flashcards

1
Q

5 V’s of Big Data

A
  • Volume
  • Variety
  • Velocity
  • Veracity
  • Value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Distributed Storage Systems

A
  • HDFS
  • Cloud storage
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Data Processing Frameworks

A
  • MapReduce
  • Apache Spark
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Real-Time Streaming Technologies

A
  • Kafka
  • Storm
  • Flink
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

ETL Pipelines

A
  • Extract, transform, load (ETL) processes used to integrate data from different sources into a centralized repository
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Challenges with Big Data (Lecture)

A
  • Scalability
  • Integration
  • Quality and consistency
  • Security and privacy
  • Real-time processing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly