Apache Kafka Flashcards

1
Q

What is Apache Kafka?

A

Apache Kafka is a popular distributed event streaming platform designed to efficiently manage real-time data feeds.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define [Distributed] in relation to an Event Streaming Platform.

A

Distributed refers to Kafka running on multiple servers (a cluster) instead of a single machine. Allows for High Availability and Scalability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define Event Streaming.

A

Event: A record of something that happened (“user clicked a button”)

Streaming: Events are continuously produced, processed, and consumed in real time.

Kafka ingests, stores, and processes these events efficiently, making it ideal for:

Real-time analytics (tracking user activity)
Message Queuing (connecting microservices)
Log collection (system monitoring)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Message Queuing?

A

Message Queuing is a method where microservices communicate asynchronously by sending and receiving messages through a queue.

A message broker (Apache Kafka, RabbitMQ, AWS SQS) acts as an intermediary, ensuring messages are delivered reliably between services without direct dependencies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Explain how Message Queuing works.

A
  1. A Producer (Sender) Service creates a message (“Ordered Placed: OrderID 1234”). Sends it to a message queue (Kafka Topic).
  2. Message Broker (Queue System). Stores message temporarily until a consumer retrieves them. Ensures reliable delivery, even if the consumer is offline.
  3. Consumer (Receiver) Service. Subscribes to a queue (Kafka Topic). Processes the message when it’s available (“Prepare shipping for OrderID 1234”)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Advantages of Kafka? (4)

A
  1. Real-Time Data Processing: Kafka allows for the processing of real-time data streams, enabling businesses to make decisions quickly.
  2. Scalability: Kafka is highly scalable and can manage a large volume of data without impacting performance.
  3. Fault Tolerance: Kafka is fault-tolerant, ensuring the data is not lost even in case of hardware failure.
  4. High Throughput: Kafka can process a large amount of data with low latency, making it suitable for applications that require real-time processing.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define Kafka Clusters.

A

Kafka Clusters are distributed systems that consist of multiple Kafka brokers working together to handle and process real-time data streams.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define Brokers.

A

Brokers are the core of the Kafka cluster. They receive messages from producers, store them in partitions, and deliver them to consumers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define Topics.

A

Topics are the channels through which data is organized and categorized. They can be divided into multiple partitions for better scalability and performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define Partition.

A

Partitions are the fundamental unit of data storage in Kafka. Topics that are divided into multiple partitions are distributed across the brokers in the cluster.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define Producers.

A

Producers help in publishing data to Kafka topics. They send messages to specific topics within the Kafka cluster.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Define Consumers.

A

Consumers subscribe to topics and receive messages from them. They can process the received messages, store them, or perform other actions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Define Offsets.

A

Offsets are unique identifiers that represent the position of a message within a specific partition of a topic. They are crucial for tracking the progress of consumers within a topic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly