Kafka Flashcards

1
Q

Kafka difference # 1

A

It works as a modern distributed system that runs as a cluster and can scale to handle all the applications in even the most massive of companies. Rather than running dozens of individual messaging brokers, hand wired to different apps, this lets you have a central platform that can scale elastically to handle all the streams of data in a company.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Kafka difference # 2

A

Kafka is a true storage system built to store data for as long as you might like. This has huge advantages in using it as a connecting layer as it provides real delivery guarantees—its data is replicated, persistent, and can be kept around as long as you like.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Kafka difference # 3

A

The world of stream processing raises the level of abstraction quite significantly. Messaging systems mostly just hand out messages. The stream processing capabilities in Kafka let you compute derived streams and datasets dynamically off of your streams with far less code.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does DAG stand for?

A

Directed Acyclic Graph

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does DAG stand for?

A

Directed Acyclic Graph

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a Source Processor?

A

Sources are where information flows into the Kafka Streams application. Data is read from a Kafka topic and sent into one or more stream processors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a Stream Processor?

A

These processors are responsible for applying data processing/transformation logic on the input stream. In the high-level DSL, these processors are defined ising a set of built-in operators that are exposed by the Kafka Streams library.

Examples of operators are filter, map, flatmap, and join.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a Sink processor?

A

Sinks are where enriched, transformed, filtered, or otherwise processed records are written back to Kafka, either to be handled by another stream processing application or to be sent to a downstream data store via something like Kafka Connect. Like source processors, sink processors are connected to a Kafka topic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

3 kinds of processors in Kafka

A
  1. Source processors
  2. Stream processors
  3. Sink processors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Parallelise

A

Adapt for running on a parallel processing system.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly