Introduction Flashcards
What is a paradigm shift that has been happening in the latest years (according to confluence)?
A shift from state based system to even driven systems.
What’s the basic analogy of what kafka is?
It’s a distributed log storage
How many companies use kafka worldwide?
35% of the Fortune 500’s.
What is the basic use case for kakfa?
Processing large stream of events in event driven systems.
What is a producer in kakfa?
And application that gets data into the cluster.
What is a Broker in kafka?
A broker is an individual “node” in a kafka cluster.
What is a consumer in kakfa?
An application that processes the data from the cluster.
Can a consumer also be a producer?
Yes. An application can be both and generate events as much as consume it.
Can a consumer also be a producer?
Yes. An application can be both and generate events as much as consume it.
What is the zookeeper ensamble?
It’s a small cluster responsible for keeping consensus and cluster status data for the kakfa cluster.
Will zookeeper be used in the future?
Since version 2.8, it’s been available as an option to self manage the cluster.
What is a topic in kafka?
Is a storage of related events (like a log).
Is there a limit on the number of topics?
There’s not theoretical limit on number of topics but a practical limit on the number of partitions.
What is a partition in kafka?
Partitions are the blocks of data that compose a topic and can be stored into different brokers for the same of durability and replication.
How is data written to topics?
Data is always appended to the end of the “log”/topic and is immutable.
What are the semantics of writing data to kafka?
- You always write at the end of the log
- Data is immutable
- Data can have expiration date
- Each event has a sequencial offset number
What are the semantics of reading data from kakfa?
- Reading doesn’t remove of destroy the data.
- Consumers read data independently reading from different offsets