Kafka Cluster Architectures and Administering Kafka Flashcards

Question

Altering a topic

Answer 1

sh kafka-topics.sh --zookeeper --alter --topic --partitions

Answer 2

No rebalancing does not occur when partitions are increased. It occurs when the no of consumers change.

Answer 3

The reasons can be to spread out the topic further or to decrease the throughput for a single partition.

Answer 4

No it is not possible to delete/reduce existing partition count as it will mean data loss. To reduce the partition delete the topic and recreate it.

Answer 5

sh kafka-topics.sh --zookeeper --topics --delete

Answer 6

No deleting a topic will delete all the messages and it is not reversible.

Answer 7

sh kafka-topics.sh --zookeeper --list

Answer 8

No it is marked for deletion and kafka runs a periodic job that deletes it.

Answer 9

sh kafka-topics.sh --zookeeper --describe --topic

Answer 10

sh kafka-topics.sh ... --under-replicated-partitions -> It will show all the partitions where one or more of the replicas for the partition are not in sync with the leader sh kafka-topics.sh ... --unavailablepartitions -> It will show all partitions without a leader means that the partition is unavailable for produce or consume clients

Answer 11

For older consumers the information is stored in zookeeper and for new consumer the information is stored in brokers

Answer 12

sh kafka-consumer-groups.sh --zookeeper --list

Answer 13

sh kafka-consumer-groups.sh --bootstrapserver --new-consumer --list

Answer 14

sh kafka-consumer-groups.sh --zookeeper --describe --group

Answer 15

Deletion of consumer groups is only supported for old consumer clients. It will delete entire group from Zookeeper and all offsets of all the topics that group is consuming. In order to perform this operation all the consumers in the group have to shut down first.

Answer 16

sh kafka-consumer-groups.sh --zookeeper --delete --group --topic

Answer 17

``` There is no script to export offsets directly. But we can use kafka-runclass.sh script to execute any java class for the tool in proper environment. Exporting offsets will generate a file that will produce out for topic partition and its offset and will be in format that import can understand. ```

Answer 18

sh kafka-runclass.sh kafka.tools.ExportZkOffsets --zkconnect --group --output-file

Answer 19

1) First we need to export current offsets of the consumer group. This will create the format file that import can understand 2) Then for partitions for which we want to change value we can change the offset to desired values. 3) Before importing new offsets it is important that all the consumers in that consumer group are stopped. 4) Import offsets using command line

Answer 20

sh kafka-runs-class.sh kafka.tools.ImportZkOffsets --zkconnect --input-file Notice that group name is not required while importing offsets, it is because it is embedded in the export file.

Answer 21

There are some common configurations that are applied on cluster wide level to all the topics and consumer groups. But sometimes for some special topics we need to override those defaults. Those configurations are called override configurations.

Answer 22

sh kafka-configs.sh --zookeeper --alter --entity-type topics --entity-name --add-config =|,= sh kafka-configs.sh --zookeeper --alter --entity-type topics --entity-name --add-config retention.ms=360000

Answer 23

delete. retention.ms - How long in ms, deleted tombstones will be retained for the topic. Only valid for log compacted topics flush. ms - How long before forcing flush of this topic's message to disk max. message.bytes - The maximum size of single message for this topic. min. insync.replicas - min number of replicas to be in sync for the partition to be available. retention. bytes - The amount of messages in bytes to be retained for this topic. retention. ms - How long the messages should be retained for this topic in ms segment. ms - How frequently the log segment for each partition should be rotated.

Answer 24

For producer/consumer only valid overrides are their quotas. producer-bytes-rate - rate at which producer can produce per broker consumer-bytes-rate - rate at which consumer can consume per broker

Answer 25

It is the rate bytes/sec at which the specific client of CLIENT ID is allowed to either produce/consume on a per broker basis. Suppose we have five broker cluster and we specify quota of 10 MB/sec for a client. That client will be allowed to produce 10 MB/sec on each broker at the same time for a total of 50MB/sec.

Answer 26

sh kafka-configs.sh --zookeeper --describe --entity-type topics --entity-name

Answer 27

1) For re-election of leader replicas | 2) Utility for assigning partitions to brokers.

Answer 28

sh kafka-preferred-replica-election.sh --zookeeper

Answer 29

1) --topic - name of the single topic to consume 2) --whilelist - regular expression for topics to consume 3) --blacklist - topics to cosume except those provided by this regular expression only one of these options can be provided 4) --from-beginning - Consume messages in the topic(s) from the oldest offset. Otherwise default is latest. 5) --max-messages - Consume atmost NUM messages before exiting. 6) --partition - Consume only from partition ID NUM 7) --formatter - Specifies message formatter class to use to decode the messages. This defaults to kafka.tools.DefaultFormatter

Answer 30

1) DefaultFormatter 2) ChecksumFormatter 3) LoggingMessageFormatter 4) NoOpMessageFormatter

Answer 31

1) --broker-list - specifies comma separated list of brokers in the cluster 2) --topic - Topic that you are producing messages to 3) key.serializer - message encoder to use to serialize key. Defaults to DefaultEncoder 4) value.serializer - message encoder to use to serialize value. Defaults to DefaultEncoder 5) compression.codec - specify the type of codec to use when producing message. gzip, snappy or lz4 6) --sync - produces message synchronously, waiting for each message to be acknowledged before sending the next one.

Kafka Cluster Architectures and Administering Kafka Flashcards

(57 cards)