Kinesis Flashcards
What is Kinesis?
A managed alternative to Apache Kafka to Easily collect, process, and analyze video and data streams in real time
What is mainly great for Kinesis?
for “Real-time” big data, for streaming processing frameworks (Spark, NiFi, etc…)
How is data replicated in Kinesis?
automatically to 3 AZs
What are the Kinesis services?
Kinesis Streams
Kinesis Analytics
Kinesis Data Firehose
What is Kinesis Streams?
It is Kinesis itself, low latency streaming ingests at scale
What is Kinesis Analytics?
managed service to perform real-time analytics on streams using SQL
What is Kinesis Data Firehose?
fully managed service to load streams into S3, Redshift, ElasticSearch, Splunk
What are common streams consumed by Kinesis Streams?
ClickStreams
IoT devices
Metrics and logs
Are Kinesis Streams divided?
in ordered Shards / partitions
What is the data retention period by default in Kinesis Streams?
1 day, up to 7
What ability has Kinesis that SQS does not?
to reprocess / replay data
How many consumers can have a Kinesis Stream?
multiple
How does Kinesis scale out?
adding new shards, it does not auto scale
What can’t you do to data inserted in Kinesis?
delete it, it is inmutable
What is the writing speed of a Kinesis Stream Shard?
1 MB/s or 1000 messages at write PER SHARD
What is the reading speed of a Kinesis Stream Shard?
2 MB/s at read PER SHARD
How is billing in Kinesis Streams?
per shard provisioned
What can happens to the number of shards over time?
can evolve, reshard or merge
How are records ordered in Kinesis Streams?
per shard
What contains a record sent from a producer to Kinesis?
A message key and the data itself