Kinesis Flashcards
What is streaming data?
Data generated continuously by many data sources, typically in small sizes (kilobytes)
Examples: online purchases, stock prices, game data, social network data, geodata (e.g. map tracking), IoT data
What is Kinesis?
AWS platform that you send your streaming data to
Lets you build your own custom applications to process and analyze that data
What is Kinesis Streams?
Receives data from data producers
Stores data in Shards
Sends data from shards to data consumers
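The producer side of this flow can be sketched in a few lines. This is a minimal illustration, not a reference implementation: it assumes a boto3 Kinesis client is passed in, and the function name send_event, the stream name, and the event fields are hypothetical. The put_record call requires a partition key, which Kinesis uses to choose the shard that stores the record.

```python
import json


def send_event(client, stream_name, event, partition_key):
    """Send one event (a dict) to a Kinesis stream as a JSON-encoded record.

    `client` is assumed to be a boto3 Kinesis client, or any object
    that exposes a compatible put_record method.
    """
    # put_record takes the payload as bytes; the partition key decides
    # which shard the record is stored in.
    return client.put_record(
        StreamName=stream_name,
        Data=json.dumps(event).encode("utf-8"),
        PartitionKey=partition_key,
    )


# Usage (needs AWS credentials and an existing stream; names are hypothetical):
#   import boto3
#   client = boto3.client("kinesis")
#   send_event(client, "example-stream", {"symbol": "ABC", "price": 10.5}, "ABC")
```

Passing the client in as an argument keeps the function easy to try out with a stub object before pointing it at a real stream.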
What is Kinesis Firehose?
Receives data from data producers.
No streams or shards to manage; it’s completely automated
When data comes in, it is either analyzed or sent on to a destination
e.g. sends data to S3, and then on to Redshift
or writes data straight to an Elasticsearch cluster
Automated way of doing Kinesis. You don’t need to worry about consumers.
What is Kinesis Analytics?
Lets you run SQL queries on data in Firehose or Streams.
Stores the query results in S3, Redshift or Elasticsearch.
What is a Shard?
Uniquely identified group of data records in a stream
A shard supports up to 5 transactions per second for reads, up to a maximum of 2 MB per second
and up to 1,000 records per second for writes, up to a maximum of 1 MB per second
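The per-shard limits above can be turned into a rough sizing estimate. The sketch below is an assumption-laden back-of-the-envelope helper, not an official formula: the function name shards_needed and the workload figures are made up for illustration, and it ignores the 5 read transactions per second limit.

```python
import math

# Per-shard limits as stated in the card above.
WRITE_MB_PER_SEC = 1
WRITE_RECORDS_PER_SEC = 1000
READ_MB_PER_SEC = 2


def shards_needed(write_mb_per_sec, records_per_sec, read_mb_per_sec):
    """Estimate the smallest shard count that satisfies every limit."""
    by_write_bytes = math.ceil(write_mb_per_sec / WRITE_MB_PER_SEC)
    by_write_records = math.ceil(records_per_sec / WRITE_RECORDS_PER_SEC)
    by_read_bytes = math.ceil(read_mb_per_sec / READ_MB_PER_SEC)
    # The tightest limit wins; a stream always has at least one shard.
    return max(1, by_write_bytes, by_write_records, by_read_bytes)


# Hypothetical workload: 3.5 MB/s written as 2,500 records/s, 7 MB/s read.
print(shards_needed(3.5, 2500, 7))
```

Whichever limit is hit first sets the shard count, so a workload of many tiny records can need more shards than its byte volume alone suggests.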
Describe the capacity of Kinesis Streams
The data capacity of a stream is a function of the number of shards you specify for the stream
The total capacity of the stream is the sum of the capacities of its shards
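Because every shard has the same limits, "sum of the capacities" reduces to multiplying the per-shard figures by the shard count. A small sketch, using the per-shard numbers from the shard card; the Capacity class and stream_capacity function are illustrative names, not part of any AWS SDK.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Capacity:
    write_mb_per_sec: int
    write_records_per_sec: int
    read_mb_per_sec: int
    read_transactions_per_sec: int


# Limits of a single shard, taken from the shard card.
PER_SHARD = Capacity(
    write_mb_per_sec=1,
    write_records_per_sec=1000,
    read_mb_per_sec=2,
    read_transactions_per_sec=5,
)


def stream_capacity(shard_count):
    """Total capacity of a stream with `shard_count` identical shards."""
    if shard_count < 1:
        raise ValueError("a stream needs at least one shard")
    return Capacity(
        write_mb_per_sec=PER_SHARD.write_mb_per_sec * shard_count,
        write_records_per_sec=PER_SHARD.write_records_per_sec * shard_count,
        read_mb_per_sec=PER_SHARD.read_mb_per_sec * shard_count,
        read_transactions_per_sec=PER_SHARD.read_transactions_per_sec * shard_count,
    )


# A 4-shard stream: 4 MB/s and 4,000 records/s in, 8 MB/s and 20 reads/s out.
print(stream_capacity(4))
```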