Kinesis overview and Streams Flashcards
What are the 3 types of Data Collection?
- Real-Time (Immediate Actions)
- Near Real-Time (Reactive Actions)
- Batch (Historical Analysis)
What are 3 examples of Real-Time data collection?
- Kinesis Data Streams (KDS)
- Simple Queue Service (SQS)
- Internet of Things (IoT)
What are 2 examples of Near Real-Time Data Collection?
- Kinesis Data Firehose (KDF)
- Database Migration Service (DMS)
What are 2 examples of Batch Data Collection?
- Snowball
- Data Pipeline
What are the 3 Kinesis Services?
- Kinesis Streams
- Kinesis Analytics
- Kinesis Firehose
What does Kinesis Streams allow you to do?
Low latency streaming ingest at scale
What does Kinesis Analytics allow you to do?
Perform real-time analytics on streams using SQL
What does Kinesis Firehose allow you to do?
Load streams into S3, Redshift, ElasticSearch & Splunk
How do Kinesis Streams work?
- Producers update data in shards/partitions
- Consumers read from partitions
What is the data retention in Kinesis Streams?
- Default is 24 hours
- Can go up to 7 days
How many applications can consume the same stream?
Multiple applications can consume the same stream
Can an application update data in a Kinesis stream?
No, the data is immutable. It is append-only and will remain until the data retention period has been reached
How are records ordered?
They are ordered per shard
What is produced to a shard?
Records and Record Keys are produced to shards
What is a record made up of?
A record is made up of a Data Blob