Collection Flashcards
How long are records accessible in stream ?
24 hours
By how many days can we raise the limit for which a record in stream is accessible ?
7 days
How do we raise the limit from 24 hours to upto 1 day ?
Enabling extend Data Retention
What is the maximum size of a data blob within 1 record ?
1 Mb
How many records per second can each shard support ?
1000 PUT records
What is the maximum number of shards you can have in a stream ?
No upper limit
What is the maximum number of streams you can have in an account ?
No upper limit
How much data per second can a single shard ingest ?
1 Mb of data per second
How many writes can a single shard ingest ?
1000 records
What is the default shard limit for Virginia, Oregon and Ireland ?
500 shards
What is the default shard limit outside of these 3 regions ?
200 shards
How many read transactions per second can each shard support ?
5
How many records can each read transaction provide ?
10,000 records
What is the upper limit of read transaction ?
10 Mb
GetRecords can retrieve how many Mb of data per call from a single shard ?
10 Mb
GetRecords can retrieve how many records per call ?
Upto 10,000
How many read transactions is 1 call to GetRecords counted as ?
1
What is the maximum speed of each shard via GetRecords ?
2 Mb per second
If calls to GetRecords returns 10 Mb, subsequent calls made within the next 5 seconds, do what ?
Throw an Exception
How many consumers per stream can be registered to use enhanced fan-out ?
20
What 3 things does a record consist of ?
- Sequence Number
- Partition Key
- Data Blob
You are accumulating data from IoT devices and you must send data within 10 seconds to Amazon ElasticSearch service. That data should also be consumed by other services when needed. Which service do you recommend using?
- Kinesis Data Streams
- Kinesis Data Firehose
- SQS
- Database Migration Service
Kinesis Data Stream
You need a managed service that can deliver data to Amazon S3 and scale automatically for you. You want to be billed only for the actual usage of the service and be able to handle peak loads. Which service do you recommend?
- Kinesis Data Streams
- Kinesis Data Firehose
- SQS
- Kinesis Analytics
Kinesis Data Firehose
You are sending a lot of 100B data records and would like to ensure you can use Kinesis to receive your data. What should you use to ensure optimal throughput, that has asynchronous features ?
- Kinesis SDK
- Kinesis Producer Library
- Kinesis Client Library
- Kinesis Connector Library
- Kinesis Agent
Kinesis Producer Library
(Through batching (collection and aggregation), we can achieve maximum throughput using the KPL. KPL is also supporting an asynchronous API)