Domain 1: Collection Flashcards
Which Kinesis services offers asynchronous features and high throughput?
Kinesis Producer Library
Where must compression be implemented in Kinesis?
By the end user
How many GetRecords API calls are allowed per second by Kinesis streams in Classic mode?
- Maximum of 5 GetRecords API calls per shard per second = 200ms latency
- If 5 consumers application consume from the same shard, means every consumer can poll once a second and receive less than 400 KB/s
What is the average latency in Kinesis Steams Enhanced Fan Out mode?
70ms
What is the throughput of Kinesis Consumer Classic mode?
2MB/sec
What are the 4 services that Kinesis Firehose can write to?
S3, Redshift, ElasticSearch, Splunk
Describe the key features of the Kinesis Producer Library (KPL)
- Used for building high performance, long-running producers
- Automated and configurable retry mechanism
- Synchronous or Asynchronous API (better performance for async)
- Submits metrics to CloudWatch for monitoring
- Batching (Collect and Aggregate)
- Compression must be implemented by the user
- KPL Records must be de-coded with KCL or special helper library
Which protocol is not supported by IoT Device Gateway?
FTP
What is the minimum latency for Firehose with non full batches?
60 seconds
What data conversions are possible using Firehose with S3/
JSON to Parquet/ORC
What data transformations are possible using Firehose with Lambda?
CSV to JSON
What compression algorithms are supported by Firehose with S3?
GZIP, ZIP, SNAPPY
What compression algorithm is supported by Firehose with Redshift?
GZIP
How are you charged on Firehose?
Amount of data going through Firehose
Can Spark and KCL read from Firehose?
No. They can only read from Kinesis Data Streams
What is the minimum buffer time in Firehose?
60 seconds
Can resharding be done in parallel?
No
To how many AZs is data replicated in Kinesis Data Streams?
3
What is the default retention period in Kinesis Data Streams?
24 hours
(or customizable to 365 days)
Can data be deleted from Kinesis streams?
No
What is a key best practice with partition keys in Kinesis Streams?
Highly distributed keys
What is the maximum size of data blobs in Kinesis?
1MB
What are the throughput limits for Kinesis producers?
1MB/s or 1000 messages/s at write per shard
What happens if you exceed throughput limits on Kinesis producers?
Provisioned Throughput Exception