AWK Kinesis Flashcards
For Kinesis data streams, how does it work?
Kinesis data streams enable you to inject data from thousands of sources, Kinesis data streams scales based on the number of shards you create. Kinesis data streams buffer the data for 24hrs by default and enable one or more consumers to read form the stream.
When you create a Kinesis stream, in what region are you creating the stream or is it a global service?
You are creating the stream in the region you have selected as Kinesis data stream is not a global service.
What is Kinesis?
It is a family of products for data stream processing, this means injection, analysis/process and store.
What is the Kinesis produce?
it is the entity that puts data into the stream, - IOT device _ Mobile device - Application device - EC2 device - On-prem server
What is a Kinesis consumer?
This is the entity that takes data out of the stream.
Can I have multiple consumers?
Yes
What types of Kinesis consumers can I have?
You cna have,
- EC2 using Kinesis customer library (KCL)
Lambda
Kinesis Firehose
What types of streams can I have in Kinesis?
- Data streams
- Video streams
How long is dat stored in a Kinesis stream?
24hrs (you cna increase this to 7day for an extra charge)
How dose a Kinesis stream related to shards?
A kinesis stream is a collection of shards.
What are the units associated with a single shard?
Read at 2mb per second
Write at 1mb per second
What is the max number of shards?
500
I require a Kinesis stream capable of 10mb write, how many shards do I need?
You need 10
How many data records per second can a single shard in a Kinesis accept?
100 per shard
How big can a single Kinesis data record be?
1MB
What is the partition key used for in Kinesis?
It is used to select the shard to use when writing the data.
We are using Kinesis and our org has a policy where all data in transit and at rest is encrypted, is it possible to have Kinesis encrypt the data at rest?
Yes 100%, you can use SSE-KMS with AWS keys or SSE-KMS encryption with client keys.
How can I monitor the Kinesis stream metrics?
You can use cloud whatch to monitor shard level metrics like incoming bytes, outgoing bytes, etc
I wnat to access Kinesis form my VPC without going on the internet, how cna I do this?
VPC endpoints.
I have Lambda configured to operate in the vPC, how best cna I have lambda access Kinesis?
Through VPC endpoints
What is an enhansed fan out in relation to Kinesis
It means that you select a consumer to be given more of the bandwidth.
What is Kinesis Firehose?
It enables you to take data From Kinesis stream and push it to a datastore like Elasticsearch S3 Redshift Splunk
For Kinesis what the two input sources you can have?
- Kinesis
- Direct, send records direct to Kinesis Firehose.
I want to push CloudWatch events into a Kinesis Firehose, how can I do this?
Kinesis Firehose can take Direct input from these sources.