Failed Questions Flashcards
If items in DynamoDB are being overwritten by PutItem calls that use the same partition key, what can be done to prevent this?
Use conditional writes
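A minimal sketch of a conditional write, assuming the boto3 low-level DynamoDB client. The helper names, table name, and key name are hypothetical; only put_item, ConditionExpression, and ConditionalCheckFailedException come from boto3 itself.

```python
def build_conditional_put(table_name, item, partition_key):
    """Build PutItem parameters that refuse to overwrite an existing item."""
    return {
        "TableName": table_name,
        "Item": item,
        # The write succeeds only if no item with this key exists yet.
        "ConditionExpression": "attribute_not_exists(#pk)",
        "ExpressionAttributeNames": {"#pk": partition_key},
    }


def put_if_absent(client, table_name, item, partition_key):
    """Return True if the item was written, False if the key already existed.

    `client` is assumed to be boto3.client("dynamodb").
    """
    try:
        client.put_item(**build_conditional_put(table_name, item, partition_key))
        return True
    except client.exceptions.ConditionalCheckFailedException:
        return False
```

Example call, with hypothetical names: put_if_absent(boto3.client("dynamodb"), "Users", {"user_id": {"S": "u-1"}}, "user_id").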
If there are more files than expected coming from Kinesis Firehose, what is the most likely root cause?
Kinesis Data Firehose scaled.
Are Security Groups Stateful?
Yes
Are NACLs stateful?
No
When auto-scaling EMR, what is a good metric to scale based on compute?
YARNMemoryAvailablePercentage
When recovering an EC2 instance, do you retain the public IP address that was assigned to it?
Yes
What is contained in the Glue Data Catalog?
References to the data that is used as sources and targets of your extract, transform, and load (ETL) jobs in AWS Glue.
What is the minimum storage duration for transitioning objects from S3 Standard to S3 One-Zone IA?
30 days
Can Kinesis Data Firehose support multiple
No
How do you schedule refreshing of materialized views in Redshift?
The Amazon Redshift query editor v2
What is the Lambda maximum runtime?
15 minutes (900 seconds)
What is the percentage required for schemas to be considered similar in Glue?
70%
What is the minimum amount of data recommended for AWS Snowmobile?
10 PB
Can Redshift Spectrum move data from S3 to Redshift?
No. It can only query the data in place in S3.
Does Kinesis Data Analytics process all columns or subsets of columns that you define?
All columns
Can S3 notifications support SQS Standard and FIFO destinations?
No, only standard queues
Does SQS integrate with third party solutions?
No
Does EventBridge integrate with third party solutions?
Yes
Can Athena perform ETL?
No
What AWS service can be used to share data between organizations?
AWS Data Exchange
Can Athena connect to third party resources?
No
Can QuickSight support customer-provided keys?
No
How can you programmatically ensure that an AWS Glue catalog is up to date as partitions are created in S3?
Boto3 using create_partition API call.
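A hedged sketch of the create_partition approach, assuming the boto3 Glue client and Hive-style partition prefixes (year=2024/month=01/). The helper names are hypothetical; get_table and create_partition are the real Glue API calls.

```python
import copy


def build_partition_input(table_storage_descriptor, partition_keys, partition_values):
    """Build the PartitionInput argument for Glue's create_partition call."""
    descriptor = copy.deepcopy(table_storage_descriptor)
    # Assumption: partitions live under Hive-style prefixes such as year=2024/month=01/.
    suffix = "/".join(
        f"{key}={value}" for key, value in zip(partition_keys, partition_values)
    )
    base = table_storage_descriptor["Location"].rstrip("/")
    descriptor["Location"] = f"{base}/{suffix}/"
    # Glue expects partition values as strings.
    return {
        "Values": [str(value) for value in partition_values],
        "StorageDescriptor": descriptor,
    }


def register_partition(glue_client, database, table, partition_values):
    """Register one new partition, reusing the table's own storage settings.

    `glue_client` is assumed to be boto3.client("glue").
    """
    table_def = glue_client.get_table(DatabaseName=database, Name=table)["Table"]
    keys = [column["Name"] for column in table_def["PartitionKeys"]]
    glue_client.create_partition(
        DatabaseName=database,
        TableName=table,
        PartitionInput=build_partition_input(
            table_def["StorageDescriptor"], keys, partition_values
        ),
    )
```

One possible trigger is a Lambda function that reacts to S3 object-created events and calls register_partition for each new prefix. That wiring is an assumption, not part of the card.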
What does Glue DataBrew NEST_TO_MAP do?
Converts user-selected columns into key value pairs. The order is not maintained
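A plain-Python illustration of the idea (not the DataBrew API; the function and column names are made up for the example):

```python
def nest_to_map(row, columns):
    """Collapse the selected columns of one row into a single key-value map.

    Keys are the column names and values are that row's values.
    """
    return {column: row[column] for column in columns}


row = {"id": 1, "city": "Oslo", "zip": "0150"}
nested = nest_to_map(row, ["city", "zip"])
```

Because the order of the selected columns is not maintained, compare the result as a map rather than relying on key order.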
What does Origin Access Control do in a CloudFront distribution?
It restricts access to the origin (for example, an S3 bucket) so that the content can only be served through CloudFront.
How do you copy snapshots between Redshift clusters?
Create a snapshot copy grant in the destination Region and configure cross-Region snapshot copy in the source Region.
What is query result reuse in Athena?
Query Result Reuse allows results to be shared among users in a workgroup. Good for every query where the source data does not change.
Where do you place the DMS replication instance for the best performance?
The target VPC
What is a Map state in Step Functions?
It runs a set of workflow steps for each item in a dataset.
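A sketch of what a Map state looks like in an Amazon States Language definition, written as a Python dict so it can be serialized with json.dumps. The state names, the concurrency value, and the task ARN are placeholders chosen for illustration.

```python
import json


def map_state_definition(items_path, task_resource_arn):
    """Return a state machine definition with a single inline Map state."""
    return {
        "StartAt": "ProcessEachItem",
        "States": {
            "ProcessEachItem": {
                "Type": "Map",
                # JSONPath to the array in the state input to iterate over.
                "ItemsPath": items_path,
                # Illustrative limit on parallel iterations.
                "MaxConcurrency": 5,
                # The workflow steps that run once per item.
                "ItemProcessor": {
                    "ProcessorConfig": {"Mode": "INLINE"},
                    "StartAt": "ProcessItem",
                    "States": {
                        "ProcessItem": {
                            "Type": "Task",
                            "Resource": task_resource_arn,
                            "End": True,
                        }
                    },
                },
                "End": True,
            }
        },
    }


definition = map_state_definition(
    "$.orders",
    "arn:aws:lambda:us-east-1:123456789012:function:example-placeholder",
)
definition_json = json.dumps(definition, indent=2)
```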
Can an EMR cluster reside in multiple AZs?
No
Can QuickSight read from a Kinesis Data Stream?
Not directly.
What is low cardinality?
A dataset or a database column that has a small number of distinct values.
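A small example of measuring cardinality, using made-up column data:

```python
def cardinality(values):
    """Number of distinct values in a column."""
    return len(set(values))


# Hypothetical columns from the same four-row table.
country = ["US", "US", "DE", "US"]      # few distinct values: low cardinality
order_id = [1001, 1002, 1003, 1004]     # every value distinct: high cardinality

country_cardinality = cardinality(country)
order_id_cardinality = cardinality(order_id)
```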
Can you share encrypted RDS snapshots with another account if they are encrypted with the default KMS key?
No. The snapshot must be encrypted with a customer managed key.
What is the best way to store output data from an ECS cluster?
Use EFS
Can Athena use PartiQL?
Yes, to query DynamoDB
How can you view a query run plan in Athena?
Use EXPLAIN ANALYZE
Can Glue Workflows trigger Lambda?
No. Not meant for generic orchestration
What tables in Redshift can be used to identify long-running queries and table locks?
STL_ALERT_EVENT_LOG
SVV_TRANSACTIONS
Is Kinesis or SQS most cost effective?
SQS
What is the best AWS service to aggregate CloudWatch Logs?
Kinesis Data Firehose
When using Data Firehose for log aggregation, where should the service be configured?
In the logging account.
What is an EC2 instance store?
An instance store provides temporary block-level storage for your instance. This storage is located on disks that are physically attached to the host computer.
What is the default throughput of an SQS FIFO queue?
300 API calls per second. With batching of up to 10 messages per call, that is a maximum of 3,000 messages per second.
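The arithmetic behind those numbers, as a short sketch (the function names are illustrative):

```python
def batch_messages(messages, batch_size=10):
    """Split messages into batches; SQS accepts at most 10 messages per batch call."""
    if not 1 <= batch_size <= 10:
        raise ValueError("batch_size must be between 1 and 10")
    return [messages[i:i + batch_size] for i in range(0, len(messages), batch_size)]


def max_messages_per_second(api_calls_per_second=300, batch_size=10):
    """Upper bound on throughput when every call carries a full batch."""
    return api_calls_per_second * batch_size


batches = batch_messages(list(range(25)))
ceiling = max_messages_per_second()
```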
Can you attach an instance store volume?
No.
What is the best use case for the Redshift ALL distribution style?
Used for small tables that don’t update often.
If a Redshift cluster has a lot of load on one node and minimal load on another, what can be done?
Change the distribution key to a column that has the largest dimension.
What does largest dimension mean when talking about a database?
A column with the largest number of unique entries.
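A toy simulation of why a key with many unique values spreads rows more evenly. It assumes rows are placed by hashing the distribution key. SHA-256 is only a stand-in here, not Redshift's actual hash function, and the column data is invented.

```python
import hashlib
from collections import Counter


def rows_per_slice(key_values, num_slices):
    """Count how many rows land on each slice when rows are placed by key hash."""
    counts = Counter(
        int(hashlib.sha256(str(value).encode()).hexdigest(), 16) % num_slices
        for value in key_values
    )
    return [counts.get(i, 0) for i in range(num_slices)]


def skew_ratio(per_slice):
    """Busiest slice divided by the average slice; 1.0 means perfectly even."""
    average = sum(per_slice) / len(per_slice)
    return max(per_slice) / average


# Two distinct values can occupy at most two of the four slices.
low_cardinality_key = ["shipped" if i % 2 else "pending" for i in range(10_000)]
# Ten thousand distinct values spread across all four slices.
high_cardinality_key = list(range(10_000))

low_skew = skew_ratio(rows_per_slice(low_cardinality_key, 4))
high_skew = skew_ratio(rows_per_slice(high_cardinality_key, 4))
```

The low-cardinality key leaves at least half of the slices empty, so its skew ratio is at least 2, while the high-cardinality key stays close to 1.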
What can be done when there are an excessive number of partitions in S3 that are impacting performance with Athena and Glue?
A Glue partition index
Athena Partition projection
Does DMS migrate empty tables?
No
What service can be used to move data from S3 to Kinesis Data Streams?
DMS
Can Kinesis Data Analytics ingest data directly?
No
Can Redshift read streaming data from Kinesis?
Yes
How do you configure Redshift to read from a Kinesis Data Stream?
Use CREATE EXTERNAL SCHEMA ... FROM KINESIS, then create a materialized view that selects from the stream.
Can Kinesis Data Firehose write directly to DynamoDB?
No
What configuration needs to be enabled to begin using Spark on Athena?
You need to create a Spark-enabled workgroup.
Does each object in S3 have a unique key when it is encrypted with SSE-S3?
Yes
Can S3 Select query data in Glacier?
No. You need to move it into S3 first
When a second KCL worker is added, what happens?
It load balances with the first. For example, 4 shards that were being read by 1 KCL worker are now split so that each of the 2 workers reads 2 shards.
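A simplified illustration of that rebalancing. The real KCL coordinates workers through shard leases in a DynamoDB table; the round-robin assignment below is only an assumption used to show the resulting split.

```python
def assign_shards(shard_ids, worker_ids):
    """Spread shards across workers as evenly as possible (round-robin)."""
    assignment = {worker: [] for worker in worker_ids}
    for index, shard in enumerate(shard_ids):
        owner = worker_ids[index % len(worker_ids)]
        assignment[owner].append(shard)
    return assignment


shards = ["shard-0", "shard-1", "shard-2", "shard-3"]
one_worker = assign_shards(shards, ["worker-a"])
two_workers = assign_shards(shards, ["worker-a", "worker-b"])
```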
What is AWS Application Auto Scaling?
It is used by DynamoDB for autoscaling and adjusts provisioned throughput capacity on your behalf in response to actual traffic patterns.
In S3 lifecycle policies, what takes precedence: Transition or Delete Markers?
Transition
In S3 lifecycle policies, what takes precedence: Transition or Permanent Deletion?
Permanent Deletion
How can you save the output of an Athena query to S3?
Use the UNLOAD command
What should cold storage in OpenSearch be used for?
Rarely accessed data
Is OAuth 2.0 a valid authentication mechanism for OpenSearch?
No
How can you reduce the time it takes for your queries to scan large tables in RedShift?
Use a sort key on columns that are frequently used in query filters.