Missed Practice Exam Questions Flashcards

1
Q

What are some tool options for public online interactive data visualization at low cost?

A

HighChart and D3.js

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the data capacity limit in Aurora?

A

64TB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are some ways that HBase can integrate with S3?

A

Read replicas on S3, Store HBase StoreFiles and metadata on S3, Snapshots of HBase data on S3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What needs to be done to ensure that all VPC flow logs for Redshift COPY and UNLOAD commands are logged?

A

Enable Enhanced VPC Routing (forces all commands to use the VPC)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What visualization type should be used when there is multi-dimensional data that needs to be analyzed for outliers and trends?

A

Heatmap

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What should be done if you are using EMR with S3 data and you encounter consistency issues?

A

Enable EMRFS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How should you implement a real-time, multi-AZ replica of a Redshift cluster?

A

Spin up separate Redshift clusters across AZs, use Kinesis Streams to simultaneously write data to each other, use Route53 to route users to the nearest cluster

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What two methods can be employed to outside access Kibana deployed within a VPC?

A

Setup an SSH tunnel with port forwarding to allow access on port 5601 or Set up a reverse proxy server between your browser and Elasticsearch Service

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What security option should be employed on an S3 bucket when you need to restrict user access at a file level?

A

SSE-KMS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the maximum buffer time in Kinesis Firehose?

A

5 minutes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Does Glue integrate with Elasticsearch?

A

No

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Which services should be avoided if requirements demand low maintenance?

A

Kinesis Data Streams, EMR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Which IoT authentication protocol is most popular with mobile devices?

A

Cognito

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are best practices for loading large amounts of data between S3 and Redshift regularly?

A

Split files into 1 to 124MB files, use GZIP compression, use a single COPY command, load data in the same order as your sort key

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What tool should be used for integrating data between relational databases and EMR?

A

Sqoop

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What security mechanisms are supported by EMR?

A

LUKS, KMS, SSE-KMS

17
Q

What are three ways that Pig integrates with S3?

A

Directly writing to HCatalog tables in S3, Loading customer JAR files from S3 using the REGISTER command, Submitting work on the EMR console using Pig scripts stored in S3

18
Q

Can Pig integrate with Glue?

A

No

19
Q

Does EMRFS offer a caching feature?

A

No

20
Q

What should be done when experiencing high costs and low speeds when using EMRFS?

A

Use S3DistCp as a preliminary step

21
Q

Can Athena access S3 data that is in Glacier?

A

No

22
Q

Where must encryption artifacts be stored when configuring in flight encryption between S3 and EMR?

A

S3

23
Q

In Kinesis Data Analytics, what are Stagger Windowed queries best for?

A

Data arriving at inconsistent times