Big Data Whitepaper Flashcards

1
Q

What is the minimum number of DPUs required for a Glue ETL job?

A

2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the default number of DPUs allocated to a Glue ETL job

A

10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What two languages does Glue ETL use when generating code?

A

Python and Spark

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the minimum interval for Glue ETL jobs?

A

5 minutes (not the right tool for streaming data)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What kind of databases is AWS Glue not compatible with?

A

NoSQL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the maximum item size in DynamoDB?

A

400KB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How many data centers is DynamoDB replicated across?

A

3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What needs to be used to achieve regional replication in DynamoDB?

A

DynamoDB Streams

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

By default, how often does Elasticsearch take snapshots and backup to S3?

A

Daily

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the maximum EBS volume size per Elasticsearch instance?

A

1.5TB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the default maximum number of nodes per ES domain?

A

20

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the absolute maximum number of nodes per ES domain?

A

100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are two anti-patterns for ES?

A

OLTP and Ad-hoc queries

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What Quicksight edition must be used if requiring encryption at rest?

A

Enterprise

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When should EC2 be used in a big data setting vs other serverless or managed solutions?

A

Specialized environments or compliance requirements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are some notebooks that can be used with Athena?

A

RStudio, Jupyter, Zeppeling