Analytics - EMR Flashcards

1
Q

Managed haddop framework on ec2 instances. Includes spark, hbase, presto, flin, hive.

EMR notebooks, several integration points with AWS.

a) S3
b) RDS
c) EMR

A

EMR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

In emr ________ node manages the cluster, tracks the sattus of tasks moniotrs cluster health.

a) master node
b) core node
c) task node

A

master

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

in emr______________ node hosots hdfs data runs tasks can be scaled up and down but with some risk. Multi node clusters have at least one.

a) master node
b) core node
c) task node

A

core

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

in emr ______________ nodes runs tasks, does not hhost data, optional, no risk of data loss when removing, good use of spot instances.

a) master node
b) core node
c) task node

A

tasks

spots instacnes can help with reducing costs. No risk of data loss, help epand your clsute dynamcially.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

in emr a _________________________ terminate cluster onc all steps are complete. Loading data, processing, sotring then shut down to save money.

a) transient cluster
b_ rds instance

A

transient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly