Analytics - EMR Flashcards
Managed haddop framework on ec2 instances. Includes spark, hbase, presto, flin, hive.
EMR notebooks, several integration points with AWS.
a) S3
b) RDS
c) EMR
EMR
In emr ________ node manages the cluster, tracks the sattus of tasks moniotrs cluster health.
a) master node
b) core node
c) task node
master
in emr______________ node hosots hdfs data runs tasks can be scaled up and down but with some risk. Multi node clusters have at least one.
a) master node
b) core node
c) task node
core
in emr ______________ nodes runs tasks, does not hhost data, optional, no risk of data loss when removing, good use of spot instances.
a) master node
b) core node
c) task node
tasks
spots instacnes can help with reducing costs. No risk of data loss, help epand your clsute dynamcially.
in emr a _________________________ terminate cluster onc all steps are complete. Loading data, processing, sotring then shut down to save money.
a) transient cluster
b_ rds instance
transient