Apache Spark - Analytics Flashcards
A Dstributed processing framework for big data
a) RDS
b) S3
c) Aapache spark
Apache spark
Supports code use across Patch processing, interactive queries (spark SQL), real time analytics, Machine learning, Graph porcessing.
a) RDS
b) S3
c) Aapache spark
Apache Spark
Handles realtim time streaming data which can be intgerated ith Kinesis, Kafka on EMR
a) RDS
b) S3
c) Aapache spark
Apache spark
___________ not meant for OLTP
a) RDS
b) S3
c) Aapache spark
Apache Spark
True or false
Spark apps are run as indepednant processes on a cluster
True
True or False
Within the Spark components.
Spark core consists of - Memory management, fault recovery, scheduling, dsitribute & monitor jobs, interact with stoagre scala, python, java, R
True
True or False
As part of the Spark components
Spark streaming consists of Real-time streamining analytics, structured streamining, twitter, kafka, flume, hdfs, zeromq.
True