1. Intro to Big Data Flashcards
Which of the following is NOT one of the 5 Vs of Big Data?
A) Velocity
B) Variety
C) Veracity
D) Virtualization
D) Virtualization
What is the main challenge in dealing with Big Data?
A) Finding enough data
B) Storing and analyzing the data
C) Keeping the data secure
D) Making the data visually appealing
B) Storing and analyzing the data
What is Hadoop primarily used for?
A) Web development
B) Distributed data storage and processing
C) Graphic design
D) Game development
B) Distributed data storage and processing
Which scaling approach involves spreading data and processing across more machines?
A) Scale-up
B) Scale-down
C) Scale-out
D) Scale-in
C) Scale-out
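Study note: scale-out can be pictured as splitting records across machines by key. The sketch below is a minimal illustration, not how Hadoop itself partitions data; `assign_node`, `partition`, and the record names are hypothetical, and CRC32 is just an assumed choice of stable hash.

```python
import zlib

def assign_node(key, num_nodes):
    # Map a record key to a node index using a stable hash (CRC32 is an assumed choice).
    return zlib.crc32(key.encode("utf-8")) % num_nodes

def partition(keys, num_nodes):
    # Spread keys over num_nodes shards; adding nodes means adding shards.
    shards = [[] for _ in range(num_nodes)]
    for key in keys:
        shards[assign_node(key, num_nodes)].append(key)
    return shards

# Hypothetical records spread over a 4-node cluster.
shards = partition([f"record-{i}" for i in range(1000)], 4)
```

Each node holds only its own shard, so capacity grows by adding machines rather than by upgrading one machine (scale-up).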
In Hadoop’s HDFS, what is the role of the NameNode?
A) It stores the actual data
B) It manages the file system namespace and regulates access to files
C) It performs checkpointing for the NameNode
D) It is a client that reads and writes data
B) It manages the file system namespace and regulates access to files
Which of the following is a core component of Hadoop’s architecture?
A) HBase
B) MapReduce
C) Cassandra
D) MongoDB
B) MapReduce
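Study note: the MapReduce model can be sketched in plain Python as a word count. This is a single-process illustration under simplifying assumptions, not the Hadoop API; the function names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are hypothetical.

```python
from collections import defaultdict

def map_phase(line):
    # Emit a (word, 1) pair for every whitespace-separated word in the line.
    return [(word.lower(), 1) for word in line.split()]

def shuffle(pairs):
    # Group emitted values by key, the step a framework performs between map and reduce.
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(key, values):
    # Combine all counts for one word into a single total.
    return key, sum(values)

def word_count(lines):
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())

counts = word_count(["big data", "Big hadoop"])
```

In a real cluster the map and reduce calls would run in parallel on different machines; here they run sequentially to show only the data flow.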
What is the purpose of the SecondaryNameNode in HDFS?
A) It acts as a backup for the NameNode
B) It performs checkpointing for the NameNode
C) It stores the actual data
D) It manages the network traffic
B) It performs checkpointing for the NameNode
Which of the following statements about HDFS is true?
A) HDFS is designed for small files
B) HDFS stores multiple copies of data blocks for fault tolerance
C) HDFS uses a peer-to-peer architecture
D) HDFS is primarily used for real-time data processing
B) HDFS stores multiple copies of data blocks for fault tolerance
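Study note: the fault-tolerance idea behind block replication can be sketched as follows. The block size (128 MB) and replication factor (3) are assumed values matching common HDFS defaults, the round-robin placement is a simplification (real HDFS placement is rack-aware), and `place_blocks`, `readable_after_failure`, and the node names are hypothetical.

```python
import math

BLOCK_SIZE = 128 * 1024 * 1024  # assumed block size in bytes
REPLICATION = 3                 # assumed replication factor

def place_blocks(file_size, datanodes, block_size=BLOCK_SIZE, replication=REPLICATION):
    # Split a file into fixed-size blocks and assign each block to
    # `replication` distinct DataNodes (simplified round-robin placement).
    if replication > len(datanodes):
        raise ValueError("need at least as many DataNodes as replicas")
    num_blocks = math.ceil(file_size / block_size)
    return {
        block_id: [datanodes[(block_id + r) % len(datanodes)] for r in range(replication)]
        for block_id in range(num_blocks)
    }

def readable_after_failure(placement, failed_node):
    # A file stays readable if every block still has a replica on a live node.
    return all(any(n != failed_node for n in nodes) for nodes in placement.values())

# A hypothetical 300 MB file on a 4-node cluster.
placement = place_blocks(300 * 1024 * 1024, ["dn1", "dn2", "dn3", "dn4"])
```

Because every block lives on several DataNodes, losing any single node still leaves at least one copy of each block available.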
Which of the following is NOT one of the commonly cited 5 Vs of Big Data?
A) Velocity
B) Veracity
C) Volatility
D) Volume
C) Volatility
In Hadoop, the NameNode:
A) Stores actual data
B) Manages the file system namespace
C) Performs data processing
D) Acts as a data node
B) Manages the file system namespace
In Hadoop, data is stored in:
A) The NameNode
B) The ResourceManager
C) HDFS
D) YARN
C) HDFS