HBase Concepts Flashcards
What is a node?
A single computer
What is a cluster?
A group of nodes connected and coordinated by certain nodes to perform tasks
What is a Master Node?
A node performing coordination tasks
What is a Slave Node?
A worker node performing tasks assigned to it
What is a Daemon
A process or program that runs in the background
Where is table data stored?
In HDFS
How is HBase data stored in HDFS?
The data is split into HDFS blocks and stored on multiple nodes in the cluster
What is an HBase table split into?
Regions
What serves Regions to clients?
Region Servers
Can a RegionServer have regions for more than one table?
Yes
What is the HBase Master responsible for?
1 - Coordinates which regions are managed by each Region Server
2 - Handles new table creation and other housekeeping operations
Can an Hbase cluster have multiple Masters?
Yes, for high availability. But only one can be active at a time.
What service handles the coordination of the Masters?
Zookeeper
When a cluster has multiple Master, how is the active master determined?
Upon startup all Masters connect to Zookeeper. The first Master to connect, becomes the active master.
What happens if the controlling Master fails?
If you have additional master they will compete to run the cluster again.
What two servers are typically kept together in the slave nodes?
The data Node and RegionServer
List 4 master nodes?
Name Node, Secondary Name Node, Master, Zookeeper
What are tables comprised of?
rows, columns and column families
How are rows sorted?
They are sorted in rowkey order