Amazon EMR | Using HBase Flashcards

1
Q

Does Pig support access from JDBC?

Using HBase

Amazon EMR | Analytics

A

No. Pig does not support access through JDBC.

2
Q

What is Apache HBase?

A

HBase is an open source, non-relational, distributed database modeled after Google’s Bigtable. It was developed as part of the Apache Software Foundation’s Hadoop project and runs on top of the Hadoop Distributed File System (HDFS) to provide Bigtable-like capabilities for Hadoop. HBase gives you a fault-tolerant, efficient way of storing large quantities of sparse data using column-based compression and storage. It also provides fast lookups because recent writes and frequently read blocks are served from memory (the MemStore and block cache), while the data itself is persisted to HDFS. HBase is optimized for sequential write operations and is highly efficient for batch inserts, updates, and deletes. HBase works seamlessly with Hadoop, sharing its file system and serving as a direct input and output to Hadoop jobs. HBase also integrates with Apache Hive, enabling SQL-like queries over HBase tables, joins with Hive-based tables, and support for Java Database Connectivity (JDBC).
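To make "sparse, column-based storage" more concrete, here is a minimal toy sketch in Python. It is not the HBase API: the `SparseTable` class, its method names, and the sample row keys are all hypothetical, and everything lives in a plain in-memory dictionary. It only illustrates two ideas from the answer above: cells that were never written take up no space, and rows are read back in sorted row-key order.

```python
from collections import defaultdict


class SparseTable:
    """Toy model of a sparse column-family table (hypothetical, not the HBase API)."""

    def __init__(self):
        # row key -> {"family:qualifier" -> value}; unwritten cells are simply absent
        self._rows = defaultdict(dict)

    def put(self, row_key, column, value):
        """Write a single cell, creating the row if needed."""
        self._rows[row_key][column] = value

    def get(self, row_key, column=None):
        """Return one cell (None if absent), or a copy of the whole row."""
        row = self._rows.get(row_key, {})
        return dict(row) if column is None else row.get(column)

    def scan(self, start=None, stop=None):
        """Yield (row_key, cells) in sorted key order; start inclusive, stop exclusive."""
        for key in sorted(self._rows):
            if start is not None and key < start:
                continue
            if stop is not None and key >= stop:
                break
            yield key, dict(self._rows[key])


# Sample data: rows in the same table may hold entirely different columns.
table = SparseTable()
table.put("user#001", "info:name", "Ada")
table.put("user#001", "info:email", "ada@example.com")
table.put("user#002", "info:name", "Grace")
table.put("video#001", "meta:title", "Intro")

# Range scan over the "user#" key prefix ("$" is the character after "#").
user_keys = [key for key, _ in table.scan("user#", "user$")]
```

In this sketch, `user#002` has no `info:email` cell at all rather than an empty placeholder, which is the sense in which the table is sparse. A real HBase table additionally versions cells by timestamp and persists them to HDFS, which this toy omits.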

3
Q

Are there new features in HBase specific to Amazon EMR?

A

With Amazon EMR, you can back up HBase to Amazon S3 (full or incremental, manual or automated) and restore from a previously created backup.
