What is HBase
HBase is modeled on Google's Bigtable. It is an open-source, distributed, column-oriented “NoSQL” database. It runs on the Hadoop infrastructure (ZooKeeper as a lock and coordination service, HDFS with its NameNode as the file system), so it inherits fault tolerance and scalability, and it adds random read/write capability on top of HDFS.
Tables are distributed as regions, and regions are automatically split and redistributed as data grows. HBase scales linearly and modularly by adding RegionServers, which can be hosted on a public cloud. Within a region, data is divided vertically by column family into stores, and each store is persisted as files on HDFS.
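To make the region and store layout concrete, here is a minimal in-memory sketch in Python. It is a toy model, not the HBase API: the class name ToyTable, the row-count threshold MAX_ROWS_PER_REGION, and the split-at-median policy are all assumptions made for illustration. Real HBase decides when to split based on the size of a region's files, and it places the resulting regions on RegionServers.

```python
from bisect import bisect_right

# Hypothetical split threshold for this toy model. Real HBase splits by
# on-disk size, not by row count.
MAX_ROWS_PER_REGION = 4


class ToyTable:
    """Toy model: a table is a list of regions, each owning a contiguous row-key range."""

    def __init__(self):
        # start_keys[i] is the inclusive lower bound of regions[i];
        # the empty string means "from the very first key".
        self.start_keys = [""]
        # Each region maps row key -> {column family -> {qualifier -> value}}.
        self.regions = [{}]

    def _region_index(self, row_key):
        # The owning region is the last one whose start key is <= row_key.
        return bisect_right(self.start_keys, row_key) - 1

    def put(self, row_key, family, qualifier, value):
        i = self._region_index(row_key)
        row = self.regions[i].setdefault(row_key, {})
        row.setdefault(family, {})[qualifier] = value
        if len(self.regions[i]) > MAX_ROWS_PER_REGION:
            self._split(i)

    def _split(self, i):
        # Split at the median row key so both halves remain contiguous ranges.
        keys = sorted(self.regions[i])
        mid = keys[len(keys) // 2]
        old = self.regions[i]
        self.regions[i] = {k: v for k, v in old.items() if k < mid}
        self.regions.insert(i + 1, {k: v for k, v in old.items() if k >= mid})
        self.start_keys.insert(i + 1, mid)

    def store(self, region_index, family):
        """Vertical slice of one region: only the cells of one column family."""
        region = self.regions[region_index]
        return {k: region[k][family] for k in sorted(region) if family in region[k]}


table = ToyTable()
for n in range(10):
    table.put(f"user{n:02d}", "info", "name", f"name-{n}")

print(table.start_keys)        # ['', 'user02', 'user04', 'user06']
print(table.store(0, "info"))  # rows user00 and user01, "info" family only
```

Ten inserts with sequential keys leave four regions behind. Note that every split happens in the last region, which hints at why monotonically increasing row keys tend to concentrate write load on one region.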
Potential use cases:
- Strongly consistent reads, supported by a single writer per region (each region is served by one RegionServer at a time)
- Ordered partitioning by row key, which supports row scans
- Range-based scans
- Batch analysis
- Large cache
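The ordered-partitioning and range-scan items above come down to one property: rows are kept sorted by row key, so a scan over a key range is a contiguous slice. The following Python sketch illustrates that idea on an in-memory list. It does not use the HBase client. The sensor row keys, the "#" separator, and the function names range_scan and prefix_scan are hypothetical. The inclusive start row and exclusive stop row mirror the default semantics of an HBase scan.

```python
from bisect import bisect_left

# Hypothetical row keys of the form b"<sensor>#<date>", kept sorted
# byte-wise, which is how HBase orders rows.
ROWS = sorted([
    (b"sensor1#2024-01-01", 20.5),
    (b"sensor1#2024-01-02", 21.0),
    (b"sensor2#2024-01-01", 18.2),
    (b"sensor2#2024-01-02", 18.9),
    (b"sensor3#2024-01-01", 25.1),
])


def range_scan(rows, start_row, stop_row):
    """Return rows with start_row <= key < stop_row (stop row exclusive)."""
    keys = [key for key, _ in rows]
    lo = bisect_left(keys, start_row)
    hi = bisect_left(keys, stop_row)
    return rows[lo:hi]


def prefix_scan(rows, prefix):
    """Return all rows whose key starts with prefix, expressed as a range scan."""
    # The stop row is the smallest key greater than every key with this prefix:
    # bump the last byte. Sketch only: assumes that byte is not 0xFF.
    stop_row = prefix[:-1] + bytes([prefix[-1] + 1])
    return range_scan(rows, prefix, stop_row)


sensor2 = prefix_scan(ROWS, b"sensor2#")
window = range_scan(ROWS, b"sensor1#2024-01-02", b"sensor2#2024-01-02")

print(sensor2)  # both sensor2 rows, in date order
print(window)   # sensor1 on 01-02 and sensor2 on 01-01
```

Because the sensor id leads the key, all readings for one sensor sit next to each other and come back in date order. This is why row-key design largely determines which scans are cheap.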
HBase lacks many features found in a relational database, such as triggers and secondary indexes.