Apache HBase
HBase is basically the open-source BigTable.
Essentially similar to how Hadoop was implemented as the open-source alternative to Google’s MapReduce.
- Part of the Hadoop Ecosystem, in other words!
- Backed by HDFS
- Hadoop Jobs can read from / write to HBase tables
HBase vs. HDFS?
HDFS is a distributed filesystem, whereas HBase is a NoSQL database.
- So it’s the difference between using a filesystem vs. database. You learned this in SE464, search “Why not just use filesystem”
Hint: You can have real-time random access with HBase.