Apache HBase

HBase is essentially an open-source implementation of Google’s Bigtable.

This mirrors how Hadoop was built as the open-source alternative to Google’s MapReduce.

  • Part of the Hadoop Ecosystem, in other words!
    • Backed by HDFS
    • Hadoop Jobs can read from / write to HBase tables

HBase vs. HDFS?

HDFS is a distributed filesystem, whereas HBase is a NoSQL database.

  • So it’s the difference between using a filesystem and using a database. You learned this in SE464; search for “Why not just use filesystem”.

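Hint: HBase gives you real-time (low-latency) random reads and writes by row key, whereas HDFS is designed for large sequential reads and append-only writes.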
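
To make the hint concrete, here is a toy sketch in Python. It is not the HBase or HDFS API: the class names AppendOnlyFile and SortedKeyStore are hypothetical, and both classes are in-memory stand-ins invented for illustration. The only point is the access pattern: finding one record in an append-only file means scanning it, while a store that keeps rows sorted by key can jump to the row directly.

```python
import bisect


class AppendOnlyFile:
    """Toy stand-in for a file on a filesystem like HDFS (hypothetical, in-memory).

    Records can only be appended; a lookup has to scan the whole file.
    """

    def __init__(self):
        self.records = []

    def append(self, key, value):
        self.records.append((key, value))

    def lookup(self, key):
        """Return (latest value for key, number of records touched)."""
        touched = 0
        result = None
        for k, v in self.records:
            touched += 1
            if k == key:
                # Keep scanning: a later append may supersede this value.
                result = v
        return result, touched


class SortedKeyStore:
    """Toy stand-in for a row-key-sorted table (hypothetical, in-memory)."""

    def __init__(self):
        self.keys = []    # row keys, kept in sorted order
        self.values = []  # values[i] belongs to keys[i]

    def put(self, key, value):
        i = bisect.bisect_left(self.keys, key)
        if i < len(self.keys) and self.keys[i] == key:
            self.values[i] = value  # overwrite the existing row in place
        else:
            self.keys.insert(i, key)
            self.values.insert(i, value)

    def get(self, key):
        # Binary search over the sorted keys: no full scan needed.
        i = bisect.bisect_left(self.keys, key)
        if i < len(self.keys) and self.keys[i] == key:
            return self.values[i]
        return None
```

With 1000 rows loaded into each, AppendOnlyFile.lookup touches all 1000 records to answer a single-key query, while SortedKeyStore.get finds the row with a binary search. Real HBase is far more involved than this sketch, but it relies on the same idea of keeping data sorted by row key.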