Offers real-time aggregations and easy for a beginner to learn to use this
Summary
We use Apache HBase for real-time data grouping, primarily appreciating its column family structure for real-time aggregations. However, we encounter performance issues when the number of regions increases, particularly under heavier loads.
Pros
"The most valuable part is the column family structure. "
Cons
"We've seen performance issues."
What is our primary use case?
We use it for real-time data grouping.
What is most valuable?
The most valuable part is the column family structure. We mainly use it for real-time aggregations. That's why we prefer it as a NoSQL database.
What needs improvement?
We've seen performance issues when we have more regions. The product needs improvement in that area.
So we experience performance issues sometimes when the load increases.
For how long have I used the solution?
It's one of our legacy systems. We've been using it for eight to nine years.
What do I think about the stability of the solution?
We've only seen issues when migrating from an older version to the latest one. Otherwise, it's good.
How are customer service and support?
We don't rely on Apache support. It's our own infrastructure, maintained by our Hadoop team.
How was the initial setup?
The initial deployment is easy.
We deploy both on the cloud and on-premises, depending on the customer.
We use AWS.
What's my experience with pricing, setup cost, and licensing?
The cost depends on the EC2 instances and the size of the data you're indexing.
What other advice do I have?
It's better to use AWS DynamoDB or Cassandra.
I would rate it an eight out of ten. It is easy for a beginner to learn.
*Disclosure: I am a real user, and this review is based on my own experience and opinions.