
Apache HBase vs MarkLogic comparison

 

Comparison Buyer's Guide

Executive Summary
Updated on Jan 7, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Apache HBase
Ranking in NoSQL Databases: 12th
Average Rating: 7.2
Reviews Sentiment: 5.1
Number of Reviews: 4
No ranking in other categories

MarkLogic
Ranking in NoSQL Databases: 8th
Average Rating: 8.4
Reviews Sentiment: 6.0
Number of Reviews: 14
No ranking in other categories
 

Mindshare comparison

As of July 2026, in the NoSQL Databases category, Apache HBase holds a mindshare of 5.1%, down from 5.8% the previous year. MarkLogic holds a mindshare of 2.8%, up from 1.6% the previous year. Mindshare is calculated from PeerSpot user engagement data.
NoSQL Databases Mindshare Distribution
Product         Mindshare (%)
MarkLogic       2.8%
Apache HBase    5.1%
Other           92.1%
 

Featured Reviews

Ephrem Sisay - PeerSpot reviewer
Senior Software Engineer at a computer software company with 501-1,000 employees
In-memory processing and integration capabilities have optimized query performance
Apache HBase could be improved by optimizing the integration with Apache Phoenix; sometimes the abstraction and lookup jobs lead to issues when there are too many requests. Resource optimization isn't always as successful as it should be, which can cause some query and lookup jobs to fail. For instance, during eligibility checks for credit, if there are many requests on the database, it might fail, and after such a failure, it doesn't allow us to run queries from the moment they stop. If there could be optimization to require less resource usage and allow those jobs and queries to pick up from where they stopped, that would be a great addition to the tool.
reviewer2812596 - PeerSpot reviewer
Senior Data Engineer at an insurance company with 10,001+ employees
Handling hierarchical insurance data has improved ETL workflows and still needs better integration
There are several things I have observed regarding MarkLogic's improvement areas. One challenge I notice is the learning curve and setup; it can be complex for someone new, especially when integrating with other systems or setting up indexing strategies for large datasets. I occasionally spend extra time fine-tuning indexes or query performance for really large documents. Another observation concerns tooling and ecosystem support, as it does not feel as rich as mainstream databases such as Hive or SQL servers in terms of connectors and integration or community resources. Sometimes I need to build custom scripts to bridge these gaps. Finally, monitoring and debugging distributed queries can be tricky; while it has built-in tools, deeper performance profiling or tracing is not always intuitive. Overall, these are not deal-breakers, but improvements in onboarding, ecosystem connectors, and monitoring would enhance the experience.

Quotes from Members


Pros

"Apache HBase is a database used for data storage."
"The best features of Apache HBase include being embedded, making it very fast; when it's linking, it operates with virtually no delay, and all of the queries are very fast too due to some internal optimization which makes it very sufficient and efficient."
"The in-memory processing lets us optimize our queries and helps us run concurrent queries and other jobs such as the lookup jobs we always use Apache HBase for."
"The most valuable part is the column family structure."
"We moved to MarkLogic and created the API using JavaScript server-side language, and we saw almost 60% improvement in the speed of the search."
"Overall, it reduced data transformation efforts, simplified architecture, and made it easier to build richer and more connected database models."
"We have definitely seen a return on investment, as the end customer really felt that things are going fast and they are able to get the data very fast, retrieve it, change it, transform it, and merge or discard many things they want to do in MarkLogic, saving three or four hours daily because we have enabled them to have the data in real time instead of waiting for another day."
"Using MarkLogic has had a significant positive impact on our organization, especially in terms of performance, flexibility, and reliability."
"MarkLogic's greatest asset is its strong engineering foundation. It was specifically designed with search capabilities in mind, and the developers placed a great emphasis on ensuring the quality of the indexing and all subsequent layers that were added."
"MarkLogic has had a tangible positive impact on our organization."
"The rules can show us if there are missing items, like titles, and we can add them in to ensure everything is filled and makes sense and there are no missing details."
"Compared to our previous approach, we are saving approximately two days per week, representing a forty percent reduction in time spent discussing or trying to understand data, and MarkLogic has delivered significant time savings."
 

Cons

"We've seen performance issues."
"The setup of Apache HBase needs a lot of time, and the linkage is not the program itself, but the activation and connecting to the NYPD engine always takes considerable time."
"Apache HBase could be improved by optimizing the integration with Apache Phoenix; sometimes the abstraction and lookup jobs lead to issues when there are too many requests."
"I don't like using Apache HBase to store huge amounts of data because of many performance issues."
"While MarkLogic itself is powerful, it can be improved in terms of ease of usage, cost, and the learning curve."
"Regarding improvement, I have identified a few areas. MarkLogic is quite powerful, but some areas need enhancement."
"I would say the features can be improved, as maybe the UI could be a little better."
"The learning curve for new users or developers on MarkLogic is a bit tough at first, but afterwards it is really easy to understand for developers new to MarkLogic."
"However, there is very limited documentation about MarkLogic's AI capabilities."
"When I started with MarkLogic, I found that its learning curve and developer experience are not very comfortable for beginners."
"Scalability-wise, I think it is acceptable. We have to add a lot of cost when we want to scale it because it is really hard to buy a new cluster."
"MarkLogic's scalability is very bad. In production, when you get to know that your data is increasing and you need to add one more node, that is not easy and not straightforward."
 

Pricing and Cost Advice

Apache HBase: Information not available
"MarkLogic is a pricey option, but there are some advantages to its pricing structure. For medium-sized clients or departments within larger companies, it is possible to obtain a license for one or two nodes for less than a hundred thousand dollars. Additionally, if you only need to deploy a single node, you can do so for under fifty thousand dollars. This is in contrast to other high-quality software options that are only accessible to larger businesses, where the starting price can be upwards of two hundred thousand dollars."
 

Top Industries

By visitors reading reviews

Apache HBase
Financial Services Firm: 19%
Comms Service Provider: 8%
Manufacturing Company: 7%
Construction Company: 7%

MarkLogic
Educational Organization: 27%
Transportation Company: 13%
Financial Services Firm: 10%
Program Development Consultancy: 8%
 

Company Size

By reviewers

Apache HBase
No data available

MarkLogic
Company Size          Count
Small Business        5
Midsize Enterprise    3
Large Enterprise      11
 

Questions from the Community

What needs improvement with Apache HBase?
Apache HBase could be improved by optimizing the integration with Apache Phoenix; sometimes the abstraction and lookup jobs lead to issues when there are too many requests. Resource optimization is...
What advice do you have for others considering Apache HBase?
I'm working for a corporate that uses Apache HBase for their Big Data platform and I'm a Big Data engineer there. We're using a version of Apache HBase that is compatible with the other Big Data to...
What is your experience regarding pricing and costs for Apache HBase?
The cost depends on the EC2 instances and the size of the data you're indexing.
What is your experience regarding pricing and costs for MarkLogic?
As far as I know, I am not directly associated with pricing, setup cost, and licensing, but MarkLogic is a bit more expensive than other technologies available. However, the support is really good,...
What needs improvement with MarkLogic?
MarkLogic can be improved by introducing a lot of languages to querying. Although I think it is self-sufficient if you know JavaScript and XQuery, for new people who want to onboard into MarkLogic,...
What is your primary use case for MarkLogic?
MarkLogic serves as our enterprise-level database with multiple applications. We have numerous source systems that dump data into MarkLogic. DHF flows manage transformation, harmonization, mapping,...
 

Also Known As

Apache HBase: HBase
MarkLogic: No data available
 

Sample Customers

Apache HBase: Bloomberg, Wells Fargo, Apple, Capital One, NVIDIA
MarkLogic: ALM, American Psychological Association, American Society of Agronomy, Condé Nast, Centers for Medicare and Medicaid Services, Institute of Engineering and Technology, JWG Group, Lagardère Active, RSuite CMS, Wiley
Find out what your peers are saying about Apache HBase vs. MarkLogic and other solutions. Updated: June 2026.
902,894 professionals have used our research since 2012.