No more typing reviews! Try our Samantha, our new voice AI agent.

Gremlin Reliability Management Platform vs VMware Aria Operations for Applications comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Gremlin Reliability Managem...
Ranking in Application Performance Monitoring (APM) and Observability
23rd
Ranking in IT Infrastructure Monitoring
25th
Average Rating
8.8
Reviews Sentiment
7.0
Number of Reviews
7
Ranking in other categories
DevSecOps (7th)
VMware Aria Operations for ...
Ranking in Application Performance Monitoring (APM) and Observability
16th
Ranking in IT Infrastructure Monitoring
16th
Average Rating
8.0
Reviews Sentiment
6.3
Number of Reviews
14
Ranking in other categories
Container Monitoring (6th), Cloud Monitoring Software (12th)
 

Mindshare comparison

As of May 2026, in the Application Performance Monitoring (APM) and Observability category, the mindshare of Gremlin Reliability Management Platform is 0.1%. The mindshare of VMware Aria Operations for Applications is 1.4%, up from 0.8% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Application Performance Monitoring (APM) and Observability Mindshare Distribution
ProductMindshare (%)
VMware Aria Operations for Applications1.4%
Gremlin Reliability Management Platform0.1%
Other98.5%
Application Performance Monitoring (APM) and Observability
 

Featured Reviews

VL
Senior Software Engineer at a sports company with 10,001+ employees
Chaos experiments have revealed weak points and now provide controlled cost-saving tests
The best features of Gremlin Reliability Management Platform are the safe failure injection, which is crucial as we can simulate the failures in a manner that we know these are just dumping tests and not the actual issues. Whether it is the CPU spike or the memory exhaustion, or the network latency, or the server shutdown, server shutdown is one of the most favorite features that I have in Gremlin Reliability Management Platform. The controlled blast radius is another standout feature. The controlled blast radius feature has helped my team in that we actually wanted to target only one specific container, our Docker containers that we deployed. It helped us to conduct tests in a very specific, isolated manner instead of launching a larger test or focusing on hundreds of servers at a time, resulting in very limited impact. Since ours is a very small team, we do not want to impact other servers. This controlled blast radius helped us to only focus on our servers and not impact any other team. Gremlin Reliability Management Platform has positively impacted my organization because before Gremlin Reliability Management Platform, we did not even know how to conduct these chaos engineering tests. We heard about it, but we had no idea of how to do something of that nature. If there are ten servers, ten systems in our architecture and if suddenly something goes down, nobody knew what would happen next. We did not even know how to simulate these types of tests. This lack of confidence has been mitigated by using Gremlin Reliability Management Platform. Now we can confidently test and see which system is the most critical. If this goes down, what happens? How much business valuation are we going to impact? How much loss are we going to incur? All of this is now clearly visible and transparent. Since using Gremlin Reliability Management Platform, we were able to reduce the incidents by six percent after conducting our limited experiments. We were also able to increase the uptime from ninety-eight to ninety-nine, which represents a one percent increase in uptime.
AS
Consultant at HCLTech
Automation and insightful diagnostics elevate operations while improvements are welcome
The new version 8.18 has brought significant improvements. The main purpose of using this tool is having a single console to manage all entities. The cloud integration capabilities, including private, public, and hybrid, are already well-implemented. The ability to manage multiple vCenters across different geographical locations is very effective. The new Skyline health feature and certificate and licensing management are particularly useful improvements. The dashboard's overview tab provides comprehensive information about current vCenter status, resizable amounts, reclamation opportunities, anomalies, diagnostics, licensing, and certificates. However, there is room for improvement in application analysis. The current functionality lacks detailed monitoring at the kernel level, focusing mainly on CPU, memory, and storage matrices. Enhanced application monitoring capabilities, including tracking downtime for specific types of applications such as transactional or e-commerce applications, would be beneficial.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Since using Gremlin Reliability Management Platform, we were able to reduce the incidents by six percent after conducting our limited experiments, and we were also able to increase the uptime from ninety-eight to ninety-nine, which represents a one percent increase in uptime."
"The Enterprise Reliability Platform has positively impacted my organization as it has significantly increased the efficiency and reliability of our systems."
"More than anything, we fix failures even before they occur, which is basically proactive risk detection and risk mitigation."
"Gremlin Reliability Management Platform is amazing with the reliability score, providing built-in Chaos Engineering experiments that you run on your service to receive a reliability score along with insights on the issues and risks present in your service that you can examine and work on."
"We are seeing a return on investment from using Gremlin Reliability Management Platform because we are getting less production issues by thirty percent, as I mentioned earlier, making it a great investment."
"Gremlin Reliability Management Platform has impacted my organization positively as it helped a lot and reduced our failures, allowing us to find critical pinpoints in our application that had existed for three to ten months and led to too many improvements, reduced downtime, and a smoother experience for our application on AWS."
"Using Gremlin Reliability Management Platform has raised more than fifty percent of the reliability of the infrastructure."
"We haven't had any issues with stability."
"I gladly recommend VMware Aria Operations for Applications because it is cost effective and has commendable AI response capabilities."
"The solution is great for virtualization and preparing the infrastructure in Tanzu to test products, it's very fast and has good visibility."
"VMware comes with a support team, and if you have trouble, you can easily create a ticket, and VMware will help you. Therefore, the best aspect is the support."
"This solution will give you a single pane of glass for everything, and centralized monitoring."
"The solution provides single-console management for multiple VMware environments and cloud integrations, whether hybrid, private, or public cloud."
"The solution is great for virtualization and preparing the infrastructure in Tanzu to test products. It's very fast and has good visibility."
"Scalability for VMware Aria Operations for Applications is easy to implement and manage."
 

Cons

"If you really look at the cost-benefit visibility, it is not very evident by using Gremlin Reliability Management Platform."
"I think that it will be important to have resources to perform self-directed studies on Gremlin Reliability Management Platform as an improvement."
"I think Gremlin Reliability Management Platform can be improved by integrating with more AWS services or GCP services."
"Gremlin Reliability Management Platform can be improved as the pricing is a bit expensive and the learning curve for beginners is a bit difficult."
"The implementation is a long process that should be improved."
"The documentation and integration with Kubernetes could be improved."
"In the new version, I would love to see more prediction capabilities. It would be great if one could see the alerts get a little more enriched with information and become more human-friendly instead of the technical stuff that they put in there. I think those would be really awesome outcomes to get."
"Its billing model is consumption-based. I understand the consumption-based model, but it is not necessarily easy to estimate and guess how many points or how much we are going to consume on a specific application up until we get to that point."
"The documentation and integration with Kubernetes could be improved."
"An area for improvement would be more in-depth reporting, such as analyzing disk IO during business hours compared to after hours."
"They could make it more easy to plug-in data so that a nontechnical person will be able to use it, like accountants or finance people. That way they don't have to ask us."
"I find that there could be improvements in the support service response time, as the urgency varies unless specified as Priority 2 or 1 cases."
 

Pricing and Cost Advice

Information not available
"Different locations require different setups. In your terms, around 300 to around 400K USD."
"The licensing costs are very high, particularly when you consider that we have to purchase a level 1 license for every integration, such as the load balancer, HAProxy, and the MSSP. And if you want to use vSAN, that's another license. Then, of course, Tanzu Observability has its own separate license."
"I would rate the pricing as three out of five."
"I don't have the details. In our case, there is a mixture in place. We have production usage, and we are also doing training for VMware. So, we also have a training instance. It is worth the money you would spend on it. That's because if you were to build all of this yourself by using some of the open source tools, then you would need a lot of time."
report
Use our free recommendation engine to learn which Application Performance Monitoring (APM) and Observability solutions are best for your needs.
894,738 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Printing Company
11%
Construction Company
11%
Financial Services Firm
10%
Sports Company
10%
Financial Services Firm
12%
Manufacturing Company
11%
Government
7%
Computer Software Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business3
Large Enterprise6
By reviewers
Company SizeCount
Small Business4
Midsize Enterprise1
Large Enterprise10
 

Questions from the Community

What needs improvement with Gremlin Reliability Management Platform?
There are certain areas where I think Gremlin Reliability Management Platform can improve. I would certainly add features related to AI and GenAI for recommendations. While dependency identificatio...
What is your primary use case for Gremlin Reliability Management Platform?
The primary reason I am using Gremlin Reliability Management Platform is to proactively test failures, identify weaknesses in my system, and fix them before real incidents actually occur. From a pr...
What advice do you have for others considering Gremlin Reliability Management Platform?
I would certainly suggest others venture into Gremlin Reliability Management Platform, as there is no second thought about it. However, I would not recommend jumping straight into production chaos....
What is your experience regarding pricing and costs for VMware Tanzu Observability by Wavefront?
The solution includes cost drivers that can be configured according to your environment. You can input server-wise, CPU-wise, storage, and memory costs per hour or day. Once cost drivers are provid...
What needs improvement with VMware Tanzu Observability by Wavefront?
The new version 8.18 has brought significant improvements. The main purpose of using this tool is having a single console to manage all entities. The cloud integration capabilities, including priva...
What is your primary use case for VMware Tanzu Observability by Wavefront?
I have been involved in several projects where I had to implement automation and monitoring alerting. I performed automation using VMware Aria Operations for Applications. There is an automation ta...
 

Also Known As

No data available
Tanzu Observability, Wavefront, Wavefront by VMware, VMware Tanzu Observability
 

Overview

 

Sample Customers

Information Not Available
1. Atlassian 2. Cisco 3. Databricks 4. DigitalOcean 5. Equinix 6. Fidelity Investments 7. Google 8. Hewlett Packard Enterprise 9. Honeywell 10. IBM 11. Intel 12. JetBlue Airways 13. LinkedIn 14. Lyft 15. Mastercard 16. Microsoft 17. MongoDB 18. Netflix 19. Nvidia 20. Oracle 21. PayPal 22. Pinterest 23. Qualcomm 24. Red Hat 25. Salesforce 26. SAP 27. Spotify 28. Square 29. TMobile 30. Twitter 31. Uber 32. VMware
Find out what your peers are saying about Gremlin Reliability Management Platform vs. VMware Aria Operations for Applications and other solutions. Updated: April 2026.
894,738 professionals have used our research since 2012.