No more typing reviews! Try our Samantha, our new voice AI agent.

Gremlin Reliability Management Platform vs ServiceNow Cloud Observability comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Gremlin Reliability Managem...
Ranking in Application Performance Monitoring (APM) and Observability
25th
Average Rating
8.6
Reviews Sentiment
7.0
Number of Reviews
8
Ranking in other categories
IT Infrastructure Monitoring (23rd), DevSecOps (8th)
ServiceNow Cloud Observability
Ranking in Application Performance Monitoring (APM) and Observability
42nd
Average Rating
7.6
Reviews Sentiment
6.9
Number of Reviews
5
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of July 2026, in the Application Performance Monitoring (APM) and Observability category, the mindshare of Gremlin Reliability Management Platform is 0.2%. The mindshare of ServiceNow Cloud Observability is 0.7%, up from 0.4% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Application Performance Monitoring (APM) and Observability Mindshare Distribution
ProductMindshare (%)
Gremlin Reliability Management Platform0.2%
ServiceNow Cloud Observability0.7%
Other99.1%
Application Performance Monitoring (APM) and Observability
 

Featured Reviews

VL
Senior Software Engineer at a sports company with 10,001+ employees
Chaos experiments have revealed weak points and now provide controlled cost-saving tests
The best features of Gremlin Reliability Management Platform are the safe failure injection, which is crucial as we can simulate the failures in a manner that we know these are just dumping tests and not the actual issues. Whether it is the CPU spike or the memory exhaustion, or the network latency, or the server shutdown, server shutdown is one of the most favorite features that I have in Gremlin Reliability Management Platform. The controlled blast radius is another standout feature. The controlled blast radius feature has helped my team in that we actually wanted to target only one specific container, our Docker containers that we deployed. It helped us to conduct tests in a very specific, isolated manner instead of launching a larger test or focusing on hundreds of servers at a time, resulting in very limited impact. Since ours is a very small team, we do not want to impact other servers. This controlled blast radius helped us to only focus on our servers and not impact any other team. Gremlin Reliability Management Platform has positively impacted my organization because before Gremlin Reliability Management Platform, we did not even know how to conduct these chaos engineering tests. We heard about it, but we had no idea of how to do something of that nature. If there are ten servers, ten systems in our architecture and if suddenly something goes down, nobody knew what would happen next. We did not even know how to simulate these types of tests. This lack of confidence has been mitigated by using Gremlin Reliability Management Platform. Now we can confidently test and see which system is the most critical. If this goes down, what happens? How much business valuation are we going to impact? How much loss are we going to incur? All of this is now clearly visible and transparent. Since using Gremlin Reliability Management Platform, we were able to reduce the incidents by six percent after conducting our limited experiments. We were also able to increase the uptime from ninety-eight to ninety-nine, which represents a one percent increase in uptime.
Uday-Thentu - PeerSpot reviewer
Project Manager at a tech vendor with 10,001+ employees
Automation has reduced incident impact and is driving proactive self‑healing across hybrid services
ServiceNow Cloud Observability's valuable features include AI agents that can be integrated and triggered with workflows for evaluating and taking the right action in both process and technical aspects. On the process side, it ensures the right approvals are triggered. From the technical aspect, it allows integration for taking relevant action without human intervention, moving towards self-healing. The main benefit of ServiceNow Cloud Observability is real-time data, which helps us move from being reactive to more proactive. By defining and fine-tuning workflows, we were able to implement timely notification and auto-action or auto-remediations. At the same time, if anything involves additional cost or requires intervention, it helps trigger approval notifications, and once approval is received, it timely takes the action. The AI agents help define and fine-tune those workflows further.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Gremlin Reliability Management Platform has positively impacted our organization by making outages less frequent and improving recovery time significantly, resulting in fewer complaints on the customer success side and overall optimization of our DevOps process."
"The Enterprise Reliability Platform has positively impacted my organization as it has significantly increased the efficiency and reliability of our systems."
"Gremlin Reliability Management Platform has impacted my organization positively as it helped a lot and reduced our failures, allowing us to find critical pinpoints in our application that had existed for three to ten months and led to too many improvements, reduced downtime, and a smoother experience for our application on AWS."
"More than anything, we fix failures even before they occur, which is basically proactive risk detection and risk mitigation."
"Using Gremlin Reliability Management Platform has raised more than fifty percent of the reliability of the infrastructure."
"Since using Gremlin Reliability Management Platform, we were able to reduce the incidents by six percent after conducting our limited experiments, and we were also able to increase the uptime from ninety-eight to ninety-nine, which represents a one percent increase in uptime."
"We are seeing a return on investment from using Gremlin Reliability Management Platform because we are getting less production issues by thirty percent, as I mentioned earlier, making it a great investment."
"Gremlin Reliability Management Platform is amazing with the reliability score, providing built-in Chaos Engineering experiments that you run on your service to receive a reliability score along with insights on the issues and risks present in your service that you can examine and work on."
"The solution Lightstep/ServiceNow has a couple of pretty advanced functionalities to help us investigate a deviation and help the development teams have better observability in the environment using distributed and complex services."
"To a certain extent, it is possible to save on the costs of the product."
"The main benefit of ServiceNow Cloud Observability is real-time data, which helps us move from being reactive to more proactive."
"The ability to create a stream based on different parameters, operation name, service name, URL, tags, and URI part, is one valuable feature."
"The UI is very intuitive."
 

Cons

"Gremlin Reliability Management Platform can be improved as the pricing is a bit expensive and the learning curve for beginners is a bit difficult."
"I think that it will be important to have resources to perform self-directed studies on Gremlin Reliability Management Platform as an improvement."
"I think Gremlin Reliability Management Platform can be improved by integrating with more AWS services or GCP services."
"I rate it an eight because we are still using it on a trial and error basis, and the pricing could be optimized for better cost visibility and ROI tracking."
"If you really look at the cost-benefit visibility, it is not very evident by using Gremlin Reliability Management Platform."
"The dashboard and graphics must be improved."
"The support team could be better. Because of the different versions of different tactics of integrating reactive code base, the documentation is not very clear if someone has to be onboard. I would rate the documentation of Lightstep a five out of ten. It could need improvement."
"The design of this solution is not very intuitive and probably could come with more friendly tips for beginners."
"In terms of licensing, users would want the product to offer them the ability to tailor the tasks offered in the solution to suit their needs."
"The cost could be reduced or lightweight agents and lightweight modules could be introduced, which would make it much easier to implement."
 

Pricing and Cost Advice

Information not available
"The product is expensive. I rate the tool's pricing model an eight out of ten."
report
Use our free recommendation engine to learn which Application Performance Monitoring (APM) and Observability solutions are best for your needs.
902,988 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Construction Company
13%
Printing Company
10%
Financial Services Firm
9%
Sports Company
9%
Financial Services Firm
12%
Manufacturing Company
11%
Construction Company
10%
Retailer
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business3
Large Enterprise7
No data available
 

Questions from the Community

What needs improvement with Gremlin Reliability Management Platform?
While I have no complaints about Gremlin Reliability Management Platform, I believe the UI can be improved to enhance the developer experience for security engineers and DevOps engineers. Additiona...
What is your primary use case for Gremlin Reliability Management Platform?
My main use case for Gremlin Reliability Management Platform is to see how our applications behave under extreme stress and how resilient our application is when a simulation of server crash alongs...
What advice do you have for others considering Gremlin Reliability Management Platform?
For others considering Gremlin Reliability Management Platform, it is an excellent tool for organizations facing downtime issues, as it allows for chaos testing without needing to check logs and me...
What needs improvement with LightStep?
The cost could be reduced or lightweight agents and lightweight modules could be introduced, which would make it much easier to implement.
What is your primary use case for LightStep?
ServiceNow Cloud Observability's main use case is end-to-end automation starting from the trigger in the cloud. Currently, it focuses more on self-healing and understanding the complete uptime of e...
What advice do you have for others considering LightStep?
Regarding security, I unfortunately do not have much hands-on experience. My focus has been majorly on self-healing activities, but given what I have observed, it does provide a good opportunity in...
 

Also Known As

No data available
LightStep
 

Overview

 

Sample Customers

Information Not Available
InVision, Twilio, Lyft, Yext, DigitalOcean,
Find out what your peers are saying about Gremlin Reliability Management Platform vs. ServiceNow Cloud Observability and other solutions. Updated: June 2026.
902,988 professionals have used our research since 2012.