No more typing reviews! Try our Samantha, our new voice AI agent.

Gremlin Reliability Management Platform vs Stackify comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Gremlin Reliability Managem...
Ranking in Application Performance Monitoring (APM) and Observability
54th
Ranking in IT Infrastructure Monitoring
53rd
Average Rating
8.4
Reviews Sentiment
7.5
Number of Reviews
5
Ranking in other categories
DevSecOps (15th)
Stackify
Ranking in Application Performance Monitoring (APM) and Observability
63rd
Ranking in IT Infrastructure Monitoring
63rd
Average Rating
7.8
Number of Reviews
6
Ranking in other categories
Log Management (58th)
 

Mindshare comparison

As of March 2026, in the Application Performance Monitoring (APM) and Observability category, the mindshare of Gremlin Reliability Management Platform is 0.1%. The mindshare of Stackify is 0.6%, up from 0.2% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Application Performance Monitoring (APM) and Observability Mindshare Distribution
ProductMindshare (%)
Gremlin Reliability Management Platform0.1%
Stackify0.6%
Other99.3%
Application Performance Monitoring (APM) and Observability
 

Featured Reviews

VL
Senior Software Engineer at a sports company with 10,001+ employees
Chaos experiments have revealed weak points and now provide controlled cost-saving tests
The best features of Gremlin Reliability Management Platform are the safe failure injection, which is crucial as we can simulate the failures in a manner that we know these are just dumping tests and not the actual issues. Whether it is the CPU spike or the memory exhaustion, or the network latency, or the server shutdown, server shutdown is one of the most favorite features that I have in Gremlin Reliability Management Platform. The controlled blast radius is another standout feature. The controlled blast radius feature has helped my team in that we actually wanted to target only one specific container, our Docker containers that we deployed. It helped us to conduct tests in a very specific, isolated manner instead of launching a larger test or focusing on hundreds of servers at a time, resulting in very limited impact. Since ours is a very small team, we do not want to impact other servers. This controlled blast radius helped us to only focus on our servers and not impact any other team. Gremlin Reliability Management Platform has positively impacted my organization because before Gremlin Reliability Management Platform, we did not even know how to conduct these chaos engineering tests. We heard about it, but we had no idea of how to do something of that nature. If there are ten servers, ten systems in our architecture and if suddenly something goes down, nobody knew what would happen next. We did not even know how to simulate these types of tests. This lack of confidence has been mitigated by using Gremlin Reliability Management Platform. Now we can confidently test and see which system is the most critical. If this goes down, what happens? How much business valuation are we going to impact? How much loss are we going to incur? All of this is now clearly visible and transparent. Since using Gremlin Reliability Management Platform, we were able to reduce the incidents by six percent after conducting our limited experiments. We were also able to increase the uptime from ninety-eight to ninety-nine, which represents a one percent increase in uptime.
IE
Senior Software Engineer at a tech services company with 1,001-5,000 employees
Has good filtering and rating features and helps with resource and load management
I've not used Stackify for a while, and I'm currently using a solution now that's not as good as Stackify. Among the solutions I've been using so far, Stackify has been one of the best for me, but there's always room for improvement. For example, I don't know if it's just me, but when I try to get the log from Stackify, sometimes it doesn't appear in real-time. It takes a few minutes before the logs appear. When I redeploy my solution and the application starts, I don't see the logs immediately, and it would take two to three minutes before I see the logs. I don't know if other customers have a similar experience. It's the wait time for the logs to appear that's a concern for me, could be improved, and is what the Stackify team should be looking into. In terms of any additional feature that I'd like added to the solution, I'm not sure if Stackify has a way to export logs out. I've been trying to do it. On the solution, you can click on a spiral-like icon and it shows you the entire error, and I'd prefer an export button that would let me download the error and save that into a text file, for example, so it'll be available on my local machine for me to reference it, especially because the log keeps going and as you're using the solution, the system keeps pushing messages on to Stackify, so if I'm looking at a particular error at 12:05 PM, for example, by the time I go back to my system and would like to revisit the error at 12:25 PM, on Stackify, the logs would have gone past that level and I won't see it again which makes it difficult. When you now go back to that timestamp, you don't tend to see it immediately, but if the solution had an export feature for me to save that particular error information on my local machine for reference at a later time, I won't have to go back to Stackify. I just go to that log, specifically to that particular export that I've received on my local machine. I can get it and review it, and it would be easier that way versus me going back to Stackify to find that particular error and request that particular information.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Gremlin Reliability Management Platform is amazing with the reliability score, providing built-in Chaos Engineering experiments that you run on your service to receive a reliability score along with insights on the issues and risks present in your service that you can examine and work on."
"We are seeing a return on investment from using Gremlin Reliability Management Platform because we are getting less production issues by thirty percent, as I mentioned earlier, making it a great investment."
"Since using Gremlin Reliability Management Platform, we were able to reduce the incidents by six percent after conducting our limited experiments, and we were also able to increase the uptime from ninety-eight to ninety-nine, which represents a one percent increase in uptime."
"The Enterprise Reliability Platform has positively impacted my organization as it has significantly increased the efficiency and reliability of our systems."
"Using Gremlin Reliability Management Platform has raised more than fifty percent of the reliability of the infrastructure."
"The filter feature on Stackify is one of the features I found valuable. It's awesome. When I want to get the application logs, the solution gives me many filters. For example, if I want to get logs from my test environment, the option is there for me to select the environment from Stackify, and you can also select the particular application, and you'll see the information you need there. The filter feature alone and the fact that Stackify offers a lot of different filters is what I like the most about the solution because I've used other tools with the filter feature, but the filtering was very difficult, versus Stackify that has good filtering. On Stackify, you can filter the information by the last one hour, or the last four hours, and you can also select the date range and specify the timestamp, then the solution will give you the information based on the date range you specified. Another feature I found valuable on Stackify is its rating feature because it tells you how your application is faring. For example, a rating of A means excellent, while a rating of F means very bad, or that your application is not doing well at all. The ratings are from A to F. I also like that Stackify helps you in terms of load management because the solution gives you information on overutilized resources. These are the most valuable features of the solution."
"It is very simple and very easy to configure."
"The solution is stable and reliable."
"What stood out to us were the metrics and granular details we received."
"The performance dashboard and the accurate level of details are beneficial."
"We switched from New Relic and Loggly as it provides us more info at a lower price."
"Within few hours of install we've identify the source of issue we've been investigating for few days and couldn't pin point."
"The deployment is very fast."
 

Cons

"Gremlin Reliability Management Platform can be improved as the pricing is a bit expensive and the learning curve for beginners is a bit difficult."
"I think that it will be important to have resources to perform self-directed studies on Gremlin Reliability Management Platform as an improvement."
"I think Gremlin Reliability Management Platform can be improved by integrating with more AWS services or GCP services."
"I'm looking to see more performance tools, but heard that they are going to release some."
"The search feature could be improved."
"One thing that happens as a new user on Stackify is when you install the agent it pulls everything and if you're not careful, your log allowance will just be exhausted as you are actually pulling too much data."
"Better mobile support."
"It's not easy to set up. It's hard especially for juniors to understand."
"I would like to be able to see metrics about individual running containers on the host machines."
"When I redeploy my solution and the application starts, I don't see the logs immediately, and it would take two to three minutes before I see the logs."
"It should be easily scalable and configurable in different instances."
 

Pricing and Cost Advice

Information not available
"The price is variable. It depends on how much data we have received in that particular month. Usually, it goes up to $2,000, or, at times, $3,000 USD per month."
report
Use our free recommendation engine to learn which Application Performance Monitoring (APM) and Observability solutions are best for your needs.
885,376 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Sports Company
12%
Printing Company
12%
Construction Company
9%
Media Company
6%
Construction Company
15%
Comms Service Provider
13%
Media Company
9%
Performing Arts
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
By reviewers
Company SizeCount
Small Business3
Midsize Enterprise2
Large Enterprise2
 

Questions from the Community

What needs improvement with Gremlin Reliability Management Platform?
I think that it will be important to have resources to perform self-directed studies on Gremlin Reliability Management Platform as an improvement. There is a small and fast and simple certification...
What is your primary use case for Gremlin Reliability Management Platform?
My main use case for Gremlin Reliability Management Platform is chaos testing. I take my infrastructure and then I sabotage some things to see how they reach the goal. I try network or infrastructu...
What advice do you have for others considering Gremlin Reliability Management Platform?
The main advice I would give to others looking into using Gremlin Reliability Management Platform would be to study it. Do not be shy to fail. Test everything and do lab architectures to test. It i...
Ask a question
Earn 20 points
 

Comparisons

No data available
 

Overview

 

Sample Customers

Information Not Available
MyRacePass, ClearSale, Newitts, Carbonite, Boston Software, Children's International, Starkwood Media Group, Fewzion
Find out what your peers are saying about Gremlin Reliability Management Platform vs. Stackify and other solutions. Updated: March 2026.
885,376 professionals have used our research since 2012.