No more typing reviews! Try our Samantha, our new voice AI agent.

Apache SkyWalking vs Gremlin Reliability Management Platform comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Apache SkyWalking
Ranking in Application Performance Monitoring (APM) and Observability
32nd
Average Rating
8.2
Reviews Sentiment
3.9
Number of Reviews
4
Ranking in other categories
No ranking in other categories
Gremlin Reliability Managem...
Ranking in Application Performance Monitoring (APM) and Observability
25th
Average Rating
8.6
Reviews Sentiment
7.0
Number of Reviews
8
Ranking in other categories
IT Infrastructure Monitoring (23rd), DevSecOps (8th)
 

Mindshare comparison

As of July 2026, in the Application Performance Monitoring (APM) and Observability category, the mindshare of Apache SkyWalking is 0.6%, down from 0.7% compared to the previous year. The mindshare of Gremlin Reliability Management Platform is 0.2%. It is calculated based on PeerSpot user engagement data.
Application Performance Monitoring (APM) and Observability Mindshare Distribution
ProductMindshare (%)
Gremlin Reliability Management Platform0.2%
Apache SkyWalking0.6%
Other99.2%
Application Performance Monitoring (APM) and Observability
 

Featured Reviews

reviewer2784462 - PeerSpot reviewer
Software Engineer at a tech vendor with 10,001+ employees
Tracing has revealed hybrid bottlenecks and delivers full visibility into critical payment flows
Apache SkyWalking provided full visibility into the black hole because before using it, we could not see what was happening when a request left Amazon EKS and went to our on-premises legacy databases. Apache SkyWalking's distributed tracing correlates these two worlds in a single view, showing us that 40% of the latency was actually happening in the network hop between the cloud and the physical data center, not in the code itself. Second, it exposes hidden architectural flaws. By using the automatic dependency mapping, we discovered that some microservices were stuck in a cyclic dependency which was documented nowhere. This visual evidence allowed us to refactor the logic and immediately increased our throughput by 30%. Apache SkyWalking gave us database-level insight without database access. Through its slow query monitoring, the Java agents captured the exact SQL statements that were hanging during peak sales hours. This meant our developers could fix the exact line of code or index without needing to wait for a DBA to pull logs, reducing our mean time to resolution. There are many features that are useful to mention in this case because we obtained different benefits. Apache SkyWalking automatically drew the topology of the 600 pods where we discovered cyclic dependencies between services that no one had documented before and that were slowing down the system. Another valuable feature is resolving hybrid bottlenecks because we isolated a specific network issue between AWS and the physical data center. Without distributed tracing, infrastructure teams blame Java code and vice versa. Database tuning is also important because thanks to slow query metrics captured by the agent, we identified and rewrote the SQL queries that most impacted performance during sales peaks.
VL
Senior Software Engineer at a sports company with 10,001+ employees
Chaos experiments have revealed weak points and now provide controlled cost-saving tests
The best features of Gremlin Reliability Management Platform are the safe failure injection, which is crucial as we can simulate the failures in a manner that we know these are just dumping tests and not the actual issues. Whether it is the CPU spike or the memory exhaustion, or the network latency, or the server shutdown, server shutdown is one of the most favorite features that I have in Gremlin Reliability Management Platform. The controlled blast radius is another standout feature. The controlled blast radius feature has helped my team in that we actually wanted to target only one specific container, our Docker containers that we deployed. It helped us to conduct tests in a very specific, isolated manner instead of launching a larger test or focusing on hundreds of servers at a time, resulting in very limited impact. Since ours is a very small team, we do not want to impact other servers. This controlled blast radius helped us to only focus on our servers and not impact any other team. Gremlin Reliability Management Platform has positively impacted my organization because before Gremlin Reliability Management Platform, we did not even know how to conduct these chaos engineering tests. We heard about it, but we had no idea of how to do something of that nature. If there are ten servers, ten systems in our architecture and if suddenly something goes down, nobody knew what would happen next. We did not even know how to simulate these types of tests. This lack of confidence has been mitigated by using Gremlin Reliability Management Platform. Now we can confidently test and see which system is the most critical. If this goes down, what happens? How much business valuation are we going to impact? How much loss are we going to incur? All of this is now clearly visible and transparent. Since using Gremlin Reliability Management Platform, we were able to reduce the incidents by six percent after conducting our limited experiments. We were also able to increase the uptime from ninety-eight to ninety-nine, which represents a one percent increase in uptime.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Using Apache SkyWalking has had a positive impact on my organization because it has enabled us to identify the causes of various problems more quickly."
"Apache SkyWalking has significantly improved application visibility and reduced troubleshooting times while enhancing security reliability."
"Apache SkyWalking has positively impacted my organization by reducing the time of the team so that they can put in more efforts into their other tasks, saving a lot of time, improving our SLA in resolving any issue, providing good RCA analysis to the leadership team, and helping us in monitoring the entire health in a shorter time span."
"Apache SkyWalking is a very nice tool and an exceptional tool for managing volume and complex architecture on AWS without the prohibitive cost of commercial suites."
"The Enterprise Reliability Platform has positively impacted my organization as it has significantly increased the efficiency and reliability of our systems."
"Since using Gremlin Reliability Management Platform, we were able to reduce the incidents by six percent after conducting our limited experiments, and we were also able to increase the uptime from ninety-eight to ninety-nine, which represents a one percent increase in uptime."
"Gremlin Reliability Management Platform has impacted my organization positively as it helped a lot and reduced our failures, allowing us to find critical pinpoints in our application that had existed for three to ten months and led to too many improvements, reduced downtime, and a smoother experience for our application on AWS."
"Gremlin Reliability Management Platform is amazing with the reliability score, providing built-in Chaos Engineering experiments that you run on your service to receive a reliability score along with insights on the issues and risks present in your service that you can examine and work on."
"More than anything, we fix failures even before they occur, which is basically proactive risk detection and risk mitigation."
"Using Gremlin Reliability Management Platform has raised more than fifty percent of the reliability of the infrastructure."
"We are seeing a return on investment from using Gremlin Reliability Management Platform because we are getting less production issues by thirty percent, as I mentioned earlier, making it a great investment."
"Gremlin Reliability Management Platform has positively impacted our organization by making outages less frequent and improving recovery time significantly, resulting in fewer complaints on the customer success side and overall optimization of our DevOps process."
 

Cons

"Areas for improvement include simplified initial deployment and configurations, better documentation for advanced use cases, and more built-in dashboards and reports."
"Apache SkyWalking can be improved by responding more quickly to new versions of monitored products."
"Apache SkyWalking can be improved with storage management complexity because with this volume of 50 million traces a day, managing data retention on OpenSearch is critical."
"Apache SkyWalking can be improved by enhancing a few things. The learning curve is definitely there, so it needs a good learning curve."
"I think that it will be important to have resources to perform self-directed studies on Gremlin Reliability Management Platform as an improvement."
"I think Gremlin Reliability Management Platform can be improved by integrating with more AWS services or GCP services."
"If you really look at the cost-benefit visibility, it is not very evident by using Gremlin Reliability Management Platform."
"Gremlin Reliability Management Platform can be improved as the pricing is a bit expensive and the learning curve for beginners is a bit difficult."
"I rate it an eight because we are still using it on a trial and error basis, and the pricing could be optimized for better cost visibility and ROI tracking."
report
Use our free recommendation engine to learn which Application Performance Monitoring (APM) and Observability solutions are best for your needs.
902,988 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
18%
Computer Software Company
15%
Manufacturing Company
15%
Retailer
8%
Construction Company
13%
Printing Company
10%
Financial Services Firm
9%
Sports Company
9%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business2
Midsize Enterprise1
Large Enterprise4
By reviewers
Company SizeCount
Small Business3
Large Enterprise7
 

Questions from the Community

What is your experience regarding pricing and costs for Apache SkyWalking?
Our experience with pricing, setup cost, and licensing for Apache SkyWalking is positive since it is free, which was the reason for our decision to use it.
What needs improvement with Apache SkyWalking?
Apache SkyWalking can be improved by enhancing a few things. The learning curve is definitely there, so it needs a good learning curve. Your engineers or experts need to be pretty much handy and so...
What is your primary use case for Apache SkyWalking?
My main use case for Apache SkyWalking includes not only monitoring microservices and APIs but also managing the entire health of the application. I will explain the domains and backgrounds where w...
What needs improvement with Gremlin Reliability Management Platform?
While I have no complaints about Gremlin Reliability Management Platform, I believe the UI can be improved to enhance the developer experience for security engineers and DevOps engineers. Additiona...
What is your primary use case for Gremlin Reliability Management Platform?
My main use case for Gremlin Reliability Management Platform is to see how our applications behave under extreme stress and how resilient our application is when a simulation of server crash alongs...
What advice do you have for others considering Gremlin Reliability Management Platform?
For others considering Gremlin Reliability Management Platform, it is an excellent tool for organizations facing downtime issues, as it allows for chaos testing without needing to check logs and me...
 

Comparisons

No data available
 

Overview

 

Sample Customers

1. Alibaba 2. Amazon 3. Apple 4. Baidu 5. ByteDance 6. Cisco 7. Dell 8. Google 9. Huawei 10. IBM 11. Intel 12. JPMorgan Chase 13. Klarna 14. LinkedIn 15. Microsoft 16. Netflix 17. Oracle 18. PayPal 19. Pinterest 20. Qualcomm 21. SAP 22. Samsung 23. Spotify 24. Tencent 25. Twitter 26. Uber 27. VMware 28. WeChat 29. Xiaomi 30. Zoom
Information Not Available
Find out what your peers are saying about Apache SkyWalking vs. Gremlin Reliability Management Platform and other solutions. Updated: June 2026.
902,988 professionals have used our research since 2012.