
Gremlin Reliability Management Platform vs Insights Hub comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Gremlin Reliability Managem...
Average Rating: 8.8
Reviews Sentiment: 7.0
Number of Reviews: 7
Ranking in other categories: Application Performance Monitoring (APM) and Observability (28th), IT Infrastructure Monitoring (30th), DevSecOps (8th)
Insights Hub
Average Rating: 8.0
Number of Reviews: 1
Ranking in other categories: AWS Marketplace (1st)
 

Mindshare comparison

Gremlin Reliability Management Platform and Insights Hub aren’t in the same category and serve different purposes. Gremlin Reliability Management Platform is designed for Application Performance Monitoring (APM) and Observability and holds a mindshare of 0.1%.
Insights Hub, on the other hand, focuses on AWS Marketplace, holds 0.2% mindshare, down 6.6% since last year.
Application Performance Monitoring (APM) and Observability Mindshare Distribution
Product | Mindshare (%)
Gremlin Reliability Management Platform | 0.1%
Dynatrace | 5.6%
Datadog | 4.9%
Other | 89.4%
AWS Marketplace Mindshare Distribution
Product | Mindshare (%)
Insights Hub | 0.2%
WaitTime Gate Queue | 0.5%
HZWTech Device Studio | 0.5%
Other | 98.8%
 

Featured Reviews

VL
Senior Software Engineer at a sports company with 10,001+ employees
Chaos experiments have revealed weak points and now provide controlled cost-saving tests
The best features of Gremlin Reliability Management Platform are the safe failure injection, which is crucial as we can simulate the failures in a manner that we know these are just dumping tests and not the actual issues. Whether it is the CPU spike or the memory exhaustion, or the network latency, or the server shutdown, server shutdown is one of the most favorite features that I have in Gremlin Reliability Management Platform.
The controlled blast radius is another standout feature. The controlled blast radius feature has helped my team in that we actually wanted to target only one specific container, our Docker containers that we deployed. It helped us to conduct tests in a very specific, isolated manner instead of launching a larger test or focusing on hundreds of servers at a time, resulting in very limited impact. Since ours is a very small team, we do not want to impact other servers. This controlled blast radius helped us to only focus on our servers and not impact any other team.
Gremlin Reliability Management Platform has positively impacted my organization because before Gremlin Reliability Management Platform, we did not even know how to conduct these chaos engineering tests. We heard about it, but we had no idea of how to do something of that nature. If there are ten servers, ten systems in our architecture and if suddenly something goes down, nobody knew what would happen next. We did not even know how to simulate these types of tests. This lack of confidence has been mitigated by using Gremlin Reliability Management Platform. Now we can confidently test and see which system is the most critical. If this goes down, what happens? How much business valuation are we going to impact? How much loss are we going to incur? All of this is now clearly visible and transparent.
Since using Gremlin Reliability Management Platform, we were able to reduce the incidents by six percent after conducting our limited experiments. We were also able to increase the uptime from ninety-eight to ninety-nine, which represents a one percent increase in uptime.
reviewer2787357 - PeerSpot reviewer
Site Reliability Engineer 2 at a tech vendor with 1,001-5,000 employees
Centralized insights have transformed troubleshooting and now cut incident resolution time dramatically
Regarding improvements to Insights Hub, I have identified several areas.
The first improvement would be smarter AI with clear root cause suggestions. Currently, AI detects anomalies but often only says "unusual increase in failures detected" without clearly stating what changed, which deployment caused it, or what likely component is responsible. Improvements could include automatic correlation with recent deployments, suggested probable root causes such as code changes, infrastructure scale events, or database latency, and a confidence score. This would reduce investigation time even more.
The second improvement would be better noise reduction in alerts. Sometimes anomaly detection generates too many notifications and minor fluctuations are treated as incidents. Improvements could include smarter alert grouping, better baseline tuning, and business impact aware alerting, where alerts are not triggered if latency increased but there is no user impact.
The third improvement is stronger cross-cloud and hybrid visibility. Many organizations use Azure, AWS, and on-premises infrastructure. Insights Hub could improve multi-cloud correlations and unified dashboards across environments. Currently, cross-platform visibility often requires custom integrations.

Quotes from Members

 

Pros

"More than anything, we fix failures even before they occur, which is basically proactive risk detection and risk mitigation."
"We are seeing a return on investment from using Gremlin Reliability Management Platform because we are getting less production issues by thirty percent, as I mentioned earlier, making it a great investment."
"The Enterprise Reliability Platform has positively impacted my organization as it has significantly increased the efficiency and reliability of our systems."
"Using Gremlin Reliability Management Platform has raised more than fifty percent of the reliability of the infrastructure."
"Since using Gremlin Reliability Management Platform, we were able to reduce the incidents by six percent after conducting our limited experiments, and we were also able to increase the uptime from ninety-eight to ninety-nine, which represents a one percent increase in uptime."
"Gremlin Reliability Management Platform has impacted my organization positively as it helped a lot and reduced our failures, allowing us to find critical pinpoints in our application that had existed for three to ten months and led to too many improvements, reduced downtime, and a smoother experience for our application on AWS."
"Gremlin Reliability Management Platform is amazing with the reliability score, providing built-in Chaos Engineering experiments that you run on your service to receive a reliability score along with insights on the issues and risks present in your service that you can examine and work on."
"Insights Hub has provided significant positive impact to my organization, including 30 to 50 percent faster incident resolution, fewer SEV one outages, reduced alert fatigue by 20 to 30 percent, better SLA compliance, and increased customer satisfaction."
 

Cons

"Gremlin Reliability Management Platform can be improved as the pricing is a bit expensive and the learning curve for beginners is a bit difficult."
"I think Gremlin Reliability Management Platform can be improved by integrating with more AWS services or GCP services."
"I think that it will be important to have resources to perform self-directed studies on Gremlin Reliability Management Platform as an improvement."
"If you really look at the cost-benefit visibility, it is not very evident by using Gremlin Reliability Management Platform."
"The learning curve is steep as it requires skill to fully utilize and is not very beginner-friendly, and querying can be complex."
892,287 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Printing Company: 13%
Sports Company: 11%
Construction Company: 10%
Healthcare Company: 6%
Insurance Company: 74%
Construction Company: 12%
Comms Service Provider: 4%
Retailer: 2%
 

Company Size

By reviewers
Company Size | Count
Small Business | 3
Large Enterprise | 6
No data available
 

Questions from the Community

What needs improvement with Gremlin Reliability Management Platform?
Gremlin Reliability Management Platform can be improved as the pricing is a bit expensive and the learning curve for beginners is a bit difficult. It is not easy to get along with, and we need pret...
What is your primary use case for Gremlin Reliability Management Platform?
My main use case for Gremlin Reliability Management Platform is that we wanted to do chaos engineering, and in order for us to orchestrate the tests better, Gremlin helped us a lot. A quick specifi...
What advice do you have for others considering Gremlin Reliability Management Platform?
There were a lot of good examples and great documentation for Gremlin Reliability Management Platform, which is something that I appreciate. It helped us a lot. My advice for others looking into us...
What needs improvement with Insights Hub?
Regarding improvements to Insights Hub, I have identified several areas. The first improvement would be smarter AI with clear root cause suggestions. Currently, AI detects anomalies but often only ...
What is your primary use case for Insights Hub?
Insights Hub serves as a centralized monitoring and data-driven decision-making platform. It acts as a single place where data is collected, analyzed, and turned into actionable insights. The prima...
What advice do you have for others considering Insights Hub?
I am giving Insights Hub a rating of eight out of ten. The reason I have given eight is because of all the features it has and the excellent observability capabilities it provides, such as end-to-e...
 

Comparisons

No data available
No data available
 

Overview

Find out what your peers are saying about Datadog, Dynatrace, Splunk and others in Application Performance Monitoring (APM) and Observability. Updated: April 2026.