
Google Cloud's operations suite (formerly Stackdriver) vs Gremlin Reliability Management Platform comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Google Cloud's operations s...
Ranking in Application Performance Monitoring (APM) and Observability: 25th
Average Rating: 8.2
Reviews Sentiment: 7.0
Number of Reviews: 10
Ranking in other categories: Log Management (24th), Cloud Monitoring Software (18th)

Gremlin Reliability Managem...
Ranking in Application Performance Monitoring (APM) and Observability: 54th
Average Rating: 8.4
Reviews Sentiment: 7.5
Number of Reviews: 5
Ranking in other categories: IT Infrastructure Monitoring (53rd), DevSecOps (15th)
 

Mindshare comparison

As of March 2026, in the Application Performance Monitoring (APM) and Observability category, the mindshare of Google Cloud's operations suite (formerly Stackdriver) is 0.9%, down from 1.2% the previous year. The mindshare of Gremlin Reliability Management Platform is 0.1%. Mindshare is calculated based on PeerSpot user engagement data.
Application Performance Monitoring (APM) and Observability Mindshare Distribution
Product: Mindshare (%)
Google Cloud's operations suite (formerly Stackdriver): 0.9%
Gremlin Reliability Management Platform: 0.1%
Other: 99.0%
 

Featured Reviews

Anand_Patel - PeerSpot reviewer
Senior Technical Architect at T-Systems International GmbH
Offers reliable Ops Agent and logging transport feature with easy third-party integrations
As part of our company, we implemented several changes in our log analytics pattern, including the storage and procurement process. Before implementing the solution, our company was able to procure only one year of data; afterward, we reached the three-year mark. We have seen a reduction of around 15-20% in our company's total analytics consumption. This result was possible because the solution allowed us to create a dashboard where factors like storage costs, the proportion of logs, and whether logs sit in a storage bucket or BigQuery can all be checked. Earlier, all logs were kept in raw storage, but now our company is able to move logs into a table bucket, which contributes to cost savings. It has default integration for all GCP services. The recently added managed Prometheus support gives organizations more flexibility to remain connected with their current Prometheus setup. We leveraged the integrated FinOps Hub for recommendations for our workloads and server configurations, which helped us a lot in order to get maximum TCO.
VL
Senior Software Engineer at a sports company with 10,001+ employees
Chaos experiments have revealed weak points and now provide controlled cost-saving tests
The best features of Gremlin Reliability Management Platform start with safe failure injection, which is crucial because we can simulate failures knowing that these are just dumping tests and not actual issues. We can inject a CPU spike, memory exhaustion, network latency, or a server shutdown; server shutdown is one of my favorite features in Gremlin Reliability Management Platform. The controlled blast radius is another standout feature. It helped my team because we wanted to target only one specific container among the Docker containers that we deployed. It let us conduct tests in a very specific, isolated manner instead of launching a larger test or focusing on hundreds of servers at a time, resulting in very limited impact. Since ours is a very small team, we do not want to impact other servers. The controlled blast radius helped us focus only on our servers and not impact any other team. Gremlin Reliability Management Platform has positively impacted my organization because before using it, we did not even know how to conduct these chaos engineering tests. We had heard about them, but we had no idea how to do something of that nature. If there are ten servers, ten systems in our architecture, and something suddenly goes down, nobody knew what would happen next. We did not even know how to simulate these types of tests. This lack of confidence has been mitigated by using Gremlin Reliability Management Platform. Now we can confidently test and see which system is the most critical. If this goes down, what happens? How much business valuation are we going to impact? How much loss are we going to incur? All of this is now clearly visible and transparent. Since using Gremlin Reliability Management Platform, we were able to reduce incidents by six percent after conducting our limited experiments. We were also able to increase uptime from ninety-eight to ninety-nine percent, which represents a one-percentage-point increase in uptime.
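To put the reviewer's uptime figures in perspective, the following is a minimal sketch of the arithmetic, assuming a 365-day year and that "ninety-eight to ninety-nine" refers to percent availability measured over a full year. The function name and constants are illustrative and do not come from the review or from either product.

```python
# Illustrative arithmetic only: converts an availability percentage into
# expected downtime per year. The 365-day year is an assumption.
HOURS_PER_YEAR = 365 * 24  # 8760 hours, ignoring leap years


def annual_downtime_hours(uptime_percent: float) -> float:
    """Return the hours per year a system is down at the given uptime percentage."""
    return HOURS_PER_YEAR * (100.0 - uptime_percent) / 100.0


before = annual_downtime_hours(98.0)  # downtime at 98% uptime
after = annual_downtime_hours(99.0)   # downtime at 99% uptime

# Relative reduction in downtime when moving from 98% to 99% uptime.
relative_reduction = (before - after) / before

print(f"Downtime at 98%: {before:.1f} h/year")
print(f"Downtime at 99%: {after:.1f} h/year")
print(f"Downtime reduction: {relative_reduction:.0%}")
```

Under these assumptions, a one-percentage-point gain in uptime corresponds to roughly half as much downtime per year, which is why small uptime changes can matter more than the headline number suggests.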

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature is the multi-cloud integration, where there is support for both GCP and AWS."
"Our company has a corporate account for Google Cloud and so our systems and clusters integrate really well."
"We find the solution to be stable."
"I like the monitoring feature."
"The features that I have found most valuable are its graphs - if I need any statistics, in Kubernetes or at a VPN level, I can quickly get the reports."
"The cloud login enables us to get our logs from the different platforms that we currently use."
"The features that I have found most valuable are its graphs - if I need any statistics, in Kubernetes or Kong level or VPN level, I can quickly get the reports."
"Gremlin Reliability Management Platform is amazing with the reliability score, providing built-in Chaos Engineering experiments that you run on your service to receive a reliability score along with insights on the issues and risks present in your service that you can examine and work on."
"Using Gremlin Reliability Management Platform has raised more than fifty percent of the reliability of the infrastructure."
"Since using Gremlin Reliability Management Platform, we were able to reduce the incidents by six percent after conducting our limited experiments, and we were also able to increase the uptime from ninety-eight to ninety-nine, which represents a one percent increase in uptime."
"The Enterprise Reliability Platform has positively impacted my organization as it has significantly increased the efficiency and reliability of our systems."
"We are seeing a return on investment from using Gremlin Reliability Management Platform because we are getting less production issues by thirty percent, as I mentioned earlier, making it a great investment."
 

Cons

"Lacking sufficient operations documentation."
"Google Stackdriver is a stable product, but there is some lagging we cannot stop."
"It is difficult to estimate in advance how much something is going to cost."
"It could be more stable."
"This solution could be improved if it offered the ability to analyze charts, such as a solution like Kibana."
"The product provides minimal metrics that are insufficient."
"If I want to track any round-trip or breakdowns of my response times, I'm not able to get it. My request goes through various levels of the Google Cloud Platform (GCP) and comes back to my client machine. Suppose that my request has taken 10 seconds overall, so if I want to break it down, to see where the delay is happening within my architecture, I am not able to find that out using Stackdriver."
"I think that it will be important to have resources to perform self-directed studies on Gremlin Reliability Management Platform as an improvement."
"I think Gremlin Reliability Management Platform can be improved by integrating with more AWS services or GCP services."
"Gremlin Reliability Management Platform can be improved as the pricing is a bit expensive and the learning curve for beginners is a bit difficult."
 

Pricing and Cost Advice

"We have a basic standard license without any additional costs."
"The cost of using Stackdriver depends on usage."
"The cost could be lower."
Information not available
885,311 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 13%
Computer Software Company: 10%
Manufacturing Company: 7%
Comms Service Provider: 7%

Sports Company: 12%
Printing Company: 12%
Construction Company: 9%
Media Company: 6%
 

Company Size

By reviewers
Company Size: Count
Small Business: 2
Midsize Enterprise: 1
Large Enterprise: 8
No data available
 

Questions from the Community

What is your experience regarding pricing and costs for Google Stackdriver?
As Ops Suite is a Google product, it effectively comes at zero setup cost. To manage your on-premises logs on site, it involves negligible cost for using Ops Agent and it also include...
What needs improvement with Google Stackdriver?
If the errors are caught early in the interface, it would be easier for users to manage. The process of logging analytics can be improved.
What is your primary use case for Google Stackdriver?
I use the solution for logging, defining alerts, and monitoring. Our company's Java and Python logging teams mainly use it.
What needs improvement with Gremlin Reliability Management Platform?
I think that it will be important to have resources to perform self-directed studies on Gremlin Reliability Management Platform as an improvement. There is a small and fast and simple certification...
What is your primary use case for Gremlin Reliability Management Platform?
My main use case for Gremlin Reliability Management Platform is chaos testing. I take my infrastructure and then I sabotage some things to see how they reach the goal. I try network or infrastructu...
What advice do you have for others considering Gremlin Reliability Management Platform?
The main advice I would give to others looking into using Gremlin Reliability Management Platform would be to study it. Do not be shy to fail. Test everything and do lab architectures to test. It i...
 

Also Known As

Google Stackdriver, Stackdriver Monitoring, Stackdriver Logging, Google Cloud Monitoring
No data available
 

Overview

 

Sample Customers

Uber, Batterii, Q42, Dovetail Games
Information Not Available
Find out what your peers are saying about Google Cloud's operations suite (formerly Stackdriver) vs. Gremlin Reliability Management Platform and other solutions. Updated: March 2026.