Gremlin Reliability Management Platform empowers organizations to proactively identify and mitigate potential failures. It enhances system resilience through controlled chaos engineering, aiding tech teams in delivering reliable services.
| Product | Mindshare (%) |
|---|---|
| Gremlin Reliability Management Platform | 0.1% |
| Dynatrace | 5.5% |
| Datadog | 4.7% |
| Other | 89.7% |
| Type | Title | Date |
|---|---|---|
| Category | Application Performance Monitoring (APM) and Observability | May 31, 2026 |
| Product | Reviews, tips, and advice from real users | May 31, 2026 |
| Comparison | Gremlin Reliability Management Platform vs Datadog | May 31, 2026 |
| Comparison | Gremlin Reliability Management Platform vs Dynatrace | May 31, 2026 |
| Comparison | Gremlin Reliability Management Platform vs Splunk AppDynamics | May 31, 2026 |
| Company Size | Count |
|---|---|
| Small Business | 3 |
| Large Enterprise | 5 |
| Company Size | Count |
|---|---|
| Small Business | 42 |
| Midsize Enterprise | 14 |
| Large Enterprise | 29 |
Designed for tech-savvy users, Gremlin enables teams to implement chaos engineering effectively to ensure system reliability. It offers precise control over variables, allowing teams to simulate real-world scenarios and fortify system operations. Gremlin plays a strategic role in preventing downtime and maintaining optimal service delivery through a suite of advanced tools tailored for IT infrastructure.
What are the most important features of Gremlin?

In industries such as e-commerce, finance, and healthcare, Gremlin helps maintain service reliability by identifying vulnerabilities before they affect operations. IT teams can simulate stress tests specific to their industry, ensuring systems are resilient against potential threats, enhancing customer satisfaction, and securing business continuity.
| Author info | Rating | Review Summary |
|---|---|---|
| Senior Software Engineer at a sports company with 10,001+ employees | 4.5 | I use Gremlin for chaos engineering, significantly boosting confidence, increasing uptime, and providing strong ROI through failure injection. Despite its expense and learning curve, the stable platform is impactful, making it a valuable 9/10 tool. |
| VP Global at a tech vendor with 10,001+ employees | 4.5 | I use Gremlin to proactively test failures and improve system reliability, especially with dependency mapping and safe fault injection. While it boosts confidence and offers great features, better cost-benefit visibility and deeper dependency intelligence are needed. |
| DevOps Specialist at a media company with 10,001+ employees | 4.0 | I use Gremlin for chaos engineering on Kubernetes, valuing its prebuilt tests and automated scheduling. It boosted our production confidence, cutting issues by 30%. I seek more cloud integrations and AI for broader usability, giving it an 8/10. |
| Dev Ops To Development (IT), self-employed at a non-tech company | 4.5 | I use Gremlin for chaos testing, boosting infrastructure reliability by over 50%. I value its flexibility and dashboard but desire more free learning resources and better integration with observability platforms like Splunk. |
| DevOps & Mlops Engineer at a printing company with 1-10 employees | 5.0 | I use Gremlin for Chaos Engineering, leveraging its built-in experiments to get reliability scores and insights for my web services. It's stable, scalable, and helps save time, though I wish it had open-source features. The support is great. |
| Documentation Engineer at a tech vendor with 1,001-5,000 employees | 4.0 | I find Gremlin excellent for simulating extreme stress to test application resilience, identify weaknesses, and reduce outages. While it significantly improved our recovery time and reduced downtime, I believe the UI and pricing could be optimized. |
| Site Reliability Engineer at a tech services company with 10,001+ employees | 3.0 | I use the Enterprise Reliability Platform to maintain reliability, finding it significantly increased efficiency and reliability. My organization measured improvements in latency and SLOs, and I have no recommendations for improvement after one year of use. |
| Performance Test Engineer at an educational organization with 51-200 employees | 5.0 | I use Gremlin for chaos engineering on AWS Kubernetes, finding critical failures and reducing downtime. Its flexibility, ease of use, and templates delivered significant ROI, proving more effective than Amazon FIS. I highly recommend it. |