No more typing reviews! Try our Samantha, our new voice AI agent.
reviewer1494894 - PeerSpot reviewer
Senior Manager, Site Reliability Engineering at Extra Space Storage
Real User
Top 10
Jan 30, 2021
Provides insightful analytics and good visibility that assist with making architectural decisions
Pros and Cons
  • "Datadog has given us near-live visibility across our entire cloud platform."
  • "We have recently had a number of issues with stability and delays on logging, monitoring, metric evaluation, and alerts."

What is our primary use case?

We primarily use Datadog for logs, APM, infrastructure monitoring, and lambda visibility.

We have built a number of critical dashboards that we display within our office for engineers to have a good understanding of the application performance, as well as business partners to understand at a high level the traffic flowing through the app.

We started with logging, as our primary monitor, and have shifted to APM to get a deeper understanding of what our system is doing, and how the changes we are making impact the apps.

How has it helped my organization?

Datadog has given us near-live visibility across our entire cloud platform. We are finally in a state where we are alerting our users about degraded performance well before the helpdesk tickets start rolling in.

We are making major architectural decisions based on the data we are getting from Datadog. It also gives us an idea of where the complexity really lies in some older, monolithic apps. 

We have used the APM endpoint monitoring to prioritize work on slower endpoints because we can see the total count, as well as the latency. That has been a big driver in our refactor work prioritization.

We have struggled to get more business-centric measures in our code to surface actual business values in our reports, but that is our next initiative.

What is most valuable?

We started with Log analytics in the beginning stages of our monitoring journey. Those were very insightful, but obviously only as useful as we made them with good logging practices.

The dashboards we created are core indicators of the health of our system, and it is one of the most reliable sources we have turned to, especially as we have seen APM metrics impacted several times lately. We can usually rely on logs to tell us what the apps are doing.

APM and Traces have been crucial to understanding how users are actually using the app. That drives a lot of our decisions around refactoring and focusing our limited engineering resources.

What needs improvement?

Continued improvement around cost and pricing model is needed. It is pretty complex and takes a fair amount of intimate knowledge to know exactly how turning on a single function is going to impact your bill, especially when you don't see the metrics for a day or two. 

We have recently had a number of issues with stability and delays on logging, monitoring, metric evaluation, and alerts. More often than not in the past month, it seems that we get the banner across the to of our dashboards that some service is impacted. They don't always show up on the incident page, either.

Buyer's Guide
Datadog
June 2026
Learn what your peers think about Datadog. Get advice and tips from experienced pros sharing their opinions. Updated: June 2026.
900,228 professionals have used our research since 2012.

For how long have I used the solution?

We have been using Datadog for two years.

What do I think about the stability of the solution?

Overall, it has been fairly stable for us. There are the occasional issues with importing data, that has usually been resolved in a short time. We have never had an issue where that data was lost, just delayed, and eventually backfilled. 

It seems (anecdotally, of course) that there have been a few more stability issues lately. We have noticed several days that we are getting in-app alert banners indicating that some metric or log ingestion was delayed, or the web app itself was experiencing severe slowness. 

Overall, these issues are resolved rather quickly - kudos to their engineering teams. I hear that they actually use Datadog to monitor Datadog. 

What do I think about the scalability of the solution?

Datadog is very scalable but just watch the cost.

How are customer service and support?

Technical support is hit and miss; there are a number of nuances to how this tool should be implemented, and it is difficult to re-explain how our infrastructure and applications are set up every time we need an in-depth investigation to understand what is broken.

Which solution did I use previously and why did I switch?

Previously, we used AppDynamics. The pricing model didn't seem to fit with actual cloud spend. Now we may have swung the pendulum a little too far, and seem to be dealing with pricing on every facet of the application. 

How was the initial setup?

The initial setup was pretty straightforward. Additional tweaks and configuration have been a bit more difficult as we get deeper and deeper into the guts of the integrations. Making sure we are keeping up with a rapid release schedule, and keeping our server clients in sync with our app packages has been troublesome. There have been some major changes in the APM that have introduced a number of bugs and broken some of our dashboards and alerts.

What about the implementation team?

Our in-house team handled the deployment, with a lot of tickets created for the Datadog team.

What was our ROI?

ROI is difficult to measure completely. Our first year spend compared to our second and now going into the third year spend have been significantly different.

What's my experience with pricing, setup cost, and licensing?

My advice is to really keep an eye on your overage costs, as they can spiral really fast. We turned on some additional span measures and didn't realize until it was too late that it had generated a ton.

Frankly, we love the visibility it gives us into our applications, but it is a bit cumbersome to ensure we are paying for the right stuff. Overall, the cost is worth it, as it helps us keep system-critical applications up and running, and reduces our detection and correction times significantly.

Which other solutions did I evaluate?

We evaluated Dynatrace and AppD before choosing this product.

What other advice do I have?

Datadog requires pretty close supervision on the usage page to ensure you aren't going out of control. They have provided a bunch of new features to assist in retention percentage, but it can be a bit confusing on what is being retained, and what can be viewed again after triggering an alert. It's a difficult balance of making sure you are getting the right data for alerts, and still having the correct information still available for research after the fact.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
reviewer1476039 - PeerSpot reviewer
Network Engineer / AWS Cloud Engineer / Network Management Specialist at CareFirst
Real User
Jan 20, 2021
Good visualizations and dashboards help to minimizes downtime and resolve issues quickly
Pros and Cons
  • "The most valuable feature is the dashboards that are provided out of the box, as well as ones we were able to configure."
  • "Datadog provided us the ability to monitor our cloud infrastructure (network, servers, storage), platform/middleware (database, web/applications servers, business process automation), and business applications across our cloud providers."
  • "More pre-configured "Monitor Alerts" would be helpful."
  • "Pricing seemed easy until the bill came in and some things were not accounted for."

What is our primary use case?

We were in need of a cloud monitoring tool that was operationally focused on the AWS Platform. We wanted to be able to responsibly and effectively monitor, troubleshoot, and operate the AWS platform, including Server, Network, and key AWS Services.

Tooling that highlighted and detected problems, anomalies, and provided best practice recommendations. Tooling that expedites root-cause analysis and performance troubleshooting.

    Datadog provided us the ability to monitor our cloud infrastructure (network, servers, storage), platform/middleware (database, web/applications servers, business process automation), and business applications across our cloud providers.

    How has it helped my organization?

    Datadog provided us the tooling to help us effectively monitor, troubleshoot, and operate the AWS platform, including Server, Network, Database, and key AWS Services. It highlights detected problems and anomalies and provides best practice recommendations, expedites root-cause analysis, and performance troubleshooting.

    Datadog provides analytics and insights that are actionable through out-of-the-box visualizations, dashboards, aggregation, and intuitive searching that shortens the time to value and account for our limited time & resources we have to operate in production.

    What is most valuable?

    The most valuable feature is the dashboards that are provided out of the box, as well as ones we were able to configure. Specific Dashboards that were provided that made things easier were EC2, RDSKubernetes dashboards.

    We also use the logging tool, which makes searching for specific error logs easier to do.

    Datadog Logging provides the capability for us to use AWS logs such as VPC Flow Logs, ELB, EC2, RDS, and other logs that provide lots of relevant operational data but are not actionable. Datadog provides a tool that can provide us analytics and insights that are actionable for visualizations, dashboards, alerting, and intuitive searching.

      What needs improvement?

      More pre-configured "Monitor Alerts" would be helpful. Datadog's knowledge of its customers and what they are looking for in terms of monitoring and alerting could be taken advantage of with pre-canned alerts. They have started this with "Recommended Monitors".  That feature was very helpful when configuring our Kubernetes alerts. More would be even better. 

      Datadog tech support is very good. One area that could be more helpful is actually talking to someone or sharing your screen to help troubleshoot issues that arise. For new cloud engineers just coming into the cloud monitoring field, there is a learning curve. There is a lot to learn and figure out. For example, we still ran into some issues configuring the private link and more videos of how to do things could be of use.

      For how long have I used the solution?

      We have been using Datadog for one year.

      What do I think about the stability of the solution?

      We have not run into any issues with stability.

      What do I think about the scalability of the solution?

      The scalability of Datadog is very good.

      How are customer service and technical support?

      Customer service has been excellent.  I communicate weekly a Datadog Customer Success Manager.  He helps me followup on any open issues or questions that we may have.  Technical support has been very good. Opening tickets is easy.  Sometimes a Tech Engineer may take a bit of time to get back with you.  Communicating with Tech Engineer has to be done via ticket/email - no phone assistance is available.

      Which solution did I use previously and why did I switch?

      we did not.

      How was the initial setup?

      Procedures for setup seemed straightforward but once you got going, there were some issues. For us, getting our private link to work needed additional tech support. They were able to help us resolve the issue we were experiencing. I think the procedures could be done a bit better to help you with setup.

      What about the implementation team?

      We deployed it ourselves.

      What was our ROI?

      Datadog helps us minimize downtime and helps us resolve issues quickly.  

      What's my experience with pricing, setup cost, and licensing?

      Pricing seemed easy until the bill came in and some things were not accounted for. The issue may have been that we didn't realize what was being accounted for, such as the number of servers and the number of logs being ingested.

      Datadog had really good pre-sale reps that work with us but need to make sure all the details are covered.

      Which other solutions did I evaluate?

      The solution we were looking for needed to provide out-of-the-box capabilities that shorten the time to value. We had limited time & limited resources. Datadog had high recommendations in these areas, so we decided to do a trial with them.

        What other advice do I have?

        We are very pleased with Datadog overall.

        Datadog has assigned an account rep to us that meets with us regularly to make sure all our needs are being met and help us get answers to any questions or issues we are running up against. They have been of great helping us standup monitoring of our Kubernetes environment.

        Which deployment model are you using for this solution?

        Hybrid Cloud

        If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

        Amazon Web Services (AWS)
        Disclosure: My company does not have a business relationship with this vendor other than being a customer.
        PeerSpot user
        Buyer's Guide
        Datadog
        June 2026
        Learn what your peers think about Datadog. Get advice and tips from experienced pros sharing their opinions. Updated: June 2026.
        900,228 professionals have used our research since 2012.
        reviewer1479957 - PeerSpot reviewer
        Senior Director of DevOps at Housecall Pro
        Real User
        Jan 4, 2021
        Good graphing and dashboards, and it improves visibility for developers
        Pros and Cons
        • "Having a wealth of information has helped us investigate outages, and having historical data helps us tune our system."
        • "Metric graphing and Dashboards are the most valuable features because they give us good observability into our system and work well to alert us when interesting things happen."
        • "Datadog has a lot of documentation, but a lot of that documentation assumes you know how the service works, which can lead to confusion."

        What is our primary use case?

        We primarily use Datadog for the monitoring of EC2 and ECS containers running mostly Rails applications that host a SaaS product. We also monitor ElasticSearch and RDS, and we are working on adding their Application Performance Monitoring solution to monitor our applications directly.

        We use DataDog to create dashboards, graphs, and alerts based on interesting metrics. DataDog is our first place to look to find the performance of our system.

        We also use their logging platform and it works well. Especially useful is that the logs and metrics are tightly integrated so you can jump between them easily.

        How has it helped my organization?

        Developers are able to see how code is running in production, where this was mostly opaque previous to us implementing DataDog. We are able to emit custom metrics that are specific to our business, and the built-in metrics have also proven useful. Having a wealth of information has helped us investigate outages, and having historical data helps us tune our system.

        DevOps engineers are able to put sensors around our system to proactively detect problems, whereas before, our engineers heard about problems from customers. Logs are easier to find for developers.

        What is most valuable?

        Metric graphing and Dashboards are the most valuable features because they give us good observability into our system and work well to alert us when interesting things happen. We use this functionality daily.

        We value the monitoring capability since it allows us to be pushed alerts, rather than have to observe graphs continually. The integrations with Slack and PagerDuty enable us to be interrupted appropriately and keep a running tab on the system without bothering us unnecessarily.

        The online process monitoring has been extremely helpful, as it gives engineers the ability to see the live status of all the processes running our systems without them having to log in.

        What needs improvement?

        Their logging solution is expensive for our use case. They do have the capability to rehydrate old or incomplete logs, and it works, but I would rather not have to think about that operation.

        Datadog has a lot of documentation, but a lot of that documentation assumes you know how the service works, which can lead to confusion. Positive note is that they do have lots of documentation, it just needs better curation.

        Their APM solution still needs some work, but they are actively developing it. I would also like to see more database-specific application monitoring.

        For how long have I used the solution?

        I have been using Datadog for five years across two companies.

        What do I think about the stability of the solution?

        Any issues are addressed and communicated very quickly. I have not had any issues with uptime.

        What do I think about the scalability of the solution?

        If you do not need 100% of data such as logs, APM traces, etc., this scales well. It does not scale as well if you want 100% of your logs indexed. You should understand any other usage-based bills before using any part of their service as it is very easy to run up a large bill.

        The performance of the system scales very well, and host monitoring and APM are relatively cheap.

        How are customer service and technical support?

        Account support is excellent.

        Customer support is good if you get them to go beyond pointing out the right documentation.

        Which solution did I use previously and why did I switch?

        Previously, I used homebuilt solutions with Nagios and Cacti but found that there was far too much work to understand them and keep them up and fed compared to the value that I got. They also did not integrate well with existing data sources without a lot of effort.

        I also previously used StackDriver and found it too opinionated. I like that DataDog gives you tools to work with certain types of data and make your own graphs, monitors, etc., whereas, with StackDriver, I felt like there were a limited number of ways you could accomplish goals.

        How was the initial setup?

        The basic setup is easy. A more advanced setup can be tricky because the documentation assumes you know how the system works already. Support is somewhat helpful, but mostly points out the documentation you should already have found.

        What about the implementation team?

        We implemented in-house.

        What's my experience with pricing, setup cost, and licensing?

        My advice is to understand what number of hosts and data you want to commit to. Beware that usage-based billing is both a blessing and a curse. It is easy to run up a large bill, so become familiar with the cost of each piece of your bill and use the metrics they supply to estimate and monitor your bill.

        I have had good luck with their support team helping us to figure out the correct commit levels. Their account support is excellent in this regard. I have heard their sales team can be aggressive, but I have not experienced it personally.

        Which other solutions did I evaluate?

        I originally chose Datadog because of my previous experience. We recently considered moving over to New Relic because we liked their APM solution better. However, the pricing of New Relic and our familiarity with Datadog won over. New Relic is a good product but it didn't fit our overall needs as well as Datadog.

        Which deployment model are you using for this solution?

        Public Cloud

        If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

        Disclosure: My company does not have a business relationship with this vendor other than being a customer.
        PeerSpot user
        Senior Director with 10,001+ employees
        Real User
        Sep 12, 2020
        A good solution for infrastructure, but not for application-level monitoring
        Pros and Cons
        • "Datadog's ability to group and visualize the servers and the data makes it relatively easy for the root cause analysis."
        • "Datadog lacks a deeper application-level insight. Their competitors had eclipsed them in offering ET functionality that was important to us. That's why we stopped using it and switched to New Relic. Datadog's price is also high."

        What is our primary use case?

        We used Datadog to capture the salvatory of our AWS fleet of around 1,200 servers.

        What is most valuable?

        Datadog's ability to group and visualize the servers and the data makes it relatively easy for the root cause analysis.

        What needs improvement?

        Datadog lacks a deeper application-level insight. Their competitors had eclipsed them in offering ET functionality that was important to us. That's why we stopped using it and switched to New Relic.

        Datadog's price is also high.

        For how long have I used the solution?

        I have been using Datadog for about three years.

        What do I think about the stability of the solution?

        Stability really wasn't ever an issue. We didn't have any outages specific to Datadog where we couldn't get reports or insights to information. We were more concerned about the stability of our own systems and applications.

        What do I think about the scalability of the solution?

        There was no issue with scaling as such. It didn't scale well only from the cost perspective.

        How are customer service and technical support?

        Fortunately, because of the stability of the solution, we never had reasons to deal with technical support. Most of our interaction was with their product management, which was focused on the feature capability and ultimately pricing.

        What's my experience with pricing, setup cost, and licensing?

        It didn't scale well from the cost perspective. We had a custom package deal.

        Which other solutions did I evaluate?

        We switched from Datadog to New Relic because it offered ET functionality. Datadog was traditionally born out of monitoring infrastructure. Over the years, they have improved their ability to give you insights at the application layer and to be considered under APM. New Relic really started at the application layer and has worked its way down. 

        Ultimately, we were able to accept New Relic because coming from an operations team, infrastructure was more important. As our application became more complex, our application developers needed better insight. Because there is a significant overlap in the Venn diagram between Datadog and New Relic, we felt that the needs of the infrastructure team and the applications team could be met with New Relic and its expansion in providing a sort of lightweight security.

        What other advice do I have?

        Datadog started off at the infrastructure level, and New Relic started off at the application level. Both of them were expanding not only into each other's space but also into the SIM space.

        There are a lot of options out there. For folks like me, it becomes a costly proposition because, at the end of the day, we're talking about logs, events that get pushed out. I have to push out some to Datadog and some to the security event manager. Then you start to think why can't you just push them to one place and let a product do that. That's where these products are trying to grow. They're not quite there yet because the SIM space is pretty mature. An enterprise like ours needs something fully focused and dedicated. Startups can live with New Relic that has a security capability or Datadog.

        I would advise you to really understand the value that you're trying to go after. Make sure that you're not trying to solve all problems that you have from the observability perspective with Datadog because that will erode the value you get out of this solution.

        Make sure that you are going to use Datadog for infrastructure, and it is going to be great. If you start adding other kinds of stuff to it, you'll probably start losing some of that value. Especially, if you want to go for application-level monitoring, you may be a bit disappointed.

        I would rate this solution a six out of ten. I'm a very price-conscious kind of purchaser.

        Disclosure: My company does not have a business relationship with this vendor other than being a customer.
        PeerSpot user
        reviewer2275260 - PeerSpot reviewer
        Enterprise Architect at a mining and metals company with 10,001+ employees
        Real User
        Top 5
        Sep 15, 2023
        Comes with good documentation and clear dashboards
        Pros and Cons
        • "Datadog has clear dashboards and good documentation."
        • "The solution needs to integrate AI tools."

        What is most valuable?

        Datadog has clear dashboards and good documentation. 

        What needs improvement?

        The solution needs to integrate AI tools. 

        How are customer service and support?

        I avail support from our internal team. 

        What other advice do I have?

        I rate Datadog a nine out of ten. 

        Disclosure: My company does not have a business relationship with this vendor other than being a customer.
        PeerSpot user
        reviewer2000271 - PeerSpot reviewer
        Software Developer at a tech vendor with 51-200 employees
        Vendor
        Dec 7, 2022
        Great for logging and monitoring with useful RUM capabilities
        Pros and Cons
        • "The product has offered increased visibility via logging APM, metrics, RUM, etc."
        • "Even though it is powerful on its own, the UI-based design lacks elegance, efficiency, and complexity."

        What is our primary use case?

        We’re currently using logging, monitoring, metrics, APM, etc.

        We've started to use e-SLOs, however, it takes a bit of time to work through those.

        RUM has been very useful. I have used this in the past to debug problems in production, which has been g great.

        We also want to start using synthetics and tracing more. 

        Our application currently runs in many different environments based on our customers' requirements. This allows us to see everything in one place and filter by environment as required, which is extremely useful.

        How has it helped my organization?

        The product has offered increased visibility via logging APM, metrics, RUM, etc. We've gone from almost nothing to something that didn’t take a lot of time to set up. It has been great since we had so little time to spare.

        As a startup, we have limited resources, and no one has enough time for anything. The fact that there are so many easy integrations and configurations by YAML makes everything easy to set up without needing a full-time employee. Instead, we're just configuring monitoring solutions which are very desirable.

        What is most valuable?

        The product is very useful for tracking down anything that’s gone wrong. I’ve been using it to make sure everything is working correctly after deployment and to make sure we don’t suffer performance degradation. We've found it great for tracking down anything that’s gone wrong in real-time.  

        The logs are helpful. They are necessary for any application. Prior to this, our solution would have been to SSH into a machine and tail log files. This, however, is untenable for many reasons, and one of the first things I wanted to change.

        The RUM has been great. Seeing real users interacting with our website is quite helpful.

        What needs improvement?

        Sometimes it’s difficult to customize certain queries to find specific things, specifically with the logging solution. I’ve used other logging platforms in the past that have extensive and mature query languages. This might not be super friendly to start out with, yet can be very powerful. 

        I wish there was more of an emphasis on query languages instead of the UI-based tooling that Datadog provides. Even though it is powerful on its own, the UI-based design lacks the elegance, efficiency, and complexity.

        For how long have I used the solution?

        I've been using the solution for six months or so.

        Which solution did I use previously and why did I switch?

        I was previously familiar with Splunk.

        What's my experience with pricing, setup cost, and licensing?

        I don’t handle pricing or licensing aspects. 

        Which other solutions did I evaluate?

        I did not evaluate other options. 

        Disclosure: My company does not have a business relationship with this vendor other than being a customer.
        PeerSpot user
        Real User
        Dec 7, 2022
        Great dashboards, good monitoring, and easy SLAs
        Pros and Cons
        • "Profiling has been made easier."
        • "Lately, chat support has a longer waiting time."

        What is our primary use case?

        Our primary use case would be using the dashboards and getting proper insights based on the dashboards.

        The monitoring, SLO, and SLA have been better and easier since we started using the Terraform infrastructure. APM has been easier as we had to enable it through the CronJob directly.

        Profiling has been made easier. We are able to get many insights into the code. Profiling provides really good insights right now. 

        Logs are the most valuable and the best solution so far. Datadog can help solve any slow queries or database-related errors. 

        The primary use case would be using the dashboards and getting proper insights based on the dashboards.

        How has it helped my organization?

        Monitoring has been better and easier since we started using the Terraform infrastructure.

        APM has been easier as we had to enable it through the CronJob directly.

        Profiling has made it easier in terms of getting many insights into the code.

        The logs are the most valuable and the best solution. Datadog can help us to solve any slow queries or database-related errors.

        What is most valuable?

        Profiling provides really good insights, and APM has really good tracing visibility. 

        The SLA and SLO definitions and the monitoring are also really important and very valuable parts of the product and make great Datadog features. 

        Datadog support is also really valuable as they provide support for the product through the chat as well. 

        The Datadog premium support has helped us to provide faster outcomes for a problem. 

        Also, rather than having an email thread, it would be better to get the support on call and sort out the issue, which is the support we get from Datadog CSM.

        What needs improvement?

        Integration should have been easier. It is very tough to go to all the services and enable Datadog integration for each AWS service. 

        We can add the AWS services and the services on one page and show only the services that are enabled. A similar approach should be for any other integration.

        Lately, chat support has a longer waiting time. We would love to get faster chat support. We also need additional support for sending the flare files

        Disclosure: My company does not have a business relationship with this vendor other than being a customer.
        PeerSpot user
        reviewer2045055 - PeerSpot reviewer
        Sr. Software Engineer at a tech vendor with 51-200 employees
        Real User
        Dec 7, 2022
        Good observability and dashboards with increased visibility
        Pros and Cons
        • "Datadog dashboards are pretty great."
        • "We need more integration with security tools like Drata."

        What is our primary use case?

        Observability is a key use case, as is security.

        How has it helped my organization?

        With this product, we get more visibility into our K8s clusters and more intelligent alerting.

        What is most valuable?

        Datadog dashboards are pretty great.

        What needs improvement?

        We need more integration with security tools like Drata.

        For how long have I used the solution?

        I've used the solution for three years.

        Disclosure: My company does not have a business relationship with this vendor other than being a customer.
        PeerSpot user
        reviewer2045049 - PeerSpot reviewer
        Product Manager, Delivery Engineering at a media company with 1,001-5,000 employees
        Real User
        Dec 7, 2022
        Intuitive to set up with great dashboards and dashboards and APM
        Pros and Cons
        • "The tools are powerful and intuitive to set up."
        • "Billing should be more transparent."

        What is our primary use case?

        The main use case is observability and reliability as part of a platform/delivery engineering solution. We use the product to assist tenants and clients within the company to get more ramped up on SRE/DevOps.

        How has it helped my organization?

        The solution has provided us with a lot more insight into service-level metrics, which is especially useful with APM/tracing. It gives us all-up dashboards and alerts to assist with incident management.

        What is most valuable?

        The most useful aspects of the product are the dashboards and APM/tracing. The tools are powerful and intuitive to set up as well.

        What needs improvement?

        Custom-level metrics could be improved.

        Billing should be more transparent.

        For how long have I used the solution?

        I've used the solution for three to four years at multiple companies.

        What do I think about the stability of the solution?

        The solution is very stable.

        What do I think about the scalability of the solution?

        The scalability is fantastic so far.

        How are customer service and support?

        Technical support has been great so far.

        How was the initial setup?

        The initial setup is straightforward. The documentation we use is very clean and concise.

        What about the implementation team?

        We handled the initial setup in-house.

        What's my experience with pricing, setup cost, and licensing?

        I would advise others to be cautious around custom metrics and be picky when setting them up.

        Which deployment model are you using for this solution?

        Hybrid Cloud

        If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

        Amazon Web Services (AWS)
        Disclosure: My company does not have a business relationship with this vendor other than being a customer.
        PeerSpot user
        reviewer2045043 - PeerSpot reviewer
        Software Engineer at a comms service provider with 5,001-10,000 employees
        Real User
        Dec 7, 2022
        Great monitors and APM with helpful Terraform support
        Pros and Cons
        • "APM is great and has provided low-effort out-of-the-box observability for various services."
        • "Delta traces on the Golang profiler are extremely expensive concerning memory utilization."

        What is our primary use case?

        We primarily use the product for tracing, metrics, and alarms in various deployment environments.

        How has it helped my organization?

        The product has provided our company with improved observability, which has helped make the incident response more targeted and quicker.

        What is most valuable?

        APM is great and has provided low-effort out-of-the-box observability for various services. 

        Monitors are helpful, and definitions are simple. 

        Terraform support is nice as it allows us to create homogenous monitoring environments in various deployment environments with little additional effort. It also facilitates version control of monitor definitions, etc. 

        The Golang profiler is generally good with the exception of delta profiles; it has provided helpful observability into Heap Allocations which has helped us reduce GC overhead.

        What needs improvement?

        Delta traces on the Golang profiler are extremely expensive concerning memory utilization. In a Kubernetes environment where we would like to set per-pod memory allocations as low as possible, the overhead of that profiler feature is prohibitive. In one case, our pods (which were provisioned to target 250 MB and max at 500 MB memory) got stuck in a crash loop due to out-of-memory, which was caused entirely by the delta profiles feature of the profiler.

        Multistep Datadog synthetics lack the feature of basic arithmetic. For our use case, performing basic arithmetic on the output of previous steps to produce input for subsequent steps would be extremely useful.

        For how long have I used the solution?

        I've used the solution for nine months.

        Which deployment model are you using for this solution?

        Private Cloud

        If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

        Microsoft Azure
        Disclosure: My company does not have a business relationship with this vendor other than being a customer.
        PeerSpot user
        Buyer's Guide
        Download our free Datadog Report and get advice and tips from experienced pros sharing their opinions.
        Updated: June 2026
        Buyer's Guide
        Download our free Datadog Report and get advice and tips from experienced pros sharing their opinions.