reviewer2004198 - PeerSpot reviewer
Devops Engineer II at a comms service provider with 11-50 employees
Real User
Great CPU profiler and lots of features but can be overwhelming
Pros and Cons
  • "Even if we don't end up using Datadog, it revealed problems and optimizations to us that weren't obvious before."
  • "The sheer amount of products that are included can be overwhelming."

What is our primary use case?

We use the solution for monitoring our logs across distributed clusters. Right now, we have an Elasticsearch solution that is tied to each platform (our product is a PaaS solution). 

We are looking at moving to a single-pane-of-glass solution, which Datadog would be good for (plus, we could wrap up other tools like Prometheus, Grafana, PagerDuty, Pingdom, and more). We want to be able to have Datadog running on one single cluster and ingesting and processing logs from all our distributed clusters.

How has it helped my organization?

So far, we are just in the evaluation stage, so it's hard to say how it has improved our organization. However, one positive impact is that it has shown us an example of how to build in observability, metrics, tracing, etc., in a better way.

Even if we don't end up using Datadog, it revealed problems and optimizations to us that weren't obvious before. One potential reason why it may not help us is that we have strict rules around log parsing and may not be able to send logs to an external organization for ingestion/processing.

What is most valuable?

The CPU profiler has been interesting even though it isn't our core use case. 

We are finding that Datadog has way more offerings than originally expected, so we are constantly finding new parts of it that would be compelling to use.

The logging and ingestion features are very similar to our current Elasticsearch setup. We find the tracing and overall integration/ecosystem to be the most valuable part. Basically, the CPU profiler is a good example of a value add for a problem we knew we had, yet was low priority and had hacky workarounds. The value proposition is in the ecosystem as a whole.

What needs improvement?

The sheer amount of products that are included can be overwhelming. 

The solution requires a better overarching UI, which would make things clearer. Even though I generally dislike the AWS UI, it makes the different services very clear, and it also makes it clear where you are at any given point.

The sidebar for all the different services is a bit much. 

I also found the tagging of logging pipelines to be a bit tedious. It would be great if, once marked up, it would automatically be a first-class citizen in Datadog.

Buyer's Guide
Datadog
August 2025
Learn what your peers think about Datadog. Get advice and tips from experienced pros sharing their opinions. Updated: August 2025.
865,384 professionals have used our research since 2012.

For how long have I used the solution?

We are still in the evaluation stage and have used it less than one month.

What do I think about the stability of the solution?

The stability looks good so far.

What do I think about the scalability of the solution?

It seems easier to scale and build app functionality across multiple teams than with other solutions.

Which solution did I use previously and why did I switch?

We have used Elasticsearch, Grafana, and Prometheus. We are still evaluating Datadog.

What was our ROI?

The product has provided good ROI by saving development time as well as time spent setting up and managing Elasticsearch.

What's my experience with pricing, setup cost, and licensing?

It is somewhat expensive compared to open-source options.

Which other solutions did I evaluate?

We evaluated Elasticsearch, Grafana, and Prometheus. We are still evaluating Datadog.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer: evaluator.
PeerSpot user
reviewer2004201 - PeerSpot reviewer
Software Engineer at a comms service provider with 11-50 employees
Real User
Industry-standard with good profiling and helpful alerts
Pros and Cons
  • "The biggest thing I liked was the combination of all the things - monitoring, log aggregation, and profiling."
  • "It can be overwhelming for new people as it has a lot of features."

What is our primary use case?

We use different tools for log collection and monitoring. Using Datadog will combine different use cases into one product that will be easier to manage. 

The tools we use are open-source, so there is no commercial support. Having customer support would be ideal since we're a small team. 

Profiling would be another great feature to have. Currently, it's manual. Having Datadog would give us a standard, and we wouldn't have to do much manual work.

How has it helped my organization?

It will solve a lot of our problems. We have different tools for each of them in our organization; they are open-source and therefore not very well maintained, and there is no customer support.

Having an industry-standard product such as Datadog would be ideal for us as we are short on manpower. Since this is a managed all-in-one product with readily available support, we will be able to focus on application logic rather than figuring out why a tool isn't working.

What is most valuable?

The biggest thing I liked was the combination of all the things - monitoring, log aggregation, and profiling. We have different tools for each of them in our organization and all of them are open-source. These are not very well maintained and there is no customer support. 

Having an industry-standard product is ideal for us as we are short on manpower. Profiling is another amazing feature. Currently, we rely on some open-source solutions, and it's all done locally. Having it done on Kubernetes would give us more insights and help with performance. Alerting is again a nightmare for us. Datadog solves all of these issues.

What needs improvement?

It can be overwhelming for new people as it has a lot of features. The UI could certainly be improved. Having less information with better organization could help newcomers. I haven't seen the documentation; however, well-organized documentation would attract a wider range of users.

For how long have I used the solution?

I've been using the solution for three years.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
reviewer2003934 - PeerSpot reviewer
SRE at a computer software company with 51-200 employees
Vendor
Great for log aggregation, searching, and system monitoring
Pros and Cons
  • "The ability to easily drill down into log queries quickly and efficiently has helped us to resolve several critical incidents."
  • "Datadog could always lower the price!"

What is our primary use case?

We are using Datadog for server metrics, log aggregation and searching, system monitoring, alerting the team about errors, and dashboards for our developers. It's used by the Site Reliability Engineering team and Management of all levels. 

It's assisting us in proving SOC 2 compliance.

We're looking to improve our usage of Datadog's RUM and APM components to get better and more performance insights on our production environments. 

We're also looking to leverage more synthetic monitors and runbooks for anyone responding to incidents.

How has it helped my organization?

The ability to easily drill down into log queries quickly and efficiently has helped us to resolve several critical incidents so far this year, and we heavily rely on a series of dashboards showing us various queues and load on CPU and memory for servers. 

We also have a view of the information required when we begin the patch and/or upgrade processes. 

I've also set up several monitors to alert the Site Reliability Engineering team when various metrics show a server might be reaching capacity. We use it to send an email suggesting we increase the size of the cloud instance.

What is most valuable?

The ability to easily drill down into log queries quickly and efficiently has helped us to resolve several critical incidents. We heavily rely on dashboards that are showing us various queues and load on CPU and memory for servers. 

We also have a view of the information required when we begin the patch and/or upgrade processes. 

I've arranged several monitors to alert the Site Reliability Engineering team when various metrics show a server that might be reaching capacity. We use it to send an email suggesting we increase the size of the cloud instance.

What needs improvement?

Datadog could always lower the price! In general, more demos online and maybe more free hands-on tutorials for basic functionality would be good for less technical users. 

I would also like to be able to amend the contract more than twice a year. As a smaller but growing company, it can be difficult to adequately predict demand.

For how long have I used the solution?

I've used the solution for more than three years.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Ian Schell - PeerSpot reviewer
Senior Site Reliability Architect at a tech vendor with 1,001-5,000 employees
Real User
Reduces debugging time, with good distributed tracing and useful RUM
Pros and Cons
  • "We have hundreds of microservices, and knowing how top-level requests weave throughout all of them is invaluable."
  • "There is occasional UI slowness and bugs."

What is our primary use case?

We use Datadog for general observability into our infrastructure, as well as running analytics queries for our SLI/SLO platform. This helps all of our teams be informed of how well their products are actually performing in production, and aim their efforts at the thing that will provide the highest ROI. 

We also use it for general monitoring and alerting during load tests and service releases to detect any issues related to the deployments. This helps us maintain our high contractual uptime promises to our clients.

How has it helped my organization?

It has drastically reduced the amount of time we spend on debugging issues and tracking down the root causes of incidents. What might have taken days or hours with separate vendors in the past (or even single vendors with terrible UI) is now quick and easy. 

We've often gone from detecting an incident to identifying the needed fix within ten minutes or less and covered multiple domains like APM, Logs, Database performance monitoring, etc., in just a few clicks. This is extremely powerful.

What is most valuable?

Distributed tracing is the most valuable feature. We have hundreds of microservices, and knowing how top-level requests weave throughout all of them is invaluable. 

At one glance, we can clearly see which service is slow and then switch over to the infrastructure view or container view to debug why the slowness is happening. This is true of all their other integrated products as well; the more you add, the more insights you get when looking at traces.

We also use RUM extensively. This helps us cover the last mile of application performance. Without it, we wouldn't know if our browser applications were functioning slowly for our users.

What needs improvement?

There is occasional UI slowness and bugs. While the Datadog UI is generally miles above its competitors, there are a few cases where it falls short or has started to slow down over time. They also occasionally make poor UI redesign choices. They should continue focusing on this area to maintain the high standard they started out with.

For how long have I used the solution?

I've used the solution for five years.

What do I think about the stability of the solution?

We've never had major stability issues.

What do I think about the scalability of the solution?

Scalability has never been an issue, although there is occasionally UI slowness.

How are customer service and support?

Support via tickets is absolutely terrible. It's the one obvious bad spot for Datadog. If we didn't have direct relationships with many of their product managers, our experience would be much worse.

How would you rate customer service and support?

Negative

Which solution did I use previously and why did I switch?

We previously used New Relic. It had a terrible UI and the integration between products was not great. Datadog is miles ahead of them and is continuing to increase that distance.

How was the initial setup?

The initial setup is straightforward, and the docs are done well.

What about the implementation team?

We managed the implementation in-house.

What was our ROI?

Our ROI is high.

What's my experience with pricing, setup cost, and licensing?

I'd advise users to negotiate rates. Datadog's off-the-shelf rates are pretty high.

Which other solutions did I evaluate?

We have only used and looked into New Relic.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
reviewer2004210 - PeerSpot reviewer
Cloud Specialist at a financial services firm with 501-1,000 employees
Real User
Centralized with good observability and many modules
Pros and Cons
  • "The most valuable aspect is for us to have everything in one place."
  • "We need a lot of modules since we collect all data logs from all operating systems."

What is our primary use case?

We collect all data logs from all operating systems, such as Windows, Linux, VMware, and bare metal data centers. We also automate the installation of the agent on servers.

Now we are starting a POC to analyze the APM module. In the future, the next step is to do a POC of the security modules.

The final idea is to have a single portal for observability. This will make troubleshooting easy at levels 1 and 2.

How has it helped my organization?

We are looking into a lot of modules. We collect all data logs from all operating systems, including Windows, Linux, VMware, and bare metal data centers. We also automate the installation of the agent on servers.

We're developing POCs for APM and security modules. We'll also have a single portal for observability. This will make it easy to troubleshoot.

The most valuable aspect is for us to have everything in one place.

What is most valuable?

We're investigating many modules. We collect all data logs from all operating systems (Windows, Linux, VMware, and bare metal data centers). We also automate the installation of the agent on servers.

We're doing POCs in APM and security.

Soon, we'll have a single portal for observability. This will make troubleshooting easy at levels 1 and 2.

The most valuable aspect for us is to have everything in the same place.

What needs improvement?

We need a lot of modules since we collect all data logs from all operating systems. 

The most important module for us is log management. The second is the security module. The third one is the APM.

For how long have I used the solution?

We've used the solution for one year.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
reviewer2000463 - PeerSpot reviewer
Technical Lead at a wholesaler/distributor with 1,001-5,000 employees
Real User
Great dashboards, easy to tweak, and showcases helpful metrics
Pros and Cons
  • "The ease of correcting these dashboards and widgets when needed is amazing."
  • "The parallel editing of the dashboards should not cause users to lose the work of another person."

What is our primary use case?

We use Datadog for observability and monitoring primarily. Various cross-functional teams have built various dashboards, including Developers, QA, DevOps, and SRE. 

There are also some dashboards created for senior leadership to keep tabs on day-to-day activities like cost, scale, issues, etc.

Also, we've set up monitors and alarms that kick off when any metrics go beyond the threshold. With Slack and PagerDuty integration, the right team members get alerted and react to solve the issue based on various runbooks.

How has it helped my organization?

Using Datadog metrics has helped the organization a lot in many ways. With one centralized monitoring place, it's a lot less effort to keep track of the system and applications' health.

Using this also helps teams be proactive in dealing with any issues before they get escalated by customers.

Lastly, having so many integrations makes life a lot easier for the DevOps and SRE teams when automating the detection and resolution of any issues hidden in the system or applications. Overall, it has helped a lot.

What is most valuable?

My favorite feature is creating dashboards, as that lets me sleep soundly at night without having to keep watch on critical system metrics. Be it DB metrics or computer-related metrics, it's always easy to view them.

The ease of correcting these dashboards and widgets when needed is amazing. 

The only issue I face is that when more than one person is editing these dashboards simultaneously, one or the other person sometimes loses their work. That said, they will resolve that soon. With the variety of widgets, it's so easy to plot the data in a timely manner, and that makes monitoring a lot easier.

What needs improvement?

The solution can be improved in a few areas. 

The parallel editing of the dashboards should not cause users to lose the work of another person. 

Secondly, we would like to see more demos of tools that are in beta when they go live. I am sure they will help us a lot.

For how long have I used the solution?

I've been using the solution for slightly over two years.

What do I think about the stability of the solution?

I find the solution to be very stable.

What do I think about the scalability of the solution?

I totally love it. It is scalable. 

Which solution did I use previously and why did I switch?

We previously used Sumo Logic.

How was the initial setup?

The initial setup is not so difficult.

What about the implementation team?

We implemented the solution in-house.

What was our ROI?

The ROI is very fair so far.

What's my experience with pricing, setup cost, and licensing?

I can't recommend the licensing.

Which other solutions did I evaluate?

I was not involved in any pre-evaluation process.

Which deployment model are you using for this solution?

Hybrid Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
reviewer2000472 - PeerSpot reviewer
Security Engineering Manager at a computer software company with 11-50 employees
Real User
Democratizes observability, great log searchability, and intuitive UI
Pros and Cons
  • "I find the greatest feature is being able to search across logs from various microservices."
  • "One area where I was really looking for improvement was the CSPM product line. I had really wanted to have team-level visibility for findings, since the team managing the resources has much more context and ability to resolve the issue, as the service owner. However, this was announced as an addition in a recent keynote."

What is our primary use case?

I use the solution to manage security-related logs and metrics, as well as create detection rules for security events. I am a security engineer, so one area of interest is the CSPM product, giving us the ability to look at findings across the cloud environment. 

The great part about the Datadog security products is that they incorporate the context of the resources/hosts where the security event is found. This allows us to see exactly what is running on a host for which we see a security alert.

How has it helped my organization?

The greatest impact it has had is on the ability to democratize observability and put monitoring into the hands of the people. Teams can quickly get the information they need, without needing a bunch of training, since the UI is super intuitive and easy for beginners. This helps reduce time to resolution during incidents and gives context to developers quickly and easily. Context is really important since seconds matter when the ship is down, and you don't know why.

What is most valuable?

I find the greatest feature is being able to search across logs from various microservices. As a member of the security team, I find that I often need visibility into other teams' services in order to get a good picture of our security posture.

I also am a fan of the ability to easily create monitors and get alerts into Slack quickly, without too much overhead. For example, I often need to create monitors where I am not too sure where the baseline lies. Having the ability to create anomaly monitors makes this process much more straightforward. Anomaly monitors are great for a security team.
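To illustrate the idea of alerting without a known baseline, here is a minimal sketch in plain Python. It is not Datadog's anomaly algorithm and uses no Datadog API; the function name `find_anomalies`, the window size, the threshold, and the sample data are all hypothetical. It simply flags points that sit far from the mean of the preceding few points.

```python
from statistics import mean, stdev


def find_anomalies(values, window=5, threshold=3.0):
    """Return indices of points that deviate from a trailing-window baseline.

    A point is flagged when it lies more than `threshold` standard
    deviations from the mean of the previous `window` points.
    """
    anomalies = []
    for i in range(window, len(values)):
        history = values[i - window:i]
        baseline = mean(history)
        spread = stdev(history)
        if spread == 0:
            # Flat history: treat any change at all as a deviation.
            if values[i] != baseline:
                anomalies.append(i)
            continue
        if abs(values[i] - baseline) / spread > threshold:
            anomalies.append(i)
    return anomalies


# Hypothetical failed-login counts per minute (illustrative data only).
failed_logins = [4, 5, 6, 5, 4, 5, 6, 48, 5, 4]
print(find_anomalies(failed_logins))  # prints [7]
```

The baseline is learned from recent data rather than set by hand, which is the property that makes this style of monitor useful when a sensible fixed threshold is not known in advance.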

What needs improvement?

One area where I was really looking for improvement was the CSPM product line. I had really wanted to have team-level visibility for findings, since the team managing the resources has much more context and ability to resolve the issue, as the service owner. However, this was announced as an addition in a recent keynote.

For how long have I used the solution?

Personally, I've used it my entire time employed here, more than three years.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
reviewer1996518 - PeerSpot reviewer
ITOPS and SRE Manager at Ticket
User
Good observability, available on the cloud, and capable of scaling
Pros and Cons
  • "The observability on offer is the most useful aspect of the product."
  • "The FinOps needs improvement."

What is our primary use case?

We primarily use the solution for observability.

How has it helped my organization?

The solution has helped with our POV phase.

What is most valuable?

The observability on offer is the most useful aspect of the product.

What needs improvement?

The FinOps needs improvement. 

What do I think about the stability of the solution?

The stability is good.

What do I think about the scalability of the solution?

The scalability is good.

Which solution did I use previously and why did I switch?

We previously used AppDynamics and Dynatrace.

Which other solutions did I evaluate?

We also evaluated AppDynamics and Dynatrace.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user