We are using the solution for scaling up the website for market data applications. EC2 and Datadog have enabled high-level monitoring of underlying infra and services.
The Datadog profiler comes in handy to pinpoint issues with resource utilization during peak hours, and traces/log management helps narrow down the root cause.
The network map is crucial in identifying bottlenecks and determining what needs more attention.
Host map helps identify problematic hardware and devise ways to counter issues that arise during scaling, and deploying solutions on the cloud.
While my team is relatively new to Datadog, I already see immense value in switching over to Datadog as the primary APM and NPM tool.
The arsenal of features it offers is bound to come in a clutch when facing production issues, and when finding out what went wrong is crucial.
The network map has helped to figure out the golden signals and optimize the infrastructure.
The synthetics have helped ensure the high availability of arch functions as intended.
The network map is useful. With it, we have the ability to see the data flow across the entire network path across all the applications is highly valuable as the data from this service helps identify network bottlenecks, non-performant applications, and bad endpoints.
This is especially crucial for a high-availability website aimed at market data applications where low latency is crucial.
The host map gives a clear picture of the entire infrastructure, and the ability to switch between logs, metrics, and traces is very handy when it comes to debugging issues on the fly.
I love the ability to install the integrations and agents quickly. This is a well-made product.
To be very fair, I haven't had enough experience with Datadog to pick out improvements.
My involvement with Datadog has largely been positive. I love the simplicity and intuitiveness it offers - even for nontechnical folks who just might be starting out with developing technical chops in their domain.
I've used the solution for three years.