I'm looking to purchase an APM solution, and looking for some feedback.
I've shortlisted AppDynamics, New Relic, Dynatrace, Datadog, as well as some others like Big Panda and Corelogic that might be used to bolt on as value add. I have experience with AppDynamics, Datadog and Dynatrace but not New Relic - I'm leaning towards New Relic though.
I want to hear how you did your disc...
IT Technical Architect at a insurance company with 5,001-10,000 employees
20 October 20
There are many factors and we know little about your requirements (size of org, technology stack, management systems, the scope of implementation). Our goal was to consolidate APM and infra monitoring. We maintain critical processing on our mainframe so there was a desire to include this in our transaction trace. Due to a highly mature ELK implementation, we are not trying to incorporate log analytics into solution buy may consider in the future. We had AppD, Dynatrace, New Relic, and CA Wily all in house at the time of our evaluation. We eliminated Datadog due to a lack of real user monitoring and AppD based on experience and licensing. Between Dynatrace and New Relic, Dynatrace won based on the automation, integrated AI, support for "old" techs, and confidence we could eliminate multiple APM and infra monitoring tools.
I would not include products like BigPanda, MoogSoft, in this analysis. They are not monitoring solutions but event correlation solutions. You will need additional monitoring products to capture data and feed them. Having said that if you cannot consolidate tools you will likely need to purchase an event solution to make sense of all the alarms. We did evaluate these products but with Dynatrace AI did not feel the business value was there for the investment.
Here's a quick pro/con list on Dynatrace & New Relic from our analysis.
New Relic Pros: Insights is an awesome product and capability. Lots of capabilities and plugins to extend data collection. The APM dashboard is aesthetically pleasing and intuitive. Good training and documentation are available to support the product.
New Relic Cons: Requires lots of manual configurations to implement and support. Insights product requires an investment of time to achieve value. Licensing is a nightmare as there is virtually no transparency in what you are being charged for. Lack of solution to consolidate alerts across implementation other than significant investment in insights to manually achieve this.
Dynatrace Pros: Very simple to implement and maintain with out of the box automation which supports modern (cloud/Kubernetes) and "old" (mainframe). In-app chat is helpful. High integration of infra and APM data for full-stack observability and engineering. Topology and trace discovery is more reliable than other products or our CMDB. Synthetics are easy to set up for any user. AI-assisted problem analysis on the trace discovery streamlines troubleshooting. AI includes "events" in an analysis like VMotion, deployment events. Have not done yet but looking to leverage monitoring as code for a fully integrated and automated delivery pipeline. See keptn.sh open source project.
Dynatrace Cons: User SQL lacks some functions of NRQL for user analysis. Host, process, and service data is not available to query within the product. Alarm processing lacks some granular controls. The Plug-in library is less robust.
Good luck with your decision!
Alert aggregation was the primary requirement. BigPanda pulls all this together into a single UI for us, allowing us to see related alerts grouped together into an incident, and enables us to easily create a JIRA ticket and Slack channel to manage an issue.
Modern-day servers are robust enough to accommodate as many applications and processes as possible. Still, there is a limit to how much load a server can handle.
If your business does not heed the server constraints in time, you are bound to suffer from operational loss due to server downtimes. To closely monitor your server health, you must track specific metrics regularly.
Here are some s...
Collecting as many metrics, statuses, and logs about the servers is indeed the first step, you never know what data you will need to solve a particular problem. The second step is to process and correctly pinpoint where the network performance/behavior differs from the expected range/baseline.
Can your network monitoring software automate the obvious (execute remote corrective actions in response to alerts) and notify the IT person about only critical situations where the human needs to make a decision about the resolution options? We expect the network monitoring software today to do just that.
I would say NetCrunch can do it, but do you have any experience with other monitoring products that provide a similar type of monitoring experience for IT teams?
What Is AIOps?
AIOps is the practice of applying analytics and machine learning to big data to automate and improve IT operations. These new learning systems can analyze massive amounts of network and machine data to find patterns not always identified by human operators. These patterns can both identify the cause of existing problems and predict future impacts. The ultimate goal of AIOps is...
Future of NOC transformation unifies IT teams
NOC transformation could lead to unified IT operations with cross-domain teams, but not all enterprises need radical change when smaller upgrades and modernization do the job.
In the technology world, it can be easy to throw around the word transformation and lose the nuances of what it entails.
Consider the networking industry. Remote work req...
IT Operations Management (ITOM) refers to the administration of technology and application requirements within an IT organization. Under the ITIL framework, ITOM’s objective is to monitor, control, and execute the routine tasks necessary to support an organization’s IT infrastructure.
In addition to the above, an ITOM solution ensures effective provisioning and management of capacity, cost...
I have done the product for 22 plus years, whenever it was called OpC.
Some of that is still around in the last version I worked with 10.7x. I’m afraid that since Micro Focus bought the product it’s DOA, they don’t have any partners like HP had, independent consultants like myself are locked out of getting news work because Micro Focus unlike HP doesn’t promote independent work. They try to gobble up all of the little fish we used to make a living working for.
I have moved back to my Unix and Linux system administrator roots and I work for a large fortune 50 company that has multiple lines of business, a few still have the tools around. Our group manages several thousand servers using Core Nagios. It reminds me of the day when HP OpenView was easily deployed and configured. It became something of a green-eyed monster that of the 20 clients who I worked with over the 22 years not only have they dumped it but are using similar OSE tools to monitor their environments.