IP/MPLS Engineer at a comms service provider with 1,001-5,000 employees
Real User
It's helpful for seeing where we're having problems with the network, but the display could be more intuitive
Pros and Cons
  • "The installation is no problem. I've installed Nagios several times."
  • "The way Nagios displays information isn't easy for a new user to understand. It's not intuitive enough. You need to read some tutorials or be trained to understand what it's displaying. Also, I think it needs more features to improve network visibility because there are some things you can't detect."

What is our primary use case?

We use Nagios XI for monitoring and seeing where we're having problems with the network. 

What needs improvement?

The way Nagios displays information isn't easy for a new user to understand. It's not intuitive enough. You need to read some tutorials or be trained to understand what it's displaying. Also, I think it needs more features to improve network visibility because there are some things you can't detect.

For how long have I used the solution?

I've been using Nagios for more than five years.

What do I think about the stability of the solution?

Nagios is stable.

Buyer's Guide
Nagios XI
April 2024
Learn what your peers think about Nagios XI. Get advice and tips from experienced pros sharing their opinions. Updated: April 2024.
768,924 professionals have used our research since 2012.

What do I think about the scalability of the solution?

Nagios has good scalability. 

How was the initial setup?

The installation is no problem. I've installed Nagios several times.

What other advice do I have?

I rate Nagios XI six out of 10. It's one of several network monitoring systems we have, and it has some features we can't get from other platforms.

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
PeerSpot user
Engineer at a tech vendor with 51-200 employees
Real User
Only Monitoring Tool You will Need

Why is OMD a Better Choice than Zabbix or Zenoss

I spend about 3 weeks vetting through 20+ open source monitoring solutions and at the end of the process, the choices had boiled down to few major ones - OMD (best combination of open source plug-ins put together for Nagios), Zabbix, and Zenoss.

The main components of OMD are Check_MK, PNP4Nagios, Nagvis, and of course Nagios. Among these projects, Check_MK is the core of OMD that makes Nagios easy to configure, easy to scale, and mashed together all the other popular Nagios plug-ins into one unified user interface. Thus the following comparisons are done using Check_MK as the keyword, but I will also cover how other plug-ins makes OMD project stand out from the competitions.

Trend

Check_MK vs Zabbix vs Zenoss Core Trend

enter image description here
A quick Google Trend search will tell you that Check_MK is up and rising. Together with the Nagios’s community size, you can certainly find custom monitoring plug-ins created by community members and save yourself time from reinventing the wheel.

Project Health

Before you pick any open source tool for enterprise projects, you want to make sure that their code is not stale and the community is vibrant for the years to come. Active community and frequent code updates ensure your questions get answered and fast bug fixes. Free service from Ohloh will give you an overview of those aspects on open source projects. The following comparison charts are created from Ohloh.

Number of Code Commits Made by Each Project

enter image description here
Check_MK is a clear winner in this chart. It tells you that Check_MK is constantly making more improvement than the other 2 projects.

Number of Contributor of Each Project

enter image description here
In this chart, Check_MK’s contributor is increasing and will soon surpass Zabbix. And don’t forget it is standing on giant’s shoulder, the largest monitoring community - Nagios.

User Reviews

Don’t just listen to me. Here is one of the blog post that talks about why moving away from Zabbix to Check_MK.

Moving From Zabbix to Check_MK

Architecture Design Advantage

OMD

What is OMD:
OMD is a combination of best practices on how Nagios should be setup and integrated. It has incorporated all of the most popular 3rd party Nagios plug-ins in single easy to maintain, easy to install, and easy to upgrade package. Once you have your Linux server running, installing and have your OMD monitoring suite running only takes about 10 minutes with one command.

Administrators can really save time on not having to compile Nagios, or other plug-ins, trying to integrate and mess with configurations between plug-ins and Nagios. It really is a no-brainer to setup and start with.

Why use OMD instead of other flavors of Nagios combos, e.g. ?
Founded July, 2010 by a group of well known Nagios community members and Nagios addon developers
e.g. NagVis, Check_MK, PNP4Nagios, and others

Check_MK

What is Check_MK

Check_MK is an extension to the Nagios monitoring system that allows creating rule-based configuration using Python and offloading work from the Nagios core to make it scale better, allowing more systems to be monitored from a single Nagios server.

enter image description here
There are 2 significant modules that Check_MK uses to improve Nagios performance. One is called Livestatus and the other is called Livecheck.

Livestatus

Before Livestatus ☹
  • Monitoring results are stores to a single file status.dat. It becomes a bottleneck on CPU and IO for larger installation.
  • status file status is not realtime, default is to update every 10 seconds.
  • NDOUtils utilize databases for monitoring results (MySQL or PostgreSQL), but still have some severe shortcomings.
  • NDOUtils has complex setup.
  • NDOUtils needs a databases to be administered, a rapidly growing one.
  • NDOUtils eats up significant portion of your CPU resources just to keep the database up to date.
  • Some similar projects that still uses NDOUtils:
  • Regular housekeeping of the database can hang your Nagios for minutes or even an hour once a day.
After Livestatus ☺
  • Livestatus also uses Nagios Event Broker API like NDO, but it does not actively write out data. Instead, it opens a socket by which data can be retrieved on demand.
  • Livestatus imposes no measurable burden on CPU at all.
  • Livestatus produces zero disk IO when querying status data.
  • No configuration is needed. No database is needed. No administration is necessary.
  • Livestatus scales well to large installation even beyond 50,000 services.
  • Livestatus give you access to Nagios-specific data that is not available to any other methods.

Livecheck

Before Nagios 4.0, Even a perfectly tuned system rarely manages to execute more then a few thousand checks per minute.
What make things worse: while your system is getting larger, the maximum check rate is even getting worse. The more hosts and services your system manages, the less checks per second it will be able to perform. Why?
Existing Problems of Nagios (before Nagios 4.0) ☹
  • Each new check creates a new fork
  • The new process prepare everything needed to execute the check plug-in, then fork the second time when ready
  • Forking is costly even for highly optimized Linux kernel
  • The forking of Nagios core (before v.4.0) does not scale on multiple CPUs (single thread process).
  • you can well run into a situation where your powerful 16-CPU server is limited to 100 Checks per second while most of its CPU cores are idle most of the time.
How does Livecheck solve those bottlenecks ☺
  • It uses a number of helper processes. The core communicate with each helper through a Unix socket (that does not appear in file system).
  • Only a small helper program is forked instead of the complete Nagios monitoring core.
  • The helper forks distribute over all available CPUs instead of single CPU.
  • The total process VM size of Livecheck is about 100KB only!
  • Inline implementation of check_icmp (PING tests). To give you an idea of how much improvement this has done, here is a benchmark example using dual core 2800 MHz CPU:
    • Before inline check_icmp: 300 ICMP checks per second.
    • After inline check_icmp: 2600 ICMP checks per second. The checks generated an ICMP traffic of 45Mb/s.

Nagios Monitoring Core working with the best plug-ins (Check_MK, NagVis, PNP4Nagios and etc)

enter image description here

Multisite - An Advance Web Interface for Nagios

Multisite is part of the Check_MK project as a better web UI alternative for Nagios.

A new and innovative GUI for viewing Nagios status information and controlling your monitoring system. It is based on MK Livestatus and aims at replacing the Nagios web GUI (also known as “the CGIs”). Multisite supports distributed monitoring in a very efficient way.

Zero Configuration Files with WATO

This is one of the most brilliant solutions from Check_MK project to tackle the notorious Nagios configuration disaster. Although Nagios is a flexible and powerful monitoring system, having to mess with its multi-level and confusing configuration files scares many people away. Now, there are many web interface plug-ins that try to take a stab at the issue, but WATO is by far the best that simplify the complexity of Nagios configuration while staying very flexible and more flexible by sitting on top of Check_MK.

WATO is a web based administration tool for Check_MK. It allows you to manage your hosts and services to be monitored and perfectly supports Check_MK’s mechanism of inventory to autodetect services to be checked on a host. WATO allows to move a substantial part of the daily workload from the monitoring administrator to his colleagues.

Monitoring Agent for both Linux and MS Windows

enter image description here

Responsive UI for Mobile Client

Powerful Search Function

Visual Meters with Perf-O-Meter

enter image description here

NOC with Dashboards (Thanks to PNP4Nagios & Nagvis)

PNP4Nagios

Nagvis

NagVis is a visualization addon for the well known network managment system Nagios. NagVis can be used to visualize Nagios Data, e.g. to display IT processes like a mail system or a network infrastructure.

Sample Navigation in Nagvis

Automation and Web Services for Automated Provisioning

Automation is build into Multisite. You can make web service request against Multisite to automate adding new host, enabling new service checks, or embed any of the host/service check web pages into any other websites.

This feature makes it very easy to integrate with Puppet or Chef for automatically adding new servers(hosts) and services to the monitoring system.

24/7 NOC with Flexible Notification

With Check_MK abstracting the original Nagio’s notification scheme, it has become possible to send notifications of any hosts or services to any number of people at any time.

You can even create custom script to send the notification in some creative ways like having the notification be ☎called via a VoIP server to your cell phone and read you the alert message or have the alert be sent to your ✐instant messenger.

Custom Icons

http://mathias-kettner.de/checkmk_devel_multisite_icons.html

Management and Maintenance

Distributed Monitoring

Distributed WATO allows you to manage several monitoring sites through a logically centralized WATO.
  • 1200 Check_MK installations
    • Centralized status of all 1200 stores per minute
    • Using NagVis to show all 1200 stores’ status on the map using the Geomap function
    • All stores’ overall status is aggregated through the use of the Business Intelligence function

enter image description here

Backup of Changes

  • Automatic Check_MK configuration backup on every change you make
  • Easy restoration with the Thunder icon
    enter image description here

Upgrade OMD

Business Intelligence

Available from version 1.2.3

Predictive Monitoring

  • Smart threshold that detects anomaly from daily operation
  • Set warning level based on prediction
    enter image description here

Available from version 1.2.3

Monitor Cronjobs

Before

<code>5 0 * * * root /usr/local/bin/backup >/dev/null</code>

After

<code>5 0 * * * root mk-job nightly-backup /usr/local/bin/backup >/dev/null</code>

Available from version 1.2.3

Dive in to the OMD World

I will be sharing how I install OMD, optimized web interface (Multisite), utilized passive checks, implemented 24/7 on call plan, and integrated with automated business processes. I will add link here once they become available.

  1. How to Setup OMD in 1 Hour
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Chris Childerhose - PeerSpot reviewer
Chris ChilderhoseLead Infrastructure Architect at ThinkON
ExpertTop 5Real User

Wow very nice write up and detail. We are debating to switch off Nagios over to Solarwinds at the moment but I might have Management check this review.

See all 2 comments
Buyer's Guide
Nagios XI
April 2024
Learn what your peers think about Nagios XI. Get advice and tips from experienced pros sharing their opinions. Updated: April 2024.
768,924 professionals have used our research since 2012.
it_user12228 - PeerSpot reviewer
Systems Administrator at a cloud provider with 501-1,000 employees
Vendor
I've used both Nagios and SolarWinds- different enviorments have different preferences

I've used both Nagios and SolarWinds, although I haven't messed with nagios as much as solarwinds. Solarwinds had some nice features for monitoring and I learned a lot about it in the short time that I actually worked with it. Nagios just seems to work and when a server is down, I investigate. Solarwinds seemed to have more issues but that could have been because it was running on a server 2003 box and possibly old hardware where as the company I'm working at now runs nagios on a linux box with some decent hardware. Again, I haven't delved deep into Nagios and it's possible that what I'm looking at is just a webpage front end to nagios that not everybody uses but it's still pretty nice regardless.

Not sure if different environments have different prefferences but the company I was working at that used solarwinds was an ISP. The company that uses nagios is a web hosting company. I've also seen a linux admin at a previous job use nagios so it may be that nagios is more popular among linux if not servers altogether.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
it_user2652 - PeerSpot reviewer
it_user2652Project Manager at a non-tech company with 10,001+ employees
PopularVendor

Can we use Solarwinds to monitor linux as well as solaris operating systesm?

PeerSpot user
Senior Manager of Engineering at a tech services company with 501-1,000 employees
Consultant
The Reactor helped improved our script automation and self-response.

Valuable Features:

It has a lot of flexibility for customization and a wide range of metrics. Also, it is opensource and has a big community.

Improvements to My Organization:

The Reactor helped improved our script automation and self-response. XI has provided us with top flexibility for heterogeneous systems monitoring.

Room for Improvement:

Nagios needs to improve their incident manager. Currently, it isn't good for ITIL in my opinion. The Reactor is good to go, but XI needs to improve its reporting functionality.

Use of Solution:

We use bothe Nagios XI and Nagios Reactor.

Deployment Issues:

We have had no issues with the deployment.

Stability Issues:

There have been no performance issues.

Scalability Issues:

It's been able to scale for our needs.

Other Advice:

I would advise that you create a lab with Nagios Core and test what you really need. Although it's exciting to use all the products, only a few are really important in your IT structure. When you are confident with scripting and MIBS integration, you can consider expanding it to your Enterprise systems with Nagios XI and some other modules. I would discourage you from using the ticketing system to start with and choose something more dedicated.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
PeerSpot user
Senior Systems / DevOps Engineer with 1,001-5,000 employees
Vendor
It's easy to customize through scripts. The user interface needs to be improved.

Valuable Features:

In my experience with Nagios, I've found that the most valuable features are the scalability and extensibility through using scripts. It's easy to customize, and Nagios makes it easy to use languages you're already familiar with such as Bash/Python

Improvements to My Organization:

It's helped us improve as we now have the ability to customize the solution. By doing this through scripting, we are now able to monitor every layer of our stack from infrastructure to applications.

Room for Improvement:

I feel that the maturity and user interface needs to be improved. I think this is handled through its integration with OpsView.

Deployment Issues:

We have had no issues with the deployment.

Stability Issues:

There have been no performance issues.

Scalability Issues:

It's been able to scale for our needs.

Initial Setup:

If you plan ahead of time and thoroughly test the final solution you want to implement, it should be straightforward.

Other Advice:

Plan ahead and take your time through staging the installation and take your time testing your customized scripts before doing the production installation.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
PeerSpot user
Manager of Infrastructure at a university with 10,001+ employees
Vendor
It's helped us to improve our incident management, problem management, and capacity management.

What is most valuable?

It's ability to monitor a good range of services out of the box and it's extensibility with the use of plug-ins, with an active user community sharing community contributed plug-ins to cover monitoring for a very wide range of services and components.

How has it helped my organization?

It's helped us to improve our incident management, problem management, and capacity management.

What needs improvement?

The web interface is perfectly fit for purpose but isn't up there with the best modern admin panels. There is scope for improvement to make it more dynamic and responsive.

For how long have I used the solution?

I've been using it for three years.

What was my experience with deployment of the solution?

Nagios XI ships as a virtual appliance and is easily deployed.

What do I think about the stability of the solution?

There have been no performance issues.

What do I think about the scalability of the solution?

Clear instructions are provided on how to expand the disk capacity of the appliance and the solution can be scaled to allow several distributed Nagios monitoring instances to work together.

How are customer service and technical support?

Customer Service:

Customer service is excellent.

Technical Support:

I haven't needed any technical support.

Which solution did I use previously and why did I switch?

We previously used the free version of Nagios and decided that the commercially packaged version Nagios XI was well worth the cost and reduced the admin overhead significantly.

How was the initial setup?

Initial setup is straightforward and allows monitored hosts to be added one-by-one using a wizard or added in bulk using the bulk add tool.

What about the implementation team?

An in-house implementation was done. I would advise that you need to understand the overall basic architecture of Nagios before implementing it.

What's my experience with pricing, setup cost, and licensing?

In my view this product is very good value for money when compared with other commercially provided monitoring solutions.

What other advice do I have?

Implementation is best done by people with a good general Linux/open source skill set. If you have these kinds of people then implementing Nagios should be fairly straightforward. If you have a bunch of Window admins who've never touched a Linux system, then implementation could be a challenge.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
it_user67689 - PeerSpot reviewer
CEO at a tech services company with 51-200 employees
Consultant
It does exactly what we need: Alert us when something is wrong

We use Nagios because it does exactly what we need: Alert us when something is wrong and (with PNP) show some trend to figure out when or maybe how the problem started. The graphical output is for sure not state of the art, but we are not artists but rather techs ;) As a hosting business we have quite a homogenous infrastructure and use mostly snmp for fetching data in our own scripts. Once the basic setup for Nagios is done it is quite easy to automatically script new hosts when you add new machines or delete the old ones. However your choice might be different if you have to monitor IT of various customers with totally different infrastructure.
And finally there are apps for android, iPhone, etc so with just one look at your phone you see if everything is well.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
it_user4401 - PeerSpot reviewer
it_user4401Developer at a transportation company with 1,001-5,000 employees
Vendor

Nagios is a very useful tool for system administrators, which provide good monitoring for network and servers. It's free and from my point of view it's fast and efficient. Nagios is stable and has many plugins available for monitoring. The software is robust, but, as a con, is an absolute pain to configure it. There is a very intuitive dashboard page that the administrator can leave open on an extra monitor to view at any given time and it will even rotate between different pre-selected views if the administrator likes that.

Network Engineer at a tech services company with 11-50 employees
Real User
Provides for better network visibility and identification of connection problems
Pros and Cons
  • "The most valuable feature is its support for different types of devices, where it can use all of the equipment that you need."
  • "I would like to see more customization in the network map because it is a bit tricky to use it."

What is our primary use case?

We use this solution for the monitoring of network devices in multiple areas. I use this solution on a daily basis, and the first thing that I do when I go into the workplace each morning is to check on all of the equipment.

How has it helped my organization?

This solution has improved our ability to determine the source of a connection failure in our network. Prior to using Nagios XI, we did not know whether the drop in service was a result of our system, or whether it was the fault of our service provider. Now, we are able to monitor the service provider's router and make that determination.

What is most valuable?

The most valuable feature is its support for different types of devices, where it can use all of the equipment that you need. You can find and change your speed in the network topology, and if something goes down then it will be noticed immediately.

What needs improvement?

I would like to see more customization in the network map because it is a bit tricky to use it.

First, when you click on one of the devices on the map, it shows you information about the equipment. Sometimes, there is information that I do not want to show. I would instead like to choose what is shown.

Next, I would like to have the ability to show only a subset of equipment on the network map. When you view it, all of the equipment that you have is there, every time. Sometimes, I would like to choose just a group of devices to view.

For how long have I used the solution?

Four months.

What do I think about the stability of the solution?

This is a stable solution. On a scale of one to ten, I would rate the stability a nine.

What do I think about the scalability of the solution?

The scalability is perfect. I can add as much equipment as I like.

How are customer service and technical support?

I have not needed to contact technical support. Everything installed easily and it works fine.

Which solution did I use previously and why did I switch?

We did not use another solution prior to this one.

How was the initial setup?

The initial setup is very straightforward.

The deployment took approximately one hour. The difficult part is configuring the equipment, such as setting up the IP addresses. Because I have more than fifty devices, it took me some time. The task is simple, albeit repetitive.

What's my experience with pricing, setup cost, and licensing?

The licensing fees for this solution are approximately $3,000 USD per year.

Which other solutions did I evaluate?

When I was researching solutions I read about Nagios XI, and it was rated as one of the best solutions. I tried it and I liked it, which is why it was selected.

What other advice do I have?

I would rate this solution an eight out of ten.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Buyer's Guide
Download our free Nagios XI Report and get advice and tips from experienced pros sharing their opinions.
Updated: April 2024
Buyer's Guide
Download our free Nagios XI Report and get advice and tips from experienced pros sharing their opinions.