Nagios Technical Expert at a tech vendor with 51-200 employees
Real User
The High Availability Feature Keeps Things Very Stable
Pros and Cons
  • "The Script Module in Nagios is really easy to use and is really cost efficient."
  • "The reporting part should be made simpler. While we can obtain all the reports we need, we always have to create work-arounds to get them."

What is most valuable?

The High Availablity feature is the most important. I have not seen Nagios crash during the last three years. The Script Module in Nagios is really easy to use and is really cost efficient.

How has it helped my organization?

This solution provides 24/7 monitoring of the IT infrastructure and provides real-time monitoring for the devices, with alerts.

What needs improvement?

The reporting part should be made simpler. While we can obtain all the reports we need, we always have to create work-arounds to get them.

For how long have I used the solution?

I have used it for the last three years.

Buyer's Guide
Nagios XI
April 2024
Learn what your peers think about Nagios XI. Get advice and tips from experienced pros sharing their opinions. Updated: April 2024.
768,924 professionals have used our research since 2012.

What do I think about the stability of the solution?

The only issue faced is that after 20K services Nagios starts acting weird, due to the amount of checking it does and the amount of data stored in the database.

What do I think about the scalability of the solution?

No, there is no issue with scalability.

How are customer service and support?

As I am working from Asia, and Nagios is based in America, we face the issue of time zones. But overall, trouble shooting and support from the Nagios technical team is superb.

Which solution did I use previously and why did I switch?

I have worked on various tools in past like Spiceworks, Idera, and ManageEngine. We switched to Nagios for lower cost and good performance.

How was the initial setup?

The setup is really easy and can be done by any non-technical person as well. Nagios also provides an OVF for ESXi and VMware, which has everything set up; makes life easy.

What's my experience with pricing, setup cost, and licensing?

The pricing is really cost efficient. The licensing is perpetual and can be renewed very easily.

Which other solutions did I evaluate?

We evaluated ManageEngine, Solar Winds, Idera and Check_MK.

What other advice do I have?

The over all performance of the product is superb. Just keep in mind during implementation to keep the hardware strong, according to the environment you are monitoring and based on the check Nagios is going to perform.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
it_user607749 - PeerSpot reviewer
it_user607749Manager, Live Production at a computer software company with 1,001-5,000 employees
Real User

I agree!

it_user123504 - PeerSpot reviewer
Co-Founder / Executive Director / Chief Technical Officer at a tech company with 501-1,000 employees
Vendor
We chose it because it was a flexible tool that happens to be open source and has the capability to provide a holistic view of all the systems configured on the network.

What is most valuable?

  • Monitoring Engine gives an in-depth dashboard of all the systems configured for monitoring
  • Capacity Planning shows the capacity trends of all the systems which enables you to plan appropriately for your needs
  • It has a good reporting tool and alert system

How has it helped my organization?

It improved our in-house support services as it helped changed them from being reactive to being proactive as notifications were sent about any device that was about to reach the set threshold.

What needs improvement?

It would be nice if the initial setup did not require as much tweaking and configuration and could just work out of the box.

For how long have I used the solution?

I've been using it for over a year.

What was my experience with deployment of the solution?

Initially, we encountered problems during the setup as some errors were noted due to a few pre-requisites not being fulfilled and having some configuration files to edit, and plugins to add.

What do I think about the stability of the solution?

There have been no performance issues.

What do I think about the scalability of the solution?

It's been able to scale for our needs.

How are customer service and technical support?

The initial setup was a little bit straightforward but cumbersome as there was more work in tweaking to make it work and creating configuration files for each devices that needs to be monitored.

Which solution did I use previously and why did I switch?

Nagios XI was chosen because it was a flexible tool that happens to be open source and has the capability to provide a holistic view of all the systems configured on the network. It also comes with a good Web Interface/Dashboard to update the system as needed.

How was the initial setup?

Our implementation was carried out by an in-house team.

What other advice do I have?

Nagios XI is an outstanding network Monitoring tool, which can be tweaked to suit any environment and it is a good solution for business looking for remarkable features at zero cost.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Buyer's Guide
Nagios XI
April 2024
Learn what your peers think about Nagios XI. Get advice and tips from experienced pros sharing their opinions. Updated: April 2024.
768,924 professionals have used our research since 2012.
it_user137748 - PeerSpot reviewer
Software Test Engineer at a comms service provider with 501-1,000 employees
Vendor
The monitoring and notification when an issue arises are the most valuable features.

Valuable Features

The monitoring and notification when an issue arises.

Improvements to My Organization

It was essential we had to have live monitoring as we were trial-ing a real-time communications application on a live site.

Room for Improvement

While the UI is functional, I have always thought there was room for improvement here. It is at times assumptive for the not so technical, rather than informative.

Use of Solution

I've been using it for 12 to 18 months. Presently I work as a software test engineer and my employer has been using Nagios for approximately ten years now. I first used Nagios in January 2014. I was in a small startup company based in the TSSG called Kodacall, it was based around WebRTC. I implemented in order that we could monitor our network as we had some clients who very kindly were letting us test our product on their live site. I really found Nagios a perfect fit for us and a good product to work with. As the saying goes “It does exactly as it says on the tin”. I hope what I have given might be of some small help.

Deployment Issues

It was deployed as a stand alone tool and is also integrated with Docker.

Stability Issues

It was deployed as a stand alone tool and is also integrated with Docker.

Scalability Issues

It was deployed as a stand alone tool and is also integrated with Docker.

Customer Service and Technical Support

I didn’t require any service as the documentation is good and most questions are answered on any number of forums.

Initial Setup

Setup ran pretty smooth. It's not overly difficult but it does require patience and some experience would make it easier. Like everything it is easier the second time around.

Implementation Team

We implemented it in-house.

Other Solutions Considered

Although we did look at other tools, we felt Nagios was a good fit for us.

Other Advice

I would have no hesitation in recommending it.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
PeerSpot user
NOC Engineer at a financial services firm with 501-1,000 employees
Vendor
It provides basic and complex monitoring of our environment as well as application-level monitoring, but notes can only be managed through the database.

What is most valuable?

There are a lot of plugins available and you can also write your own.

How has it helped my organization?

The product allows comprehensive monitoring of your environment. From basic monitoring of your servers health, to more complex tasks like AD, database, and, to some extent, application-level monitoring. Another great feature is that the event handlers allow you to automate repetitive tasks in your environment.

What needs improvement?

There's no convenient way to manage notes as the only way is to manage them through the database.

For how long have I used the solution?

I've used it for three years.

What was my experience with deployment of the solution?

No issues encountered.

What do I think about the stability of the solution?

No issues encountered.

What do I think about the scalability of the solution?

No issues encountered.

How are customer service and technical support?

Customer Service:

I've worked on several occasions with Nagios customer service and technical support, always received very professional and prompt responses.

Technical Support:

I've worked on several occasions with Nagios customer service and technical support, always received very professional and prompt responses.

Which solution did I use previously and why did I switch?

No previous solution used.

How was the initial setup?

It's very straightforward and and it has good documentation.

What about the implementation team?

We implemented it in-house.

What's my experience with pricing, setup cost, and licensing?

Nagios Core is a free-ware which has almost the same functionality as NagiosXI. It's a bit less convenient to use, since it doesn't have UI for the configuration.

Which other solutions did I evaluate?

No other options were evaluated.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
it_user423219 - PeerSpot reviewer
it_user423219Sales Team at a tech vendor with 11-50 employees
Real User

Hi Dima, You can add config notes in Nagios Core and comments. It's not very straight forward at the moment though. I've heard it's being revamped in coming versions though. One other thing you can do is link to a wiki if you have one internally. Thanks for sharing your experience :)

it_user68025 - PeerSpot reviewer
IT Administrator at a tech services company with 51-200 employees
Consultant
What is Nagios: An Introduction to enterprise level server monitoring

Any internet based service company like for example web hosting, DNS hosting, Email-hosting, Cloud architectures, and even CDN networks have server's ranging from several hundreds to thousands. There might be different roles that are played by different servers that are geographically isolated from each other. As a whole these geographically separated servers might be providing a combined service to the end customer. A particular issue or problem on any of the server should not affect the customer service, and must be found and fixed before the outage happens.

Let's take two examples which will explain the need for a 24 x 7 monitoring of these servers. Suppose that you get a call from your technical support team saying that several customers are complaining about their websites being inaccessible. Such complaints without any other details are very difficult to troubleshoot, if you do not have a 24 x 7 server monitoring in place. During crisis, you cant waste time by checking the basic below mentioned things.

  • Server Disk Space
  • Swap and memory utilization
  • Processes and its status
  • Load on the server
  • RAID array status
  • File system mount status
  • Web server status

Because its quite normal to miss some or the other, by manually looking for basic issues on the server. What if the issue that was causing the problem was simply due to a RAID drive failure, due to which one of the disks were inaccessible( which contains the document root for some websites hosted ).

Such problems can be monitored for and can be warned before a complete failure occurs. Another funny example would be to find that a customer facing service was not working as desired for hours, simply due to a lag in time from a Network Time Server.

It is not at all feasible for a system administrator to look each and every log, and service settings, and other configurations round the clock. There needs to have some automated tool to continuously keep on monitoring these required services and settings on the server, and inform the concerned people in case of an issue. A good server and infrastructure monitoring tool must have the following characteristics.

  • Must have a web interface which clearly outlines the issues that a particular host/server has.
  • Must inform different concerned people in case of an issue.
  • Must send pagers, mails, and text messages to the developers and system administrators concerned with a particular service failure.
  • The tool must have the capability to take actions such as restarting a service, based on the current status.

What is Nagios

Although there are many proprietary monitoring tools out there to select from depending upon the requirement, no proprietary tool can provide the peer review, source code modification, and version iterations that an open source tool provides.

Nagios is an open source network monitoring tool that provides all those capabilities we discussed above in one package. Nagios monitors the servers and network devices(in fact i must say any network device which is accessible with an IP address can be monitored using Nagios) and alerts you when a particular service that's being monitored goes wrong, and also will alert you when the service comes back to normal required state. Nagios is capable of doing the following things.

  • Monitoring of different services on a server, such as SMTP, HTTP, POP, IMAP, PROXY, and the list goes on. In fact you can make nagios to monitor anything on the server(You just need to make a custom script according to your requirement)
  • 24 x 7 monitoring of server resources like CPU, Memory, Swap, Load
  • A nice web interface which indicates the status of the services by three methods OK, Warning, Critical
  • Maintaining a different set of contact groups(which will contain email addresses of different concerned people), based on the service

In this tutorial, we will be having a look at the major components of Nagios, which helps nagios to complete its task of maintaining a good monitoring infrastructure.

Let's begin this tutorial by understanding how a nagios server checks the status of a remote service on a remote server, and accurately report the output to you. In the world of nagios you will too often hear a term called plugins, which are readily available binary or small script based program, that checks the status of your required service or program.

Nagios checks the status of a remote service or program in multiple ways. Let's understand them one by one.

(1) Directly monitor services through network

In this first method the nagios server will execute a plugin on the nagios server itself, which will basically try to connect to a network service on the target server. Lets understand this through the following diagram.

In the above shown diagram, we have tried to depict how nagios process execute an example check(which is also sometimes called plugin), on the nagios server itself, which will connect to the http port 80 on the target server, and will record the response time.

Nagios server will execute the check at regular interval(as configured), to check the availability of the service. In the above shown example, the plugin is placed inside the nagios server, and no changes are done at the client side. You cant monitor all properties of a client that counts, through this method. This method can be used only to monitor, services that are available publicly. The main reason behind this is that, you need to login inside the client server, in order to monitor stuff like memory usage, process status, cpu load, and other stuff.

Hence this kind of plugins are very limited in its capability, but you can surely achieve a considerable amount of good 24x7 monitoring using this method, for publicly available services like SMTP, HTTP, DNS, FTP, PORT availability check, Remote MySQL & MSSQL etc.

(2) Nagios monitoring through SSH and NRPE

As mentioned in the previous method, without getting a login to the remote machine, the level of monitoring you can achieve is very limited, and also you cannot monitor all the services using that method.

You can achieve a 24 x 7 monitoring of the things that cannot be monitored directly through network with the help of two different methods, they are as mentioned below.

  • Check the status of a remote service by executing a plugin, that will be placed on the remote client, by loging inside the client with the help of SSH.

Related: Working of SSH explained

  • NRPE (Nagios Remote Plugin Executor), is a daemon that's installed as a stand alone or an inetd daemon that waits for requests from the nagios server on port 5666, to execute commands that are defined in its configuration file.

Let's frst undersand monitoring a remote host using SSH method. In this method, a user is made on all the client machines, which allows ssh login from the nagios server with the help of a predifined ssh key and execute a requred plugin to monior a required service.

This method of executing remote plugins on remote client with the help of SSH is a secure way to monitor. As a normal user logs in the remote client, the nagios server will be able to run any command that the normal user will be able to run(when i say run, i mean execute).

the plugins that reside in the remote client are sometimes called as local plugins as they are local to the remote host. to run local plugins on remote host,nagios uses a ready made command called check_by_ssh(we will be discussing the complete command usage of this plugin in a dedicated post of its own).

of cource you will not be sitting and entering passwords each and every time the check is executed by the nagios daemon. Login and execution of the remote plugin on the remote server using ssh must be seamless and also must be password less login. For this, you need to set up public key authentication of the user, which will be loging inside the remote server for executing the plugins.

Now let's see the another method of executing remote plugins.

Another method that is commonly used to achieve the successful execution of a remote plugin is NRPE. NRPE stands for Nagios Remote Plugin Executor. NRPE is a package that will be installed on all the remote hosts, that needs to be monitored. Mostly NRPE is installed as Xinetd service on the remote host, and by default it listens on the tcp port 5666.

Suppose the nrpe daemon receives a query from the nagios server, to execute a command on the local server, nrpe daemon looks inside the nrpe configuration files, for a command with the same name what nagios asked to run. Unlike ssh method, nrpe cannot run any command that the nagios server asks to run. Commands first need to be defined inside the nrpe configuration file. And only those commands can be run from the nagios server. Deploying ssh based nagios checks are much easier compared to nrpe method, because in nrpe method, you need to first install nrpe package on all the client servers that requires to be monitored.

Above diagram depicts the nrpe method of executing remote checks on a remote client with nagios. Nagios server has a check_nrpe plugin (which is very similar to the plugin check_by_ssh used in ssh method), which connects to the remote client on the port 5666, and executes the command, which is given as an argument to check_nrpe plugin(the command given as argument to check_nrpe plugin on the nagios server must also be defined in nrpe configuration files on the client, where the command will be executed.)

Nrpe method of monitoring remote host, by executing plugins on the remote machine is limited to the commands defined inside the nrpe configuration files on the client. Which means the command which you require to run on the remote machine, must be predefined in the nrpe configuration files on the client.

But check_by_ssh can be used to run any command, with executable permission to the user used to login to the remote machine.

Let's go ahead and understand the remaining two methods that can be used to monitor a remote host in nagios monitoring.

(3) Monitoring remote host with the help of SNMP in nagios

SNMP can be used to fetch the current value of different properties of a network device or any SNMP aware device. if you have SNMP daemon installed on your remote host, which needs to be monitored, then you can monitor hard drive, load, etc with the help of SNMP daemon.

Advantage behind using SNMP to monitor is because it is supported by a wide variety of devices like network switches, routers, UPS devices etc.

We will be doing a couple of posts on SNMP, for getting a better overview of the protocol and its usage. We will also be doing a dedicated post for monitoring devices with nagios and SNMP.

Above case of monitoring with snmp places the plugin inside the nagios server itself, which will be a generic snmp plugin that will be used to monitor all snmap related services, with different arguments given to it.

(4) Nagios Passive monitoring or NSCA (Nagios Service Check Acceptor)

Until now we have seen around 4 different methods, used to monitor a remote server using nagios. All of them worked by either a plugin placed on the nagios server or a plugin placed on the client, or by simple monitoring or publicly available service. In all the above mentioned method, the plugin execution or say command execution was initiated by the nagios server.

Let's now see a method, in which the client will execute a required plugin at a regular interval, and report the output of the execution to the nagios server. This is achieved with the help of a daemon called NSCA.

NSCA stands for Nagios Service Check Acceptor. This is installed as a daemon on the nagios server itself, and it will wait for the command result from the client.

This kind of nagios monitoring is called as passive monitoring, because nagios server is not the one that initates the checks on the client, but the client will execute the plugins specified, at regular interval with the help of a cron and report the output to the nsca daemon on the nagios server.

While reporting the output, the client will also send details like the service name, hostname, the output of the command executed to the nsca daemon, so that the nagios server can report the output exactly in the same way active checks are executed(active checks are those checks in which the command execution is initiated by the nagios server. Examples are check by ssh, nrpe etc.)

There are couple of things that needs to be understood, from the above shown diagram. NSCA is a daemon on the nagios server that waits for the command result from the client.

Send_nsca is a program that can be used to send a command result to the nagios server. The hostname, the service name, and other related details will be included in the command result send using send_nsca to the nagios server.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Lead Solution Consultant at a tech services company with 51-200 employees
Real User
Top 5Leaderboard
Scalable product with an easy setup process
Pros and Cons
  • "It is an open-source platform with valuable features for performance and stability."
  • "The product's stability could be even better."

What is our primary use case?

We use Nagios XI for server monitoring.

What is most valuable?

It is an open-source platform with valuable features for performance and stability.

What needs improvement?

The product's stability could be even better.

For how long have I used the solution?

We have been using Nagios XI for five years.

What do I think about the stability of the solution?

I rate the product's stability a nine out of ten. There is room for improvement.

What do I think about the scalability of the solution?

I rate the product's scalability a nine out of ten.

How was the initial setup?

The initial setup is easy. It requires a team of ten engineers to execute the process.

What about the implementation team?

Our employees implement the product with the help of third-party vendors.

What other advice do I have?

I rate Nagios XI a nine out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Flag as inappropriate
PeerSpot user
PeerSpot user
System Administrator at a hospitality company
Vendor
Unlimited Insight Into Multiple Infrastructures And You Can Customize It With Basic Scripting Skills
Pros and Cons
  • "You want to monitor a specific metric that nobody else has? You can do it even with the most basic of scripting skills, and you can always share it with the vast community of Nagios Exchange."
  • "The PNP4Nagios plugin not working easily with XI is an issue for me, because some open source monitoring plugins do not work out of the box. But in the end, you learn to live with it."

What is most valuable?

The main characteristic I adore is the open source character of it. You want to monitor a specific metric that nobody else has? You can do it even with the most basic of scripting skills, and you can always share it with the vast community of Nagios Exchange.

How has it helped my organization?

Like any monitoring tool, it gave me insight into multiple infrastructures I've been a part of, without any limitation (due to the open-source character that I referred to above).

What needs improvement?

It's more what I personally don't like, rather than what areas need improvement. For example, the PNP4Nagios plugin not working easily with XI is an issue for me, because some open source monitoring plugins do not work out of the box. But in the end, you learn to live with it.

For how long have I used the solution?

I've been using Nagios since about 2005. I've seen the development path through the open-source version (and some other forks of it like Icinga and OMD) but for the last four years I've been entirely using the XI branch.

What do I think about the stability of the solution?

On older versions I had some minor issues. Currently, to be honest, it is as stable as I could hope for a monitoring tool.

What do I think about the scalability of the solution?

Up to now, the infrastructures I've been a part of were not so large, up to 200 hosts and 1300 services. Even for XI which uses MySQL on the back end, a host with 8GB RAM and four vCPUs is adequate.

How are customer service and technical support?

With the open source forks, the community is vast and so is the knowledge around the product. Because of this, even though I have a valid commercial support bundle, I have never had the need to use it.

Which solution did I use previously and why did I switch?

No, I started with Nagios. I've used other apps also like Microsoft SCOM (which is not very good), Zabbix (which is very decent), Tivoli (which is also not very good), HP OpenView (which is vast and requires almost a duplicate infrastructure to run to its full extent), Icinga (a very good clone), Centreon (haven't used it much but it seems solid enough), but I've always ended up using Nagios.

How was the initial setup?

For the latest versions, for me, it is pretty straightforward.

What's my experience with pricing, setup cost, and licensing?

For the cost of the commercial product and support, and taking into account the open source characteristics of it, I believe it is difficult to a better value. Yes, it needs some time to configure and address its issues, but seriously, which monitoring solution does not?

Which other solutions did I evaluate?

Before going to Nagios XI (commercial, meaning with support), because of the relationship my company had with Microsoft, I evaluated also SCOM. As with Nagios, I went through the whole installation and configuration process. Because of my previous knowledge, I directly compared it with Nagios, and the latter won, hands down.

What other advice do I have?

Be prepared to put some time into it and research it appropriately. If there is an option for consulting services through the support channel, don't be afraid to use it.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
it_user607749 - PeerSpot reviewer
it_user607749Manager, Live Production at a computer software company with 1,001-5,000 employees
Real User

I agree!

PeerSpot user
Systems Engineer at a tech services company with 501-1,000 employees
Consultant
It has somewhat helped improved the workflow and technical processes with regards to response times. A better multi-tenant environment would be ideal.

What is most valuable?

It has a wide variety of plugins in existence, but when there's no plugin available it is quite simple to integrate one's own programs and scripts.

How has it helped my organization?

It has somewhat helped improved the workflow and technical processes with regards to response times, and identify the frequent point of failures so as to architect alternative solutions. Plus all the stuff about SLA is an area that for me that is hard to quantify, but there's a team of accountant types who are dedicated to pulling numbers out of their hats.

What needs improvement?

It would be nice to have a better, or alternative, dashboard, à la Thruk, to see business process groupings, rather than just host and services. Tis might give a better visual representation of how the company is performing, and the availability of mission critical services etc.

A better multi-tenant environment would be ideal where certain users have limited visibility instead of just limited functionality.

For how long have I used the solution?

I've been using it for about six or seven years. We currently have v3.4.1 in production and are currently developing and testing v4.1.1 prior to implementation. Currently, we use various plug-ins including NRPE, NSCA, NagiosQL 3.2.0, PNP4Nagios 0.6.24, NSclient++ 0.3.9, and nagios-plugins 2.1.1.

What was my experience with deployment of the solution?

We have had no issues with the deployment.

What do I think about the stability of the solution?

We're still experiencing stability issues with regards to collecting performance graphs.

What do I think about the scalability of the solution?

It's been able to scale for our needs.

Which solution did I use previously and why did I switch?

We migrated away from being Windows centric. Their prodcut, although quite powerful, it was Windows centric and as our fleet of Linux servers increased an alternative was needed to suport multiple operating systems.

How was the initial setup?

The installation and configuration was straightforward and not complex, but rather time consuming as there was no easy method to centrally deploy plugins or agents to hundreds of remote clients. The only complexity that we encountered was in the network configurations for our firewalls and routing.

What about the implementation team?

All work is done in-house, and over the years it seems easier to Google for a solution to a given problem, but often enough a problem will crop up where no-one has yet found a solution. You need to learn to take notes and document everything that you. This may seem time consuming, but it will certainly save a lot more time and work in the future.

What other advice do I have?

It gets the job done, but there's a lot of room for improvement. Make sure that you clearly identify which are the mission critical services and which aren't so as to avoid cluttering the dashboard and overwhelming IT staff with too much information.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Buyer's Guide
Download our free Nagios XI Report and get advice and tips from experienced pros sharing their opinions.
Updated: April 2024
Buyer's Guide
Download our free Nagios XI Report and get advice and tips from experienced pros sharing their opinions.