Geographically distributed, fault-tolerant and “intelligent” application/host monitoring systems

not an answer really, but some pointers: definitivly take a look at presentation about nagios @ goldman sachs. they faced problems you mention – redundancy, scalability: thousands of hosts, also automated configuration generation. i had redundant nagios setup but at much smaller scale – 80 servers, ~1k services in total. one dedicated master server, one … Read more

Send all outgoing mail to /dev/null

I did this on my development box by disabling sendmail completely and then having a simple perl script listen on the SMTP port and dump the emails into a directory. I’m sure it’s possible to do with the sendmail configuration, but the perl script was much easier. Here’s it stripped down to the essentials: #!/usr/bin/perl … Read more

How to check HTTP response code of a secured website with Nagios?

I have tried using check_http but I get success even if the website is redirected to an error page This can be solved with check_http –expect. Here is the documentation from check_http –help: -e, –expect=STRING Comma-delimited list of strings, at least one of them is expected in the first (status) line of the server response … Read more

Generic Nagios plugin to check if a specific process/service is running?

I use the standard NAGIOS check_procs plugin, with the -C flag, shown here being invoked from nrpe.cfg via NRPE: command[check_spamd]=/usr/lib/nagios/plugins/check_procs -c 1: -w 3: -C spamd which will WARN if it doesn’t find at least three processes with the executable name (not counting path) spamd, and which will CRIT if it doesn’t find at least … Read more

Key differences between Nagios and Open NMS?

Wikipedia has a comparative table which helps a lot I think the key differences are: language: Nagios is written in C and OpenNMS in Java. It makes nagios a lot faster on older hardware. Data collection: Nagios performs very little data collection. Other software (like cacti for example) will be required for a more extensive … Read more

Nagios – new user doesn’t get enabled permissions

Check out the Nagios FAQ at No permission to view configuration, which says: Description: If you click on “View Config” and you get: “””It appears as though you do not have permission to view the configuration information you requested… If you believe this is an error, check the HTTP server authentication requirements for accessing this … Read more