Should old servers be retired?

Here’s a previous question and answers: Do you continue to use your end-of-life server/network equipment? And another one: How often does your company replace all its servers? At 5 years, for what sounds like mission-critical functions, I’d start looking at replacement even if they’re working fine. But since they are working fine, I’d plan out … Read more

Alternative to etsy/statsd

Check out the following two projects: batsd and batsd-dash. batsd is a daemon for aggregating and storing statistics, while batsd-dash is a configurable dashboard to present the data you have stored in batsd. About batsd: Batsd is a Ruby-based daemon for aggregating and storing statistics. It targets “wireline” compatibility with Etsy’s StatsD implementation, which they described … Read more
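To make "wireline compatibility" concrete, here is a minimal sketch of what a StatsD-style client puts on the wire: one plain-text datagram per metric, in the form name:value|type. The helper names, the metric names, and the host and port defaults are assumptions for illustration, not part of batsd or StatsD; check your daemon's configuration for the real listen address.

```python
import socket


def format_metric(name, value, metric_type):
    """Build one StatsD-style datagram: <name>:<value>|<type>."""
    return f"{name}:{value}|{metric_type}".encode("ascii")


def send_metric(payload, host="127.0.0.1", port=8125):
    """Fire-and-forget UDP send. host and port are assumed defaults."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(payload, (host, port))


# Hypothetical metric names, one per common metric type.
counter = format_metric("page.views", 1, "c")    # counter increment
timer = format_metric("db.query", 320, "ms")     # timing in milliseconds
gauge = format_metric("queue.depth", 42, "g")    # gauge value
```

Calling send_metric(counter) would emit the counter to whichever daemon listens on that address. Because the format is the same, a client written against one StatsD-compatible daemon should work against another.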

How to monitor glusterfs volumes

This has been a request to the GlusterFS developers for a while now, and there is no out-of-the-box solution you can use. However, with a few scripts it’s not impossible. Pretty much the entire Gluster system is managed by a single gluster command, and with a few options you can write your own health monitoring scripts. See … Read more
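As a rough illustration of the scripting approach, the sketch below parses the text printed by gluster volume info and reports any volume whose status is not Started. The sample output, the volume and server names, and the helper names are assumptions; the exact output layout may differ between GlusterFS versions, so verify it against your own installation before relying on this.

```python
import subprocess

# Assumed sample of `gluster volume info` output, used for illustration only.
SAMPLE = """\
Volume Name: gv0
Type: Replicate
Status: Started
Number of Bricks: 1 x 2 = 2
Bricks:
Brick1: server1:/data/brick1/gv0
Brick2: server2:/data/brick1/gv0

Volume Name: gv1
Type: Distribute
Status: Stopped
Bricks:
Brick1: server1:/data/brick1/gv1
"""


def parse_volume_info(text):
    """Map each volume name to its reported status."""
    volumes = {}
    current = None
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if not sep:
            continue  # blank line or a line without a key
        key, value = key.strip(), value.strip()
        if key == "Volume Name":
            current = value
            volumes[current] = None
        elif key == "Status" and current is not None:
            volumes[current] = value
    return volumes


def unhealthy_volumes(text):
    """Return the sorted names of volumes that are not in the Started state."""
    return sorted(n for n, s in parse_volume_info(text).items() if s != "Started")


def read_volume_info():
    """Run the real command; requires the gluster CLI on this host."""
    result = subprocess.run(["gluster", "volume", "info"],
                            capture_output=True, text=True, check=True)
    return result.stdout
```

A cron job or a Nagios-style check could call unhealthy_volumes(read_volume_info()) and alert when the list is non-empty.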

Disk Capacity Planning for Whisper / Graphite

whisper-info.py gives you a lot of insight into what each file stores and how it is aggregated, including the file’s size. However, it’s only useful for existing whisper files. When you want to see predictive sizing of a schema before putting it in place, try a Whisper calculator, such as the one available at https://gist.github.com/jjmaestro/5774063 EDIT: When … Read more
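The arithmetic behind such a calculator is simple enough to sketch. The version below is not the linked gist; it is a minimal estimate that assumes the usual Whisper on-disk layout (16 bytes of file metadata, 12 bytes of header per archive, 12 bytes per data point) and a 365-day year. The function names and the example retention string are hypothetical.

```python
import re

# Seconds per retention unit; a year is assumed to be 365 days.
UNIT_SECONDS = {"s": 1, "m": 60, "h": 3600, "d": 86400,
                "w": 604800, "y": 31536000}

METADATA_SIZE = 16      # file header (assumed layout)
ARCHIVE_INFO_SIZE = 12  # per-archive header (assumed layout)
POINT_SIZE = 12         # 4-byte timestamp + 8-byte value (assumed layout)


def to_seconds(token):
    """Convert a token such as '10s' or '5y' to seconds."""
    match = re.fullmatch(r"(\d+)([smhdwy])", token.strip())
    if not match:
        raise ValueError(f"unrecognised retention token: {token!r}")
    return int(match.group(1)) * UNIT_SECONDS[match.group(2)]


def whisper_size(retentions):
    """Estimate the file size in bytes for a 'precision:duration,...' schema."""
    archives = retentions.split(",")
    total_points = 0
    for archive in archives:
        precision, duration = archive.split(":")
        total_points += to_seconds(duration) // to_seconds(precision)
    return (METADATA_SIZE
            + ARCHIVE_INFO_SIZE * len(archives)
            + POINT_SIZE * total_points)


# Hypothetical schema: 10s for 6 hours, 1m for 7 days, 10m for 5 years.
size_bytes = whisper_size("10s:6h,1m:7d,10m:5y")  # 3300532 bytes, about 3.1 MiB
```

Multiply the per-file result by the number of metrics you expect to store to get a first estimate of total disk usage.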

Disable Monit alerts when PID changed

You can disable this alert by setting up a local alert statement. This is thoroughly documented here: http://mmonit.com/monit/documentation/monit.html#setting_a_local_alert_statement

This should do it:

    check process blop with pidfile /…/blop.pid
        alert recipient@address.com but not on { pid }
        start program = "…"
        stop program = "…"

What’s difference between monitoring, tracing and profiling?

This is the way I use these words. Others may have additional or different usages. Depending on the job at hand, I will use the terms differently. Development teams and operations teams have different needs and usages. Monitoring is monitoring. Usually it is ongoing, and preferably automated. Open source tools like Munin, Nagios, and MRTG … Read more