We use nagios as our alerting/monitoring solution and i have written an nsca based chef report handler that submits the chef client run status via send_nsca command to a nsca server (which in turn submits a passive check on the nagios.cmd pipe). It assumes nsca-client is installed on the chef client nodes and nsca daemon is running (and configured ) on the nagios box.
You can get the script here: https://github.com/ranjibd/nsca_handler
You can also use nrpe based active checks, but i guess that will introduce some delay, but also helps you detecting in case the client crashes (which im experiencing with ruby 1.9.2 )
ranjibOn Thu, Jun 30, 2011 at 4:54 AM, Robert Keng < " target="_blank"> > wrote:
Hi all,Just wondering if anyone knows if there is a way to monitor chef-client to make sure it isn't stuck in retries because of some error in some recipe? Does the client actually reply back to the chef-server if its able to apply all recipes and things are happy?
I apologize if this is a really simple question, fairly new to chef...Just I'd rather not have to parse /var/log/chef/client.log for this, thanks!
Archive powered by MHonArc 2.6.16.