[chef] Re: Re: chef-client (still) randomly failing


  • From: Phil Cryer < >
  • To:
  • Subject: [chef] Re: Re: chef-client (still) randomly failing
  • Date: Tue, 19 Nov 2013 13:54:55 -0600

Can do, Daniel. Can you tell me how to build the gem manually? I've checked out https://github.com/opscode-cookbooks/chef-client and https://github.com/opscode-cookbooks/chef and have run "bundle install" successfully, but I don't see any info about how to build the gem. Also, can I just build the chef-client 11 gem and put it on an existing node that's still talking to chef-server 10.x? Thanks


On Tue, Nov 19, 2013 at 12:20 PM, Daniel DeLeo < > wrote:

On Tuesday, November 19, 2013 at 10:13 AM, Phil Cryer wrote:

I'm still failing to get chef-client to run on its own without failing due to timeouts. I have 4 CentOS 6.4 nodes in QA; overnight, usually 2 will fail and show as 'red' the next day. It runs hourly, so they'll be down for 10+ hours. I log in to the node, stop chef-client, run it manually, and it works.

1) Why is it timing out on NTP, and even failing as it retries 4 times in a row? 
2) Why doesn't it fix itself on the next run? It seems to fail, and then never recover.

Full logs here:
https://gist.github.com/philcryer/7549743

Can you check lsof and ps to see if there is a child process of chef that does not get killed and keeps the yum lock?
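
One way to script the check described above is sketched here; this is a rough illustration, not a Chef tool. It assumes Python 3.7+ is available on the node, that yum keeps its lock PID in /var/run/yum.pid (the usual location on CentOS 6, but verify on your system), and that the daemon's command line contains the string "chef-client". The function names (parse_ps, descendants_of, yum_lock_holder, report) are hypothetical helpers made up for this example.

```python
import subprocess

# Assumed lock-file location; adjust if your yum is configured differently.
YUM_LOCK = "/var/run/yum.pid"


def parse_ps(text):
    """Parse `ps -eo pid,ppid,args` output into {pid: (ppid, args)}."""
    procs = {}
    for line in text.strip().splitlines()[1:]:  # skip the header row
        parts = line.split(None, 2)
        if len(parts) < 3:
            continue
        pid, ppid, args = parts
        procs[int(pid)] = (int(ppid), args)
    return procs


def descendants_of(procs, match):
    """Return sorted pids that have an ancestor whose command line contains `match`."""
    roots = {pid for pid, (_, args) in procs.items() if match in args}
    found = set()
    for pid in procs:
        seen = set()
        cur = procs[pid][0]  # start from the parent
        while cur in procs and cur not in seen:
            if cur in roots:
                found.add(pid)
                break
            seen.add(cur)
            cur = procs[cur][0]
    return sorted(found)


def yum_lock_holder(path=YUM_LOCK):
    """Return the pid stored in the yum lock file, or None if there is no usable lock."""
    try:
        with open(path) as fh:
            return int(fh.read().strip())
    except (OSError, ValueError):
        return None


def report():
    """Print chef-client descendants and flag the one holding the yum lock, if any."""
    out = subprocess.check_output(["ps", "-eo", "pid,ppid,args"], text=True)
    procs = parse_ps(out)
    holder = yum_lock_holder()
    for pid in descendants_of(procs, "chef-client"):
        flag = "  <-- holds yum lock" if pid == holder else ""
        print(pid, procs[pid][1] + flag)
    if holder is None:
        print("no yum lock file found")
```

Calling report() on a failing node, before stopping chef-client, lists any leftover child processes; a line flagged as holding the yum lock would point to the situation asked about here.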

You may be hitting this issue: https://tickets.opscode.com/browse/MIXLIB-16

That’s been fixed in master and should get released with the next Chef 11.x release. You could try building the gem manually and installing it on one of your systems to see if it fixes the problem; that would be greatly appreciated.

-- 
Daniel DeLeo



