Hi,
I’m having an issue with jobs hanging. These jobs patch Windows servers.
There are 3 templates that run concurrently. Each has an inventory of about 20 machines, and Forks is set to 10.
No errors are written to the job output screen; the job just stops progressing. The job can be cancelled and relaunched. Sometimes it hangs again, but often it runs fine.
When the job hangs, it displays the last task completed, AWX shows the status as Running, and it stays that way until it’s cancelled.
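One thing I could do to at least catch a hang sooner is flag running jobs whose output has gone quiet for too long. Below is a rough sketch only, assuming the job records have already been pulled into plain dicts. The `last_event` field, the `find_stalled_jobs` helper, and the 30 minute threshold are my own placeholders, not AWX field names or settings.

```python
from datetime import datetime, timedelta, timezone


def parse_ts(value):
    """Parse an ISO 8601 timestamp such as '2024-01-01T10:00:00Z'."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def find_stalled_jobs(jobs, now, max_idle=timedelta(minutes=30)):
    """Return (job id, idle time) for running jobs with no recent output.

    `jobs` is a list of dicts with hypothetical keys: 'id', 'status',
    and 'last_event' (timestamp of the most recent task output).
    """
    stalled = []
    for job in jobs:
        if job["status"] != "running":
            continue  # only jobs still marked as running can be hung
        idle = now - parse_ts(job["last_event"])
        if idle > max_idle:
            stalled.append((job["id"], idle))
    return stalled


# Made-up sample data, for illustration only.
sample_jobs = [
    {"id": 1, "status": "running", "last_event": "2024-01-01T10:00:00Z"},
    {"id": 2, "status": "running", "last_event": "2024-01-01T10:50:00Z"},
    {"id": 3, "status": "successful", "last_event": "2024-01-01T09:00:00Z"},
]
now = datetime(2024, 1, 1, 11, 0, tzinfo=timezone.utc)
for job_id, idle in find_stalled_jobs(sample_jobs, now):
    print(f"job {job_id} has been idle for {idle}")
```

This would not explain why a job hangs, but it would tell me which job and roughly which task to look at while the hang is still happening.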
As I can’t run the patching during business hours, I have been testing with the same inventories and a health check playbook that does a few things, such as checking disk space and checking for outstanding patches. I run these concurrently, as they would run on patching day, but I can’t get the jobs to hang. There were no disk space issues. CPU spikes to 100% on all 4 cores from time to time, but for the most part it stays below 70%. Swap isn’t touched and memory usage doesn’t go above 5GB of 15GB.
I’m not sure where to look to diagnose this further.
Any thoughts?
Thank you
Greg