Hello guys:
I’m running AWX 15.0.1. Sometimes I run some jobs which get stuck without a clue what could be wrong.
For example, I have a job in Running State for more than 2 hours. Through the Web UI, the last log line is 1430 and this hasn’t changed in the last 2 hours. When I try to see logs from the container, I don’t find anything important like errors or warnings.
I logged in into the awx_task container and I see the ansible-playbook process still alive, however I cannot see any open SSH connection to any remote host. If ansible is not connected to any host, so what’s doing anyway?
I ran “awx-manage run_dispatcher --status” and only saw the JobID still running.
I have no idea how to troubleshoot this. I’m tempted to cancel and relaunch the job again, but if I do this I will never wrong the root cause of the problem.
Anyone of you could suggest any idea here?
I’d appreciate some help. Thanks in advance.