AWX source control failure, but randomly works


Hi All,
We have an AWX deployment with two web and task replicas. Several jobs have been scheduled with the “Update revision on job launch” option enabled. For these scheduled jobs, we observe that some source control updates are failing without any error messages. Upon investigation, we found that the logs are getting stuck midway, resulting in a failed status. Please refer to the attached screenshot for further details.


Additionally, we have observed that the corresponding playbook job is also failing. The logs indicate a “Failure Explanation: Previous Task Failed”. On further inspection of the details in the playbook logs, the issue appears to be related to a missing resource in the Execution Environment. Interestingly, the same task runs perfectly at the next scheduled time.
It started happening all of a sudden, without any upgrades or modifications.

AWX version: 24.2.0

Please refer to the attached screenshots for more information.


I am also attaching the pod logs from the ocp for reference.


Has anyone had a chance to look at this post? Any suggestions?

@Rijo_Thomas hello, are you able to reproduce this error by running the playbook locally with the same EE using Ansible Navigator? If you're unfamiliar with this technology, please see “Getting started with Execution Environments” in the Ansible Community Documentation.

It would help to localize the issue. If it doesn't happen when you run it with Navigator, I think it's worth investigating further on the AWX side and filing a bug report in its repo.

The issue appears to be related to the source control update. When the source control update fails, the problem arises. However, if the source control update is successful, the playbook runs successfully, and we receive the correct output. There are no apparent Execution Environment (EE) issues with these jobs, so it does not seem to be related to the EE.
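
Since the failures are intermittent and there are two task replicas, one way to narrow this down might be to check whether the failed source control updates cluster on one task pod. The following is only a rough sketch, not something from our setup: it assumes the AWX REST API endpoint `/api/v2/project_updates/` and its `execution_node` and `status` fields are available in this version, and that an OAuth2 token is used for authentication. The function names and the base URL are hypothetical.

```python
import json
import urllib.request
from collections import Counter


def tally_by_node(updates):
    """Count project updates per (execution_node, status) pair."""
    counts = Counter()
    for update in updates:
        # An empty or missing execution_node is grouped under "unknown".
        node = update.get("execution_node") or "unknown"
        status = update.get("status") or "unknown"
        counts[(node, status)] += 1
    return counts


def fetch_project_updates(base_url, token, page_size=200):
    """Fetch the most recent project updates (first page only).

    Assumption: base_url is the AWX root URL and token is a valid
    OAuth2 token; neither value comes from the original post.
    """
    url = (
        f"{base_url}/api/v2/project_updates/"
        f"?order_by=-finished&page_size={page_size}"
    )
    request = urllib.request.Request(
        url, headers={"Authorization": f"Bearer {token}"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)["results"]


# Hypothetical usage (replace the URL and token with real values):
#   updates = fetch_project_updates("https://awx.example.com", "TOKEN")
#   for (node, status), count in sorted(tally_by_node(updates).items()):
#       print(node, status, count)
```

If nearly all failed updates land on a single node, that would point at that pod (storage, network, or resources) rather than at the project or the EE. If they are spread evenly, the cause is more likely elsewhere.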