Hey all,
I am facing a timeout issue while trying to run a job template. This is our current setup:
AWX Version - 22.5.0 (AWX is running on OKD and is deployed using AWX Operator)
OKD Version - 4.11.0-0.okd-2022-12-02-145640 (Update Channel: Stable-4)
OpenSSH version on bastion host:
openssh-server-7.4p1-23.el7_9.x86_64
openssh-7.4p1-23.el7_9.x86_64
openssh-clients-7.4p1-23.el7_9.x86_64
OpenSSH version on remote server:
openssh-8.7p1-30.el9_2.x86_64
openssh-clients-8.7p1-30.el9_2.x86_64
openssh-server-8.7p1-30.el9_2.x86_64
The traffic flow is as follows:
AWX on OKD → Bastion Host/Jumpbox → Remote Server
Problem Statement:
When I run a template, the first few tasks complete successfully. After that, the remote server becomes unreachable, and the SSH logs on the remote server show “Timeout Before Authentication”. Here’s an example: