awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 failed: 'Cannot connect to host 10.36.0.10:8052 ssl:False [Connect call failed ('10.36.0.10', 8052)]'
SUMMARY
After upgrading the Kubernetes cluster to v1.23.6, the AWX pods can no longer communicate with each other. The awx-web pod logs show the following error:
2022-05-10 15:37:04,088 WARNING [-] awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 failed: 'Cannot connect to host 10.36.0.10:8052 ssl:False [Connect call failed ('10.36.0.10', 8052)]'.
2022-05-10 15:37:04,090 DEBUG [-] awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 attempt number 2.
2022-05-10 15:37:12,102 WARNING [-] awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 failed: 'Cannot connect to host 10.36.0.10:8052 ssl:False [Connect call failed ('10.36.0.10', 8052)]'.
2022-05-10 15:37:12,104 DEBUG [-] awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 attempt number 3.
2022-05-10 15:37:16,111 INFO success: nginx entered RUNNING state, process has stayed up for > than 30 seconds (startsecs)
2022-05-10 15:37:16,111 INFO success: nginx entered RUNNING state, process has stayed up for > than 30 seconds (startsecs)
2022-05-10 15:37:16,111 INFO success: uwsgi entered RUNNING state, process has stayed up for > than 30 seconds (startsecs)
2022-05-10 15:37:16,111 INFO success: uwsgi entered RUNNING state, process has stayed up for > than 30 seconds (startsecs)
2022-05-10 15:37:16,111 INFO success: daphne entered RUNNING state, process has stayed up for > than 30 seconds (startsecs)
2022-05-10 15:37:16,111 INFO success: daphne entered RUNNING state, process has stayed up for > than 30 seconds (startsecs)
2022-05-10 15:37:16,111 INFO success: wsbroadcast entered RUNNING state, process has stayed up for > than 30 seconds (startsecs)
2022-05-10 15:37:16,111 INFO success: wsbroadcast entered RUNNING state, process has stayed up for > than 30 seconds (startsecs)
2022-05-10 15:37:20,123 WARNING [-] awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 failed: 'Cannot connect to host 10.36.0.10:8052 ssl:False [Connect call failed ('10.36.0.10', 8052)]'.
2022-05-10 15:37:20,126 DEBUG [-] awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 attempt number 4.
2022-05-10 15:37:28,142 WARNING [-] awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 failed: 'Cannot connect to host 10.36.0.10:8052 ssl:False [Connect call failed ('10.36.0.10', 8052)]'.
2022-05-10 15:37:28,143 DEBUG [-] awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 attempt number 5.
2022-05-10 15:37:34,151 INFO success: awx-rsyslogd entered RUNNING state, process has stayed up for > than 30 seconds (startsecs)
2022-05-10 15:37:34,151 INFO success: awx-rsyslogd entered RUNNING state, process has stayed up for > than 30 seconds (startsecs)
2022-05-10 15:37:36,152 WARNING [-] awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 failed: 'Cannot connect to host 10.36.0.10:8052 ssl:False [Connect call failed ('10.36.0.10', 8052)]'.
2022-05-10 15:37:36,154 DEBUG [-] awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 attempt number 6.
2022-05-10 15:37:44,169 WARNING [-] awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 failed: 'Cannot connect to host 10.36.0.10:8052 ssl:False [Connect call failed ('10.36.0.10', 8052)]'.
2022-05-10 15:37:44,170 DEBUG [-] awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 attempt number 7.
2022-05-10 15:37:52,180 WARNING [-] awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 failed: 'Cannot connect to host 10.36.0.10:8052 ssl:False [Connect call failed ('10.36.0.10', 8052)]'.
2022-05-10 15:37:52,182 DEBUG [-] awx.main.wsbroadcast Connection from awx-767cb986bb-fv9tr to 10.36.0.10 attempt number 8.
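To see at a glance which peer the pod keeps failing to reach, the wsbroadcast warnings in a saved log can be tallied. The following is only a rough sketch: the function name count_failures and the regular expression are my own assumptions based on the line format shown above, not anything provided by AWX.

```python
import re
from collections import Counter

# Assumed pattern, derived from the WARNING lines above. It stops before the
# quoted error message, so the quote style in the log does not matter.
FAILURE = re.compile(
    r"awx\.main\.wsbroadcast Connection from (?P<src>\S+) to (?P<dst>\S+) failed"
)


def count_failures(lines):
    """Return a Counter keyed by (source pod, destination address)."""
    counts = Counter()
    for line in lines:
        match = FAILURE.search(line)
        if match:
            counts[(match["src"], match["dst"])] += 1
    return counts
```

Fed the excerpt above, this would report a single (source, destination) pair, which matches what the log shows: every failure is from awx-767cb986bb-fv9tr to 10.36.0.10.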
ENVIRONMENT
AWX version: 0.21.0
Operator version: 0.21.0
Kubernetes version: 1.23.6
AWX install method: K8S via operator
STEPS TO REPRODUCE
Remove the deployment and redeploy on Kubernetes 1.23.6.
EXPECTED RESULTS
The pods communicate with each other and AWX comes online.
ACTUAL RESULTS
Nothing happens; the service does not appear to come online.
ADDITIONAL INFORMATION
This seems to be related to SSL.
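One way to test that guess: the "Connect call failed" text in the log suggests the TCP connection itself did not succeed, which would happen before any SSL handshake. A plain TCP check run from inside the awx-web pod could help tell the two apart. This is a minimal sketch using only the Python standard library; the helper name tcp_reachable and the 3-second timeout are assumptions of mine, and the address and port are the ones from the log.

```python
import socket


def tcp_reachable(host, port, timeout=3.0):
    """Attempt a plain TCP connect (no TLS) and return (ok, detail)."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True, "connected"
    except OSError as exc:
        # Covers refused connections, timeouts and unreachable networks.
        return False, f"{type(exc).__name__}: {exc}"


# Example call, using the peer address and port from the log:
#   tcp_reachable("10.36.0.10", 8052)
```

If this also fails, the problem is more likely pod-to-pod networking (CNI, network policy, firewall) than SSL. If it succeeds, SSL or the application layer would be the next thing to look at.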
AWX-OPERATOR LOGS
--------------------------- Ansible Task StdOut -------------------------------
TASK [Remove ownerReferences reference] ********************************
ok: [localhost] => (item=None) => {"censored": "the output has been hidden due to the fact that 'no_log: true' was specified for this result", "changed": false}
{"level":"info","ts":1652197108.1342492,"logger":"proxy","msg":"Read object from cache","resource":{"IsResourceRequest":true,"Path":"/api/v1/namespaces/awx/secrets/awx-broadcast-websocket","Verb":"get","APIPrefix":"api","APIGroup":"","APIVersion":"v1","Namespace":"awx","Resource":"secrets","Subresource":"","Name":"awx-broadcast-websocket","Parts":["secrets","awx-broadcast-websocket"]}}
--------------------------- Ansible Task StdOut -------------------------------
TASK [Remove ownerReferences reference] ********************************
ok: [localhost] => (item=None) => {"censored": "the output has been hidden due to the fact that 'no_log: true' was specified for this result", "changed": false}
{"level":"info","ts":1652197108.564614,"logger":"runner","msg":"Ansible-runner exited successfully","job":"471168312615460271","name":"awx","namespace":"awx"}
----- Ansible Task Status Event StdOut (awx.ansible.com/v1beta1, Kind=AWX, awx/awx) -----
PLAY RECAP *********************************************************************
localhost : ok=63 changed=0 unreachable=0 failed=0 skipped=46 rescued=0 ignored=0
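The recap reports failed=0 and unreachable=0, so the operator run itself completed without task failures. For checking this automatically across several operator log dumps, a small parser along these lines could be used. It is a sketch only: parse_recap is a hypothetical helper, and it assumes the host name contains no colon.

```python
def parse_recap(line):
    """Parse an Ansible PLAY RECAP line of the form 'host : key=value ...'."""
    # Split on the first colon: host name on the left, counters on the right.
    host, _, stats = line.partition(":")
    counters = {}
    for token in stats.split():
        key, _, value = token.partition("=")
        counters[key] = int(value)
    return host.strip(), counters


host, counters = parse_recap(
    "localhost : ok=63 changed=0 unreachable=0 failed=0 skipped=46 rescued=0 ignored=0"
)
# counters["failed"] and counters["unreachable"] are both 0 for the run above.
```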