Is there any way, in Ansible 1.9.6, to have Ansible retry a failed connection attempt x times at y-second intervals? We are seeing flaky network behavior when deploying on AWS, with apparently random SSH failures causing runs to break.
It’s never in the same place twice, so it would be nice if we could have Ansible back off and retry before bailing.
If not in 1.9.6 (which we’re stuck on for another few months… OpenShift reasons), how about 2.x?
That said, on big playbooks with a couple hundred hosts/tasks, I generally still see at least one or two generic connection failures. As in your case, they hit different tasks/hosts every time.