Like many, I’m having problems figuring out how to handle situations where one endpoint out of several fails when running a playbook.
One obvious approach is to create logic in my calling scripts (mostly bash) which will call a playbook until one of 2 things happen:
1 - No ‘<playbook_name>.retry’ files are found. That is, everything is successful
or
2 - Some number of attempts to run the playbook retries is exceeded.
Are there better options? And, would 2.0 negate the need to do this with the changes to ssh retries?
Thanks,
-T