I have found that when running ansible, with either playbooks or direct commands, forking is not working as I'd expect. Whether I add -f 20, -f 3, or --forks #, it appears I still end up with only 1 remote command running at a time. I verify this by watching the processes on the ansible server: I only ever see 1 ssh session open to my remote machines at a time.
I am running an older version of ansible (1.2.2) because it is the latest in the Ubuntu LTS repo. Has anyone else seen this? Is there a problem with my syntax?
Correct. I am running a single command (such as apt-get update) against a dozen or so servers and want it to happen in parallel. Unfortunately, even with --forks 20 or -f 20 I still only get a connection to 1 server at a time. They run consecutively, not concurrently.
It only returns the uptime for one server at a time. I have also tried a sleep statement, for example, and only a single server returns its result after each sleep interval. They are not running in parallel for some reason.
They don't appear to be, however, and that is my concern. For example, I assume a command like:
ansible MyServers -m shell -a "sleep 20" -f 20
when run against a list of fewer than 20 servers should return output from all of them roughly 20 seconds after it is initiated. It does not: 1 server replies, 20 seconds later the next server replies, 20 seconds later the next one, and so on.
They are not running in parallel. What am I missing?
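To make concrete what I mean by "in parallel," here is a plain-shell sketch (hypothetical host names, nothing ansible-specific) of the timing I'd expect from a forked run: every job sleeps 2 seconds, yet the whole batch finishes in about 2 seconds rather than 6.

```shell
# Three backgrounded "hosts" each sleep 2 seconds; because they run
# concurrently, total elapsed time is ~2s, not 6s.
start=$(date +%s)
for host in host1 host2 host3; do   # hypothetical host names
  ( sleep 2; echo "$host done" ) &  # run each "host" in the background
done
wait                                # block until every background job exits
end=$(date +%s)
echo "elapsed: $((end - start))s"   # ~2s if parallel, ~6s if serial
```

What I am seeing from ansible instead is the serial case: total time equal to the sleep interval multiplied by the number of hosts.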
I have not manually changed the connection method, so I assume it is still the default paramiko with shared SSH keys. Here is a test showing that it does not run in parallel. Notice that each response arrives 10 seconds after the previous one, even though -f 10 is specified.
ansible Dev -m shell -a "sleep 10 && date" -f 10
pdx-cass-d02 | success | rc=0 >>
Thu Nov 7 22:54:39 UTC 2013
pdx-extws-d02 | success | rc=0 >>
Thu Nov 7 22:54:49 UTC 2013
pdx-intws-d01 | success | rc=0 >>
Thu Nov 7 22:55:00 UTC 2013
pdx-extws-d01 | success | rc=0 >>
Thu Nov 7 22:55:10 UTC 2013
pdx-fep-d01 | success | rc=0 >>
Thu Nov 7 22:55:20 UTC 2013
pdx-cass-d01 | success | rc=0 >>
Thu Nov 7 22:55:30 UTC 2013
pdx-lb-d01 | success | rc=0 >>
Thu Nov 7 22:55:40 UTC 2013
pdx-mq-d01 | success | rc=0 >>
Thu Nov 7 22:55:51 UTC 2013
pdx-sql-d01 | success | rc=0 >>
Thu Nov 7 22:57:35 UTC 2013
pdx-job-d01 | success | rc=0 >>
Thu Nov 7 22:56:11 UTC 2013
pdx-listen-d01 | success | rc=0 >>
Thu Nov 7 22:56:21 UTC 2013
pdx-sql-d02 | success | rc=0 >>
Thu Nov 7 22:58:05 UTC 2013
So, using -vvvv, it appears that ansible first connects to ALL servers to copy out the tmp files for execution, the password files, etc. It then connects to the first server individually to execute the command; only once a response is returned does it connect to the next server.
The verbose output is quite extensive. I'm not sure what else I should be looking for...
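One way to see whether the forks are actually being spawned (independent of the verbose log) is to count the concurrent ssh worker processes on the control machine while the sleep command from above runs in another terminal. A sketch:

```shell
# While `ansible Dev -m shell -a "sleep 10" -f 10` runs in another
# terminal, count concurrent ssh processes. With working forks this
# should briefly reach ~10; with the serial behavior described above
# it never rises past 1. The [s] trick keeps grep from matching itself.
ps ax -o pid=,command= | grep '[s]sh' | wc -l
```

Repeating that count every second or so (e.g. under `watch`) during the run makes the serial-vs-parallel behavior obvious.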
Posting on an old thread, since I just hit this same issue with ansible 1.9.2.
In the end, I was able to solve it by removing my persisted ssh control sockets like this:
rm ~/.ansible/cp/*
I have no idea why exactly this solved the problem, but if you encounter the same issue, you can try it.
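For reference, the files removed above are OpenSSH ControlMaster/ControlPersist sockets, which Ansible's ssh connection plugin uses to multiplex sessions; a stale master socket can apparently wedge new connections. Their location is governed by the `control_path` setting in ansible.cfg. The value below is, to my knowledge, the 1.9-era default (shown only as an assumption for illustration; the doubled `%%` escapes a literal `%` for ssh's ControlPath expansion):

```ini
# ansible.cfg
[ssh_connection]
control_path = %(directory)s/ansible-ssh-%%h-%%p-%%r
```

So if `rm ~/.ansible/cp/*` helps, it is presumably because it forces fresh control masters to be established on the next run.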