distributing files via several repos to balance load

Dear All,

I was going through Will Thames's Ansible examples for the Brisbane DevOps day and noticed the configuration file copy, or file transfer, pattern below:


- name: download java
  action: get_url url={{repo_url}}/{{java_archive}} dest={{tmpdir}}/{{java_archive}}

where repo_url is defined as a group variable

repo_url: 'http://repo.dev.example:8000'

My question: how can repo_url point to several URLs instead of one, so that when several nodes execute this task, Ansible load balances the requests and knows which URL to hand to which client?

kind regards

Walid

You could set the URL in a host or group variable, so that it would be more evenly distributed. Beyond that, you could set up a reverse proxy like Squid for your systems to fetch through, so that as soon as one system fetched the file you'd have a locally cached copy.
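As a rough sketch of the caching idea, assuming a Squid (or similar) cache listening at a hypothetical proxycache.example:3128, the download task can be pointed through it via the standard proxy environment variables (if Squid is instead run as a reverse proxy in front of the repo, repo_url would simply point at the proxy host):

- name: download java via the local caching proxy
  action: get_url url={{repo_url}}/{{java_archive}} dest={{tmpdir}}/{{java_archive}}
  environment:
    # hypothetical cache address; get_url honours the proxy environment variables
    http_proxy: http://proxycache.example:3128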

Hi James,

the URL was just one example; what if the transfer went through a copy/fetch module against a file server rather than an HTTP server? Also, could you elaborate on how one can extend the use of group variables to load balance the source in an automated way?

TIA

Walid

Yes, it would require splitting up your tasks and having your hosts in a second child group. For example, given an inventory file like this:

host1
host2
host3

[target1]
host1

[target2]
host2

[target3]
host3

And group_vars/target[1…3]:

target_url: http://…   # a different URL in each group's file

You could have a playbook that does the download like this:

- get_url: url={{target_url}}/{{java_archive}} dest={{tmpdir}}/{{java_archive}}
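To make the per-group wiring concrete, the three group_vars files might look something like this (the mirror hostnames are placeholders, not taken from the thread):

# group_vars/target1
target_url: http://repo1.dev.example:8000

# group_vars/target2
target_url: http://repo2.dev.example:8000

# group_vars/target3
target_url: http://repo3.dev.example:8000

Each host then resolves target_url from its own group, so the downloads are spread across the three repos without any extra logic in the playbook.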

Thanks James, will take this into consideration when scaling Ansible.

Or you can just point to a load balancer and have it take care of it based on load.

Dear Brian,

I was looking for a less intrusive, Ansible-internal solution that makes no assumptions about the underlying physical infrastructure. However, I do hear you and James: a load balancer, caching, and probably DNS round robin are all possible solutions. If nothing more automated is available inside Ansible, via a data directive or otherwise, I may file it as a feature request later. Another configuration management solution does have this to answer problems of scale: it is done with two to four directives, something like a select_attribute directive that selects from a list and automatically handles the division and allocation of sources to destinations, using automatically generated random weights based on a probability distribution, if that makes any sense.
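For the record, here is a rough sketch of what such automatic allocation could look like with plain Ansible variables: each host picks a mirror based on its position in the inventory, so the group is divided across the mirrors without a load balancer. The mirror list and URLs are invented for illustration, and this is a deterministic split rather than the weighted random allocation described above, but the spreading effect is similar:

# group_vars/all (hypothetical mirror list)
repo_mirrors:
  - http://repo1.dev.example:8000
  - http://repo2.dev.example:8000
  - http://repo3.dev.example:8000

# tasks: assign each host a mirror by its index in the group, then download from it
- name: pick a mirror for this host
  set_fact:
    target_url: "{{ repo_mirrors[groups['all'].index(inventory_hostname) % repo_mirrors|length] }}"

- name: download java from the assigned mirror
  get_url: url={{target_url}}/{{java_archive}} dest={{tmpdir}}/{{java_archive}}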

kind regards

Walid

One of the things I like about Ansible is that it doesn't try to do everything itself; it relies on existing, well-known and widely used solutions (ssh, sudo, cron, etc.).

That is why some of us will push outside solutions (TCP load balancer, DNS, etc.) versus seeing it built into Ansible. But it is perfectly understandable that other people have different preferences and would like to see more stuff 'built in'. It is very hard, if not impossible, to please everyone.

Very much agree with Brian here, this is a great case for using a load balancer.

BTW, if not shared already, “with_random_choice” can be useful if you want something basic. I temporarily forgot about this :)

- debug: msg={{ item }}
  with_random_choice:
    - boston
    - paris
    - tokyo
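Applied to the original question, that could look roughly like this (the mirror URLs are again placeholders): each host picks one of the repos at random at run time, which spreads the load statistically rather than deterministically.

- name: download java from a randomly chosen repo
  get_url: url={{ item }}/{{java_archive}} dest={{tmpdir}}/{{java_archive}}
  with_random_choice:
    - http://repo1.dev.example:8000
    - http://repo2.dev.example:8000
    - http://repo3.dev.example:8000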

Thanks Michael and Brian. Eventually we need to keep an open mind and adapt to tools and DevOps processes accordingly.