Get_url with wildcards

roelvandepaar · April 5, 2024, 2:16pm

Hello,

I have problems downloading a file from a https repository.
What is the best way to download a file where I don’t know the full file name?

Using wildcards * or regex ([.*]) in the url of the get_url module does not work.
I can get the entire html of the url with the uri module, but then I would need a way to parse all that html to get a list of files.
And then I would need to build a logic that selects the correct file.

The repository does not have an API. It is just a httpd repository.

How do you approach such a situation?

This is what I tried:

    - name: get_url module
      ansible.builtin.get_url:
        url: "https://my-domain.tld/mydir/*.md5"
        dest: .
        validate_certs: false

Regards
Roel

Dustin · April 5, 2024, 3:34pm

Unfortunately, the get_url module is not going to work this way. You will have to do the discovery of the full URLs yourself first, then loop over the discovered URLs.

flowerysong · April 5, 2024, 6:59pm

That’s simply not how HTTP is designed to work, regardless of what method you’re using for retrieval. You have to know the full location of the resource you’re requesting.

roelvandepaar · April 8, 2024, 7:29am

I guess you are correct.

I used a workaround and parsed the html of the repository with curl and bash commands to get the file names. Maybe if the repository had an API there would be a more straightforward way.

jpmens · April 8, 2024, 8:02am

as long as you’re aware of how incredibly brittle and likely to break, that can be … if the repo software changes the way it presents their HTML, you might be up all night trying to figure out why your playbooks broke.

system · May 8, 2024, 8:03am

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Use get_url to download all files in a location Ansible Project	3	60	January 15, 2015
Download all files from a url Ansible Project	3	50	March 27, 2017
Downloading files with a specific regular expression from remote https host to target server local path. Ansible Project	6	5	March 27, 2020
Using get_url or uri - substitute for curl? AWX Project	1	187	May 29, 2020
get_url ... donlwoading html, nt raw content Ansible Project	1	10	June 5, 2023

Get_url with wildcards

Related topics