capturing logs from ansible/ansible-playbook runs

Seth_Vidal · July 4, 2013, 6:22am

I wanted something like sfromm's ansible-report system but for a
variety of reasons I was concerned about the single sqlite db.

So I started hacking up a solution for lots of simultaneous runs of
ansible and capturing logs of the whole thing.

this is a callback plugin:

http://infrastructure.fedoraproject.org/cgit/ansible.git/tree/callback_plugins/logdetail.py

and this is a viewer for those logs:
http://infrastructure.fedoraproject.org/cgit/ansible.git/tree/scripts/logview

If you put the callback_plugin in a path for callback plugins specified
in your /etc/ansible/ansible.cfg then it will also capture data on
tasks run via ansible - not just ansible-playbook.
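
For readers who have not written one: a callback plugin of this era is a Python file defining a `CallbackModule` class, found through the `callback_plugins` setting in the `[defaults]` section of ansible.cfg. The sketch below is not the linked logdetail.py. It is a minimal illustration that assumes the Ansible 1.x hook names `runner_on_ok` and `runner_on_failed`, and it uses a hypothetical log directory and line format.

```python
import json
import os
import time


class CallbackModule(object):
    """Append one line per task result to a per-process log file."""

    # Hypothetical location; logdetail.py may use a different layout.
    log_dir = "/tmp/ansible-sketch-logs"

    def _log(self, host, status, res):
        os.makedirs(self.log_dir, exist_ok=True)
        # One file per process id, so simultaneous runs never
        # write to the same file.
        path = os.path.join(self.log_dir, "%d.log" % os.getpid())
        stamp = time.strftime("%b %d %Y %H:%M:%S")
        with open(path, "a") as fh:
            fh.write("%s %s %s %s\n" % (stamp, host, status,
                                        json.dumps(res, sort_keys=True)))
        return path

    # Hook names follow the Ansible 1.x callback interface (assumed here).
    def runner_on_ok(self, host, res):
        status = "CHANGED" if res.get("changed") else "OK"
        return self._log(host, status, res)

    def runner_on_failed(self, host, res, ignore_errors=False):
        return self._log(host, "FAILED", res)
```

Writing plain files keyed by process id avoids the shared-database contention mentioned above, at the cost of needing a separate viewer to aggregate the results.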

This is an example of the output from running

logview -d yesterday -p mirrorlist

mirrorlist
20.50.54 host1 Jul 03 2013 20:54:04 54 CHANGED /etc/nagios/nrpe.cfg
20.50.54 host1 Jul 03 2013 20:55:29 75 CHANGED restart nrpe
20.50.54 host2 Jul 03 2013 20:54:06 54 CHANGED /etc/nagios/nrpe.cfg
20.50.54 host2 Jul 03 2013 20:55:31 75 CHANGED restart nrpe
20.50.54 host3 Jul 03 2013 20:54:05 54 CHANGED /etc/nagios/nrpe.cfg
20.50.54 host3 Jul 03 2013 20:55:29 75 CHANGED restart nrpe
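
If you want to post-process output like the above, a small parser is enough. The post does not document the columns, so the field names below (`run_id`, `number` and so on) are guesses made for illustration only.

```python
from collections import namedtuple
from datetime import datetime

# Field names are guesses; the post does not document the columns.
LogLine = namedtuple("LogLine", "run_id host when number status detail")


def parse_logview_line(line):
    """Split one logview output line into the assumed fields."""
    # The first eight whitespace-separated tokens are fixed-width
    # fields; everything after them is the free-form task description.
    parts = line.split(None, 8)
    if len(parts) < 9:
        raise ValueError("unexpected line: %r" % line)
    run_id, host = parts[0], parts[1]
    when = datetime.strptime(" ".join(parts[2:6]), "%b %d %Y %H:%M:%S")
    return LogLine(run_id, host, when, int(parts[6]), parts[7],
                   parts[8].strip())


sample = "20.50.54 host1 Jul 03 2013 20:55:29 75 CHANGED restart nrpe"
entry = parse_logview_line(sample)
print(entry.host, entry.status, entry.detail)
```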

I thought I'd post it here in case it is helpful to anyone.
-sv

jpmens1 · July 4, 2013, 8:35am

Seth,

> So I started hacking up a solution for lots of simultaneous runs of
> ansible and capturing logs of the whole thing.

that's lovely! I'd like to propose the following patch, which avoids a
traceback when the process doesn't have a controlling tty (happens here,
dunno why).

Regards,

-JP

(attachments)

logdetail.patch (1.65 KB)

Seth_Vidal · July 4, 2013, 5:43pm

You are absolutely correct. Thank you for that. I had not tested it
outside of a controlling TTY.

-sv

Seth_Vidal · July 4, 2013, 6:01pm

I found one bug - if I sudo -i to root - geteuid() doesn't return who I
_was_ - only 'root' which is unfortunate - I'll have to see if I can
get a best possible situation which uses getlogin() if possible

What's the exception if you don't have a controlling tty with
os.getlogin() - I can just catch it and use geteuid()

-sv

jpmens1 · July 4, 2013, 6:20pm

> I found one bug - if I sudo -i to root - geteuid() doesn't return who I
> _was_ - only 'root' which is unfortunate - I'll have to see if I can
> get a best possible situation which uses getlogin() if possible

maybe
os.getenv('USER', os.getenv('LOGNAME', 'unknown'))

(Sorry, I'm away from $cust at the moment: can't check on the exception
I got.)

-JP

Michael_DeHaan2 · July 4, 2013, 6:25pm

BTW, if folks are interested in some really nice views into logging, I’d recommend signing up for the AWX webinar here:

http://ansibleworks.enterthemeeting.com/m/DBVI6YDJ

This is coming out August 8th, hopefully about a week after the AWX release. You’ll be able to try it out free for a small number of nodes.

Michael_DeHaan2 · July 4, 2013, 6:26pm

Slight correction, webinar is August 8th – logging is out as part of the release

Michael_DeHaan2 · July 4, 2013, 6:30pm

Am I terrible with words today or what

“out” means “is part of the release”

Seth_Vidal · July 4, 2013, 7:27pm

I ended up doing

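import os
import pwd

def getlogin():
    # os.getlogin() raises OSError when there is no controlling tty
    try:
        user = os.getlogin()
    except OSError:
        # fall back to the passwd entry for the effective uid
        user = pwd.getpwuid(os.geteuid())[0]
    return user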

which seems to step around it just fine
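
The fallback branch above still reports root after `sudo -i` when there is no controlling tty. One possible extension, sketched here as an assumption and not as what logdetail.py does, is to consult the `SUDO_USER` environment variable, which sudo sets to the name of the invoking user, before falling back to the effective uid. The function name and the injectable parameters are hypothetical and exist only to make the sketch easy to exercise.

```python
import os
import pwd


def invoking_user(environ=None, login_func=os.getlogin):
    """Best-effort name of the person who started the run."""
    environ = os.environ if environ is None else environ
    try:
        # Works when the process has a controlling tty.
        return login_func()
    except OSError:
        pass
    # sudo exports SUDO_USER with the name of the user who ran sudo.
    sudo_user = environ.get("SUDO_USER")
    if sudo_user:
        return sudo_user
    # Last resort: whoever we are effectively running as.
    return pwd.getpwuid(os.geteuid())[0]
```

Environment variables can be set by the caller, so a value obtained this way is a convenience for log labels and not something to rely on for auditing.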

-sv

Stephen_Fromm · July 4, 2013, 8:11pm

> I wanted something like sfromm's ansible-report system but for a
> variety of reasons I was concerned about the single sqlite db.

One comment on the above. I plan to devote some time in the coming weeks
to work on the concurrency problem when using a sqlite database. One can
still choose to use another database that doesn't have the limitations of
sqlite -- whatever sqlalchemy supports should work with ansible-report. Of
course, that does introduce dependencies that sqlite doesn't have.
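
As a point of reference for the concurrency concern, and not a description of how ansible-report works or will be changed, the sketch below shows two commonly used mitigations available in Python's standard `sqlite3` module: a busy timeout, so a writer waits for a lock instead of failing immediately, and write-ahead logging, so readers do not block the writer. The table and column names are hypothetical.

```python
import sqlite3


def open_event_db(path, busy_seconds=30):
    """Open a SQLite file with a busy timeout and WAL journaling."""
    # timeout: how long to wait on a locked database before raising.
    conn = sqlite3.connect(path, timeout=busy_seconds)
    # WAL lets readers proceed while one writer is active.
    conn.execute("PRAGMA journal_mode=WAL")
    conn.execute(
        "CREATE TABLE IF NOT EXISTS events "
        "(host TEXT, status TEXT, detail TEXT)"
    )
    conn.commit()
    return conn


def record_event(conn, host, status, detail):
    """Insert one event and commit right away to keep locks short."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT INTO events VALUES (?, ?, ?)", (host, status, detail)
        )
```

SQLite still allows only one writer at a time, so this reduces lock errors under many simultaneous runs but does not remove the serialization.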

> So I started hacking up a solution for lots of simultaneous runs of
> ansible and capturing logs of the whole thing.
>
> this is a callback plugin:
>
> http://infrastructure.fedoraproject.org/cgit/ansible.git/tree/callback_plugins/logdetail.py
>
> and this is a viewer for those logs:
>
> http://infrastructure.fedoraproject.org/cgit/ansible.git/tree/scripts/logview
>
> If you put the callback_plugin in a path for callback plugins specified
> in your /etc/ansible/ansible.cfg then it will also capture data on
> tasks run via ansible - not just ansible-playbook.
>
> This is an example of the output from running
>
> logview -d yesterday -p mirrorlist
>
> mirrorlist
> 20.50.54 host1 Jul 03 2013 20:54:04 54 CHANGED /etc/nagios/nrpe.cfg
> 20.50.54 host1 Jul 03 2013 20:55:29 75 CHANGED restart nrpe
> 20.50.54 host2 Jul 03 2013 20:54:06 54 CHANGED /etc/nagios/nrpe.cfg
> 20.50.54 host2 Jul 03 2013 20:55:31 75 CHANGED restart nrpe
> 20.50.54 host3 Jul 03 2013 20:54:05 54 CHANGED /etc/nagios/nrpe.cfg
> 20.50.54 host3 Jul 03 2013 20:55:29 75 CHANGED restart nrpe
>
> I thought I'd post it here in case it is helpful to anyone.

Looking at logdetail.py, how does it handle the case where you have a
playbook with the same name, but in different paths?

I like the minimal dependencies of this approach and that it is easy to
prune with something like logrotate.

jpmens1 · July 5, 2013, 5:24am

Seth,

> What's the exception if you don't have a controlling tty with
> os.getlogin() - I can just catch it and use geteuid()

The exception is
OSError: [Errno 2] No such file or directory

I remember now, that the message stumped me. But you've fixed it all,
so ignore.

-JP

Seth_Vidal · July 5, 2013, 2:34pm

> One comment on the above. I plan to devote some time in the coming
> weeks to work on the concurrency problem when using a sqlite
> database. One can still choose to use another database that doesn't
> have the limitations of sqlite -- whatever sqlalchemy supports should
> work with ansible-report. Of course, that does introduce
> dependencies that sqlite doesn't have.

cool!

> logview -d yesterday -p mirrorlist
>
> mirrorlist
> 20.50.54 host1 Jul 03 2013 20:54:04 54 CHANGED /etc/nagios/nrpe.cfg
> 20.50.54 host1 Jul 03 2013 20:55:29 75 CHANGED restart nrpe
> 20.50.54 host2 Jul 03 2013 20:54:06 54 CHANGED /etc/nagios/nrpe.cfg
> 20.50.54 host2 Jul 03 2013 20:55:31 75 CHANGED restart nrpe
> 20.50.54 host3 Jul 03 2013 20:54:05 54 CHANGED /etc/nagios/nrpe.cfg
> 20.50.54 host3 Jul 03 2013 20:55:29 75 CHANGED restart nrpe
>
>
> I thought I'd post it here in case it is helpful to anyone.
>

> Looking at logdetail.py, how does it handle the case where you have a
> playbook with the same name, but in different paths?

It doesn't. I realized that when I wrote it and just kinda said "meh".

If anyone would like to fix it I'm happy to accept a patch - but in our
use case the likelihood of having such a situation is extremely low.
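
One way the same-name case could be handled, offered as a sketch and not as a patch to logdetail.py, is to derive the log name from the playbook's full path and not only its file name. The function name and the eight-character hash suffix below are hypothetical choices.

```python
import hashlib
import os


def playbook_log_name(playbook_path):
    """Return a log name that differs for same-named playbooks
    stored in different directories."""
    full = os.path.abspath(playbook_path)
    base = os.path.splitext(os.path.basename(full))[0]
    # A short digest of the absolute path keeps names readable
    # while making them distinct per location.
    digest = hashlib.sha1(full.encode("utf-8")).hexdigest()[:8]
    return "%s-%s" % (base, digest)
```

A viewer that selects logs by playbook name would then need to match on the prefix before the hyphenated suffix and not on the whole name.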

> I like the minimal dependencies of this approach and that it is easy
> to prune with something like logrotate.

I like that your sqlite db is a million times easier to search

-sv

Brian_Coca1 · July 6, 2013, 1:09am

idk, i find grep and awk much easier to use than sql when looking at events.
