Scalable Spark Clusters (RE: Matt, Do you want your weekends back?)

Good day,

I wonder if you could assist me. I am looking for a way to automate deployment of the newest versions of Apache Spark with Ansible Tower 3. The objective is to eventually replace Cloudera for handling the DevOps.

Further to the above, I am currently on a project / educational path toward understanding the science behind Big Data and analytics. I am a competent systems engineer in public and private proprietary cloud environments, and I would like your professional opinion on how to release-engineer Apache Spark 2.0.0 and Hadoop 2.7.2 clusters into the public cloud, possibly using Apache Ignite.

I look forward to your response and assistance, as I have a fellow student who is counting on me for a viable solution by the close of the weekend.

Thank you in advance,

Matt :v:

TheSolutionIsX.com

It looks like there are several roles on Ansible Galaxy for installing Spark, which is probably a good starting point for seeing what others have done:

https://galaxy.ansible.com/list#/roles?page=1&page_size=10&autocomplete=spark