Here is the plan I suggest to globally move production to Spark 2:
- Update Spark2 version on the cluster (currently 2.1.1, move to 2.3) - T185581
- Puppetize oozie-sharelib update as explained here (https://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.6.0/bk_spark-component-guide/content/ch_oozie-spark-action.html#spark-config-oozie-spark2): https://gerrit.wikimedia.org/r/#/c/415465/
- Puppetize and install yarn shuffle jar install for Spark 2: https://gerrit.wikimedia.org/r/#/c/424593/
- Merge refinery-source spark2 patch (jobs tested) - https://gerrit.wikimedia.org/r/#/c/348207/
- Merge oozie jobs patch in refinery to use new jar and use new spark lib - (patch to come)
- Deploy and restart jobs
@Ottomata : Comments welcome!