Spark 2+ is a real improvelent over 1.6, it'd be great if we could have it available, and gently move our jobs to the new APIs.
Loose end TODOs:
- remove spark2-beeline
- spark-sql logging is too verbose with provided log4j.properties
- Make spark2 use hadoop native libs
- Make a spark2 assembly jar and put hdfs
- Wikitech documentation
- email announcement