Problem statement
Spark on Kubernetes (K8s) does not currently match the features we have with Spark on YARN. Specifically, we might run into issues with resource management, network/storage bottlenecks, and security management.
Primary Task
Document our technical explanations and discussions of the challenges we will run into when changing how we interface with Spark, and clarify the tradeoffs we will have to contend with.
Research Areas:
- Network & Storage Constraints - Decoupling
- How Scheduling Works
- Security Model