Spike Goal
Determine what the user experience is when integrating DataHub with Spark
Key Questions:
- What do we get when we integrate DataHub with Spark? Is this something we want to support?
- Evaluate the creation of:
  - Pipelines
  - Tasks
  - Lineage between source and destination datasets
- Can this play a part in the broader Data-Platform strategy?
https://datahubproject.io/docs/metadata-integration/java/spark-lineage/
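
As a starting point for the spike, here is a minimal sketch of how a PySpark job might be wired to the DataHub lineage listener. The configuration keys and listener class are the ones we understand the linked page to describe, and should be verified against the agent version actually used. The helper names (datahub_spark_conf, build_session), the app name, and the localhost URL are hypothetical and for illustration only.

```python
from typing import Dict


def datahub_spark_conf(gms_url: str, agent_version: str) -> Dict[str, str]:
    """Return the Spark settings that attach the DataHub lineage listener.

    Assumption: key names and the Maven coordinate follow the linked docs page;
    check them against the agent release you test with.
    """
    return {
        # Pull the agent jar from Maven; the caller supplies the version.
        "spark.jars.packages": f"io.acryl:datahub-spark-lineage:{agent_version}",
        # Register the listener that reports run metadata to DataHub.
        "spark.extraListeners": "datahub.spark.DatahubSparkListener",
        # Address of the DataHub REST endpoint the listener sends events to.
        "spark.datahub.rest.server": gms_url,
    }


def build_session(app_name: str, conf: Dict[str, str]):
    """Create a SparkSession with the given settings (requires pyspark)."""
    # Imported lazily so datahub_spark_conf can be used without pyspark installed.
    from pyspark.sql import SparkSession

    builder = SparkSession.builder.appName(app_name)
    for key, value in conf.items():
        builder = builder.config(key, value)
    return builder.getOrCreate()
```

A test job for the spike could call build_session("datahub-spike", datahub_spark_conf("http://localhost:8080", agent_version)) with the agent version under evaluation, run a simple read-transform-write, and then check in the DataHub UI which pipelines, tasks, and dataset lineage edges were created.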