Dataset Config Store
Open, Needs Triage (Public)

Description

This Epic covers the work to implement a dataset config store so that users of the data platform can store technical configuration in one place. Subtasks will cover investigating existing solutions, reviewing similar work from other teams, and the work required to implement or adopt a solution.

This is the hypothesis from annual planning.

SDS 2.6.3: If we decide on a technical solution for a dataset configuration store and implement it for one existing data pipeline, then we can use the configuration tool to automatically generate simple data pipelines, which will support new metrics and speed up their path to production.

Technical Decision Record: https://docs.google.com/document/d/1tuoRviz3kNgUNOnSjtP5Pr6ikAiZWOdWxUytrDd1ZKs/edit#heading=h.3k1uzt7e33l4

High level deck: https://docs.google.com/presentation/d/15gGj41kCC7DiDKMtvPWaRp-jGoCO4ifuw9AhgqdBEzM/edit#slide=id.g15105b408d_0_287

Event Timeline

Worth investigating? https://datacontract.com/

Have we looked around to see if there are existing 'dataset' config formats/specs we can already use?

I have not investigated this avenue at all.
https://datacontract.com/ is cool, but I think we need some time to investigate the cost of adapting it to our needs before deciding whether to use it (or something else).