We need a way to utilize the produced dataset. What form it takes is TBD.
Update:
We receive the datasets in JSON dumps. Our current solution ingests the data into a PostgreSQL instance using scripts available here: https://github.com/schana/recommendation-missing-sections/tree/master/recommendation_missing_sections/data