User Story
As a data engineer, I want to begin consolidating our Refine jobs into Airflow, so that I can deploy, maintain and optimise our data pipelines faster.
Success Criteria
- Deploy our POC jobs into our production instance of Airflow, complete with monitoring and testing capabilities.
- Move all our Refine Scheduler jobs into our Airflow instance.
- Deprecate Refine, ensuring that it is only available through an Airflow operator.
Open questions / remarks
- Is our POC the right job to start with?
- Do we have all the required operators?
- Who needs to validate that the pipeline is working as intended?