User Story
As a data engineer, I need to build an Airflow job that transforms event data* into Iceberg tables
* "Event data" here means events in the topics mediawiki.page_content_change and mediawiki.revision_visibility_change (revision_visibility_change will be handled separately)
Done is:
- Job is running on an hourly schedule on Airflow
- mediawiki.page_content_change might not be in a state where all of the data is available (should we limit the scope of this to 1 smaller wiki to make testing easier for now? We are consuming all wikis.)