Parent task to organize all the things we want to do with DataHub.
SDS1.3 KR for FY24/25:
Achieve at least a 50% reduction in the average time required for data stakeholders to understand and trace data flows for 3 core and essential metrics
Parent task to organize all the things we want to do with DataHub.
SDS1.3 KR for FY24/25:
Achieve at least a 50% reduction in the average time required for data stakeholders to understand and trace data flows for 3 core and essential metrics
| Status | Subtype | Assigned | Task | ||
|---|---|---|---|---|---|
| Open | None | T369756 [EPIC] Datahub Improvements | |||
| Open | None | T366720 Public DataHub | |||
| Resolved | brouberol | T309622 Create Airflow Pipeline for Ingesting/Updating Superset data into DataHub | |||
| Resolved | BTullis | T316336 Upgrade DataHub to v0.8.43 | |||
| Open | None | T377789 Data Platform Data Lineage | |||
| Resolved | tchin | T306896 Integrate Spark with DataHub with lineage (Data-Engineering) | |||
| Resolved | tchin | T372899 Ingest a test hive database into datahub | |||
| Open | None | T378899 Upgrade to Spark 3.2 to support Spark lineage for Iceberg tables | |||
| Resolved | Ahoelzl | T369758 [SPIKE] Define process to build out lineage in DataHub | |||
| Duplicate | None | T386724 Integrate Spark with DataHub with lineage (non Data-Engineering Airflow instances) | |||
| Open | mforns | T386862 Enable Spark data lineage for all Airflow instances | |||
| Resolved | EBernhardson | T374118 Datahub - ingest Hive discovery database | |||
| Resolved | BTullis | T376657 Unable to find ingested tables in datahub |
Removing inactive task assignee who left WMF ages ago.
Please do {re/un}assign tasks as part of offboarding.