- Depending on: https://phabricator.wikimedia.org/T171258
- Following the puppetization of the WDCM: run the whole system on regular monthly updates
- Add a timestamp info somewhere in the WDCM dashboards
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | GoranSMilovanovic | T210147 Optimize WDCM update engines | |||
Resolved | GoranSMilovanovic | T179286 WDCM Regular Updates |
Status:
With the new procedures and following its scaling to Apache Spark we could run WDCM on daily basis with no trouble at all, except that we cannot do that because the Sqoop transfer from MariaDB (client wbc_entity_usage tables) to HDFS takes hours to complete. I will see if there is anything that I can do to speed it up, but I think the chances are slim.