Two options to implement this:
For snapshots, we could pull from sitematrix like we're doing now, maybe more frequently
For event-based, we could get notified by this maintenance bot: T292419: Post-creation work for pwnwiki
Two options to implement this:
For snapshots, we could pull from sitematrix like we're doing now, maybe more frequently
For event-based, we could get notified by this maintenance bot: T292419: Post-creation work for pwnwiki
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | • fdans | T187414 Wikistats 2.0: "aa.wikipedia.org" exists and has data available, but marked "Invalid" | |||
Open | None | T190700 Automate creation of sqoop list of wikis to import data for from sitematrix |
Tasked this briefly, want to coordinate with T239589 and make T239136 not necessary. To that end,
I think this should be much higher priority now. Usually we would move tasks back to incoming to re-prioritize, but not sure how to do that in the new process. cc @EChetty
This would only work for the snapshots, but a simple solution would be to just pull the sqoop list from canonical_data.wikis. There's no automated process keeping that up to date, but: