Page MenuHomePhabricator

Resume data pipeline operations
Closed, ResolvedPublic

Event Timeline

Important

At the time of writing this comment, wmf.wikidata_item_page_link and wmf.wikidata_entity latest available snapshots are 2025-01-20.
I've resumed all data pipelines with the following inputs:

  • wmf.wikidata_item_page_link and wmf.wikidata_entity = 2025-01-20
  • discovery.cirrus_index_without_content = 20250119
  • ALIS' monthly inputs = 2025-01
  • wmf.mediawiki_wikitext_current = 2024-11

T385865#10535846 yielded zero SLIS.
In the meanwhile, wmf.mediawiki_wikitext_current's 2025-01 snapshot became available, so re-running pipelines with it.

T385865#10538331 yielded zero section topics, which should also entail zero SLIS. The intuition is that mismatched input snapshots seem to disrupt SLIS.
SLIS is still at 2024-12-23, so I reset its Cassandra TTL.
ALIS is at 2025-01-20.

As of now, wmf.wikidata_item_page_link and wmf.wikidata_entity latest available snapshots are still 2025-01-20. I don't think it makes sense to resume normal operations if weekly inputs are missing.
I have paused all pipelines again.

AUgolnikova-WMF triaged this task as Unbreak Now! priority.Mar 19 2025, 4:39 PM
AUgolnikova-WMF updated the task description. (Show Details)

Update

  • weekly dependencies are back
  • ALIS has resumed normal operations - the latest snapshot is 2025-03-24
  • wmf.mediawiki_wikitext_current/snapshot=2025-03 isn't available yet
  • SLIS is also at 2025-03-24, but normal operations are expected to be on hold again, as the next scheduled run will wait for the above dependency

CC @KStoller-WMF @HNordeenWMF @Michael .

Update

SLIS is still blocked at 2025-03-24. I reset Cassandra's TTL.

CC @KStoller-WMF @HNordeenWMF @Michael .

Hi @mfossati please also tag @Seddon for these types of updates, thanks!

Update

SLIS is still blocked at 2025-03-24. I reset Cassandra's TTL again.
T394757: Update relevant data pipelines to wmf_content.mediawiki_content_current_v1 is in progress and can unblock this ticket.

CC @KStoller-WMF @HNordeenWMF @Seddon @Michael .

Update

T394757: Update relevant data pipelines to wmf_content.mediawiki_content_current_v1 is complete and this week's production runs were successful! 🎉
SLIS is unblocked with fresh data at 2025-06-16. 🎺

CC @KStoller-WMF @HNordeenWMF @Seddon @Michael .

Update

T394757: Update relevant data pipelines to wmf_content.mediawiki_content_current_v1 is complete and this week's production runs were successful! 🎉
SLIS is unblocked with fresh data at 2025-06-16. 🎺

CC @KStoller-WMF @HNordeenWMF @Seddon @Michael .

Great news, Thank you!