Steps to replicate the issue (include links if applicable):
Go to https://quarry.wmcloud.org/ and run any query on ruwiki_p (e.g. SELECT COUNT(*) FROM page;).
Observe the warning about synchronization delay and stale data (≈ 11 h behind).
Confirm the actual lag on https://replag.toolforge.org/: section s6 shows ~16 h of lag.
What happens?:
Queries run on Quarry (quarry.wmcloud.org) against the Russian Wikipedia replica (ruwiki_p) are returning stale data and showing this warning:
“The database on which this query was executed has a synchronization delay with the wiki. This can be caused by maintenance or a database incident, and should be resolved soon. Modifications that were made in the last 11 hours on the wiki are not taken into account in the results below.”
Meanwhile, https://replag.toolforge.org/ reports a consistent ~16 hour lag for section s6, which hosts ruwiki_p, on both the analytics and web replica hosts (Lag = 58 594 s ≈ 16 h 16 m 34 s).
This far exceeds normal expectations (web replicas: < 5 min; analytics: < 1 h).
Queries return data that is ~16 hours out-of-date.
Warning message states recent modifications (last ~11 h) are not included.
heartbeat_p view shows large lag_seconds on s6.
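As a quick sanity check of the figures above, here is a minimal Python sketch that converts the reported lag into hours, minutes and seconds and compares it with the expected bounds quoted in this report. The function names and the two constants are illustrative only and are not part of any Wikimedia tooling.

```python
# Illustrative helpers; the names and constants are assumptions for this report.
WEB_MAX_LAG_S = 5 * 60         # expected bound for web replicas, as stated above
ANALYTICS_MAX_LAG_S = 60 * 60  # expected bound for analytics replicas, as stated above


def format_lag(seconds: int) -> str:
    """Render a lag given in seconds as 'H h M m S s'."""
    hours, rest = divmod(seconds, 3600)
    minutes, secs = divmod(rest, 60)
    return f"{hours} h {minutes} m {secs} s"


def times_over(seconds: int, limit: int) -> float:
    """Return how many times larger the lag is than the given limit."""
    return seconds / limit


lag_s = 58594  # value reported for s6 on replag.toolforge.org
print(format_lag(lag_s))                               # 16 h 16 m 34 s
print(round(times_over(lag_s, WEB_MAX_LAG_S)))         # ratio against the web bound
print(round(times_over(lag_s, ANALYTICS_MAX_LAG_S)))   # ratio against the analytics bound
```

By this calculation the reported lag is roughly 195 times the web bound and about 16 times the analytics bound.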
Impact:
* Analytical queries on ruwiki are missing roughly the last 16 hours of edits.
* Public outputs based on Quarry (e.g. dashboards, reports) are stale and may present outdated figures as current.
* Bot and service integrations relying on fresh data may fail or produce inconsistent results.
What should have happened instead?:
Replication lag should be under 5 minutes for the web host and under 1 hour for the analytics host.
No warning should be shown in Quarry for normal queries.
Other information (browser name/version, screenshots, etc.):
Preliminary diagnostics
* No known scheduled maintenance on s6 at this time.
* Other sections (s3, s5) currently show no lag.
Please,
Verify replica I/O and SQL threads on section s6 (SHOW SLAVE STATUS\G).
Check for any long-running or blocking transactions on s6 (SHOW PROCESSLIST).
If replication is stuck, restart the replication worker or clear the problematic queries.
Provide an ETA for full catch-up, or escalate if hardware or network issues are involved.
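For reference, the thread check requested above could be sketched roughly as follows in Python. It assumes the row returned by SHOW SLAVE STATUS has already been fetched into a dictionary keyed by column name (for example via a dict-style cursor); the function name `replication_problem` and the `max_lag_s` parameter are hypothetical. Only the standard `Slave_IO_Running`, `Slave_SQL_Running` and `Seconds_Behind_Master` fields are inspected.

```python
from typing import Any, Mapping, Optional


def replication_problem(status: Mapping[str, Any], max_lag_s: int) -> Optional[str]:
    """Return a short description of the first problem found, or None if healthy.

    `status` is assumed to be one SHOW SLAVE STATUS row as a dict (hypothetical input).
    """
    if status.get("Slave_IO_Running") != "Yes":
        return "I/O thread is not running"
    if status.get("Slave_SQL_Running") != "Yes":
        return "SQL thread is not running"
    lag = status.get("Seconds_Behind_Master")
    if lag is None:
        # The server reports NULL when it cannot compute the lag.
        return "lag is unknown (Seconds_Behind_Master is NULL)"
    if int(lag) > max_lag_s:
        return f"replica is {int(lag)} s behind (limit {max_lag_s} s)"
    return None
```

If both threads report "Yes" but the lag is still large, the replica is applying changes and simply behind, which points towards a long-running transaction or load rather than broken replication.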