Page MenuHomePhabricator

Make WDQS active / active
Closed, ResolvedPublic


Traffic is ready for active / active applications, WDQS is ready to be active / active, we should do it.

Before sending traffic to codfw, we need to reimport all data as it seems that codfw is lagging behind. As there is no traffic yet, we can reimport all servers in parallel.

Event Timeline

Gehel created this task.Apr 4 2017, 5:45 AM
Restricted Application added projects: Wikidata, Discovery. · View Herald TranscriptApr 4 2017, 5:45 AM
Restricted Application added a subscriber: Aklapper. · View Herald Transcript
Gehel triaged this task as High priority.Apr 4 2017, 5:45 AM

Change 346543 had a related patch set uploaded (by Gehel):
[operations/puppet@production] wdqs: active/active public interface

Mentioned in SAL (#wikimedia-operations) [2017-04-06T08:37:23Z] <gehel> shutting down wdqs codfw for data reimport - T162111

Gehel added a comment.Apr 7 2017, 7:52 AM

Initial import is completed, wdqs-updater is restarted and is catching up on the differences since last export.

Change 346543 merged by Gehel:
[operations/puppet@production] wdqs: active/active public interface

Gehel added a comment.Apr 10 2017, 8:24 PM

Using the following curl to test, I don't see an entry in the nginx access log:

curl '*%20wd%3AQ79007%20.%0A%20%20%20%20%3Fstreet%20wdt%3AP17%20wd%3AQ142%20.%0A%20%20%20%20MINUS%20%7B%20%3Fstreet%20wdt%3AP131%20%5B%5D%20%7D%20.%0A%09SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22fr%22%20%7D%0A%7D%0AORDER%20BY%20%3FstreetLabel' -H 'Accept: application/sparql-results+json' -H 'User-Agent: curl (testing/gehel)' --resolve
Gehel edited projects, added Traffic; removed Patch-For-Review.Apr 10 2017, 8:28 PM
Gehel added a subscriber: BBlack.

I'm not sure the change is effective. While I do see a few requests (outside of pyball / icinga) in the nginx logs on the wdqs codwf servers, I don't see as many as I would expect. Also, grafana does not show any requests for the codfw service.

@BBlack any idea on how to check this further?

@Gehel: you can check x-served-by headers in the responses - half of those should have codfw hosts there now.

grafana dashboard was wrongly filtering on eqiad only (that's why I did not see any traffic there). More tests and checking x-cache and x-served-by headers show that indeed traffic is routed to codfw as well. All looks good!

ema moved this task from Triage to Watching on the Traffic board.Apr 11 2017, 12:32 PM
Smalyshev closed this task as Resolved.May 3 2017, 5:34 PM
Smalyshev claimed this task.

I think everything is fine, we can close this?