User Details
- User Since
- Aug 14 2018, 10:50 AM (297 w, 4 d)
- Availability
- Available
- IRC Nick
- effie
- LDAP User
- Effie Mouzeli
- MediaWiki User
- EMouzeli (WMF) [ Global Accounts ]
Thu, Apr 25
Wed, Apr 24
@brouberol thank you for finding this. While I had spotted this in the past, I began working on updating the module T356885, as its current design is somewhat limited. If you are currently blocked, my suggestion is for you to go ahead with this minor update, for now. When I make some time to properly finish T356885, I will update any charts with active jobs anyway. @JMeybohm any objections?
That is my doing, I shouldn't have marked this task as resolved. While I was doing some other work, I found that the definition of cache.mcrouter.deployment was kind of misleading, thus the update. The old definition cache.mcrouter.deployment will be phased out, it is still present because I wanted to move forward with mw-mcrouter ds, without breaking MediaWiki should someone update its modules.
I am marking this as High Priority because the current status is:
- codfw is using a mcrouter daemonset
- eqiad is using the mcrouter container
Tue, Apr 23
Uploaded new package with the binary named as prometheus-memcached-exporter
Tue, Apr 16
Mon, Apr 15
Thu, Apr 11
@dancy it would be great if someone could finish this soon. While scap now does have an option to mitigate potential helm hiccups, I think we should add it to the mix nevertheless
Built and repackaged
Built and uploaded
Tue, Apr 9
Thu, Apr 4
Wed, Apr 3
We depooled mw-web-ro from eqiad, and attempted a rollback
Tue, Apr 2
Mar 28 2024
Mar 27 2024
Switchover is done, it is Day 8, and we are back to Multi-DC. Thank you serviceops and @akosiaris for being good teammates and keeping an eye on things.
Mar 26 2024
Mar 21 2024
Mar 20 2024
Mar 19 2024
This is done, weill reopen if something goes south
Mar 6 2024
Mar 5 2024
mw-mcrouter ds has been deployed on staging mw-mcrouter staging
Mar 4 2024
Looks alright!
@Trizek-WMF as per our off-phabricator discussion, the major change is that this is not a procedure we test anymore, but it has become standard practice. Please edit the message as you see fit to reflect that.
Feb 29 2024
Feb 27 2024
Looking into the issue, we found that around 26th Feb @ ~21:45 UTC, the urldownloader1003 (ganeti VM running on ganeti1027 ie cluster master) lost network connectivity
This was most likely related to T358597
After issuing a restart, the VM came back to life normally.