Page MenuHomePhabricator

Raine (Raine Souček)
Site Reliability Engineer - ServiceOps

Projects (11)

Today

  • No visible events.

Tomorrow

  • No visible events.

Sunday

  • No visible events.

User Details

User Since
Mar 16 2023, 2:18 PM (161 w, 13 h)
Availability
Available
IRC Nick
Raine
LDAP User
Kamila Součková
MediaWiki User
KSoučková-WMF [ Global Accounts ]

Recent Activity

Mon, Apr 13

Raine added a comment to T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).

"Starting on November 2, 2023" appears to be a typo, unless you own a time machine ;)

Mon, Apr 13, 3:55 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
Raine triaged T423138: Make the new ICU upgrade process production-ready as Medium priority.
Mon, Apr 13, 2:32 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine moved T423138: Make the new ICU upgrade process production-ready from Inbox to Backlog on the ServiceOps new board.
Mon, Apr 13, 2:32 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine created T423138: Make the new ICU upgrade process production-ready.
Mon, Apr 13, 2:31 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki

Fri, Apr 10

Raine created T422964: Upgrade the deployment servers to ICU 72.
Fri, Apr 10, 6:00 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
Raine updated the task description for T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).
Fri, Apr 10, 3:00 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
Raine updated the task description for T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).
Fri, Apr 10, 2:57 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
Raine added a project to T419212: Upgrade ServiceOps roles from Bullseye to Debian Trixie: User-Raine.
Fri, Apr 10, 2:30 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware

Wed, Apr 8

Raine added a comment to T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).

How about:

Wed, Apr 8, 8:34 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
Raine updated the task description for T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).
Wed, Apr 8, 12:44 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
Raine updated subscribers of T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).

@Trizek-WMF as discussed, we can use the same announcements referenced in T420748#11779097. Thank you for your help!

Wed, Apr 8, 12:41 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
Raine added a project to T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap): User-notice.
Wed, Apr 8, 12:40 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
Raine closed T420748: CommRel support for ICU 72 upgrade, a subtask of T419049: Upgrade the MediaWiki servers to ICU 72 ☂️, as Declined.
Wed, Apr 8, 12:39 PM · Patch-For-Review, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine closed T420748: CommRel support for ICU 72 upgrade as Declined.

Declining this in favour of using T422544 directly.

Wed, Apr 8, 12:39 PM · User-notice, MoveComms-Support, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine closed T420748: CommRel support for ICU 72 upgrade, a subtask of T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap), as Declined.
Wed, Apr 8, 12:39 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki

Tue, Apr 7

Raine added a parent task for T419058: Prepare packages and production images for ICU 72 upgrade: T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).
Tue, Apr 7, 9:48 PM · Essential-Work, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a subtask for T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap): T419058: Prepare packages and production images for ICU 72 upgrade.
Tue, Apr 7, 9:48 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
Raine added a comment to T420748: CommRel support for ICU 72 upgrade.

In the end, we decided to use the old process for all wikis this time (see T419049#11796113 if you're curious about the reasoning). So we can use the exact same messaging as last time, including the same time frame, for all wikis.

Tue, Apr 7, 9:13 PM · User-notice, MoveComms-Support, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated the task description for T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).
Tue, Apr 7, 9:01 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
Raine moved T411131: hcaptcha proxy: update wikitech page from In Progress to Radar (Pending) on the ServiceOps new board.
Tue, Apr 7, 7:54 PM · Traffic, Product Safety and Integrity, User-Raine, WE4.2 Bot detection, Documentation
Raine moved T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap) from Inbox to In Progress on the ServiceOps new board.
Tue, Apr 7, 7:53 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
Raine triaged T422546: Clean up after the ICU 72 upgrade as Medium priority.
Tue, Apr 7, 7:53 PM · Patch-For-Review, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine moved T422546: Clean up after the ICU 72 upgrade from Inbox to Scheduled (this Q) on the ServiceOps new board.
Tue, Apr 7, 7:52 PM · Patch-For-Review, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine closed T419274: ICU 72 upgrade: enable remote ICU collation writes, a subtask of T419049: Upgrade the MediaWiki servers to ICU 72 ☂️, as Declined.
Tue, Apr 7, 7:51 PM · Patch-For-Review, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine closed T419274: ICU 72 upgrade: enable remote ICU collation writes as Declined.

Upon further discussion with DBA, we will fall back to the old process this time around.

Tue, Apr 7, 7:51 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a parent task for T419242: Migrate collation data to ICU 72: T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).
Tue, Apr 7, 7:41 PM · Patch-For-Review, MW-on-K8s, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a parent task for T419212: Upgrade ServiceOps roles from Bullseye to Debian Trixie: T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).
Tue, Apr 7, 7:41 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware
Raine added a parent task for T420748: CommRel support for ICU 72 upgrade: T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).
Tue, Apr 7, 7:41 PM · User-notice, MoveComms-Support, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a parent task for T421913: Create MW images with ICU 72 libs: T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).
Tue, Apr 7, 7:41 PM · Patch-For-Review, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a parent task for T419976: Upgrade redis_misc hosts to Debian Trixie (Redis 8.0): T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).
Tue, Apr 7, 7:41 PM · Infrastructure-Foundations, ServiceOps new, MediaWiki-Platform-Team (Radar), ServiceOps-Datastores, MW-Interfaces-Team
Raine added subtasks for T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap): T419212: Upgrade ServiceOps roles from Bullseye to Debian Trixie, T419976: Upgrade redis_misc hosts to Debian Trixie (Redis 8.0), T419242: Migrate collation data to ICU 72, T420748: CommRel support for ICU 72 upgrade, T421913: Create MW images with ICU 72 libs.
Tue, Apr 7, 7:41 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
Raine changed the status of T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap) from Open to In Progress.
Tue, Apr 7, 7:37 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
Raine created T422546: Clean up after the ICU 72 upgrade.
Tue, Apr 7, 7:34 PM · Patch-For-Review, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine closed T419049: Upgrade the MediaWiki servers to ICU 72 ☂️ as Declined.

Upon further discussion with DBA, we will fall back to the old process (i.e. just running the maintenance script in place) this time around.

Tue, Apr 7, 7:31 PM · Patch-For-Review, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine created T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).
Tue, Apr 7, 7:06 PM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
Raine closed T419980: ICU 72 upgrade: `categorylinks` table swap as Declined.

Upon further discussion with DBA, we will fall back to the old process this time around.

Tue, Apr 7, 5:26 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine closed T419980: ICU 72 upgrade: `categorylinks` table swap, a subtask of T419274: ICU 72 upgrade: enable remote ICU collation writes, as Declined.
Tue, Apr 7, 5:26 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a comment to T419980: ICU 72 upgrade: `categorylinks` table swap.

Clarifying that on s6, only frwiki and ruwiki are affected by the ICU upgrade, so we are only going to do the procedure for those two.

Tue, Apr 7, 2:51 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated the task description for T419980: ICU 72 upgrade: `categorylinks` table swap.
Tue, Apr 7, 2:49 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a comment to T419980: ICU 72 upgrade: `categorylinks` table swap.
  1. I'd strongly suggest we do this gradually, it is dangerous enough to warrant it. As mentioned somewhere, probably wiki by wiki isn't doable, but the smaller the better, so the list of s3 wikis on the task description (minus ruwikinews) sounds good to me.

Yes, it needs to be (roughly) all at once. We can potentially wait a few minutes between the first table swap and the rest if that helps, but the time from the start (when ServiceOps deploy the new image) to the table swap is potentially user-visible, so we don't want to stretch it too much.

To be super double sure: we can do one section at the time and it doesn't matter if it takes days in between sections?

No, we would have to do all sections roughly at once. This is because we have to do all wikis at once (constraint on our side), and the time when wikis are done and tables are not is potentially user-visible, so we want to minimize it. That is why we are now only doing s3, and we are not using this process at all for the other sections. (We will upgrade the other sections via a long-running maintenance script that has more user impact but doesn't need DB changes.)

Tue, Apr 7, 2:41 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a comment to T419980: ICU 72 upgrade: `categorylinks` table swap.

Sorry for the delay, I've been out for almost 3 weeks and I am catching up now.

No worries, and thank you for the below!

Tue, Apr 7, 10:19 AM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki

Tue, Mar 31

Raine triaged T421913: Create MW images with ICU 72 libs as High priority.
Tue, Mar 31, 3:32 PM · Patch-For-Review, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine moved T421913: Create MW images with ICU 72 libs from Inbox to In Progress on the ServiceOps new board.
Tue, Mar 31, 3:32 PM · Patch-For-Review, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine created T421913: Create MW images with ICU 72 libs.
Tue, Mar 31, 2:48 PM · Patch-For-Review, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a comment to T419274: ICU 72 upgrade: enable remote ICU collation writes.

Many thanks @Scott_French <3

Tue, Mar 31, 2:26 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated the task description for T419242: Migrate collation data to ICU 72.
Tue, Mar 31, 2:20 PM · Patch-For-Review, MW-on-K8s, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated the task description for T420748: CommRel support for ICU 72 upgrade.
Tue, Mar 31, 1:50 PM · User-notice, MoveComms-Support, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine triaged T420748: CommRel support for ICU 72 upgrade as High priority.

The finalized list of affected wikis and the updated timeline is on the task.

Tue, Mar 31, 1:29 PM · User-notice, MoveComms-Support, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated the task description for T420748: CommRel support for ICU 72 upgrade.
Tue, Mar 31, 1:27 PM · User-notice, MoveComms-Support, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki

Mon, Mar 30

Raine updated the task description for T419980: ICU 72 upgrade: `categorylinks` table swap.
Mon, Mar 30, 9:01 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated the task description for T419980: ICU 72 upgrade: `categorylinks` table swap.
Mon, Mar 30, 9:00 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a comment to T419980: ICU 72 upgrade: `categorylinks` table swap.

@Marostegui or @Ladsgroup, when can we do the table swap? Would one of you be available for the swap (coordinated with serviceops) on Apr 7 or 8?

Mon, Mar 30, 8:36 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated subscribers of T421732: Fatal exception of type "TypeError" when deleting a page.

Oh, and many thanks to @taavi and @Scott_French for connecting the problem to my patch so quickly <3

Mon, Mar 30, 5:53 PM · MW-Interfaces-Team, Wikimedia-production-error, MediaWiki-Page-deletion
Raine added a comment to T421732: Fatal exception of type "TypeError" when deleting a page.

I would like to invoke The Oops Defense, and request a sticker :D

Mon, Mar 30, 5:51 PM · MW-Interfaces-Team, Wikimedia-production-error, MediaWiki-Page-deletion
Raine added a comment to T419980: ICU 72 upgrade: `categorylinks` table swap.

@JAllemandou We will postpone the upgrade (i.e. the table hot swap). However, the new tables are already created, though they are not in use. Is that a problem for you?

I don't see how just having the new tables created (and even loaded for the sake of it) could be an issue. Thanks for warning us :)

Mon, Mar 30, 10:37 AM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a comment to T419980: ICU 72 upgrade: `categorylinks` table swap.

The sqoop jobs fleet starts at midnight UTC on the first of the month, and usually lasts 2 days and a half if everything goes well.
Thank you for considering scheduling your operation at a different time :)

Mon, Mar 30, 10:27 AM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki

Fri, Mar 27

Raine updated the task description for T419274: ICU 72 upgrade: enable remote ICU collation writes.
Fri, Mar 27, 7:54 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated the task description for T420748: CommRel support for ICU 72 upgrade.
Fri, Mar 27, 7:48 PM · User-notice, MoveComms-Support, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated the task description for T420748: CommRel support for ICU 72 upgrade.
Fri, Mar 27, 1:25 PM · User-notice, MoveComms-Support, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated the task description for T419980: ICU 72 upgrade: `categorylinks` table swap.
Fri, Mar 27, 1:18 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated the task description for T419049: Upgrade the MediaWiki servers to ICU 72 ☂️.
Fri, Mar 27, 1:05 PM · Patch-For-Review, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated subscribers of T411131: hcaptcha proxy: update wikitech page.

@kostajh Could you (or someone from your team) update the parts of HCaptcha wikitech marked for PSI? The scope is incident mitigation and making sure oncallers have enough info to identify culprits and know what to do, so it can stay high level. I believe some of this is already documented in your design docs, but it would be good to have it in a place where oncallers can easily find it.

Fri, Mar 27, 11:30 AM · Traffic, Product Safety and Integrity, User-Raine, WE4.2 Bot detection, Documentation
Raine added a project to T411131: hcaptcha proxy: update wikitech page: Product Safety and Integrity.
Fri, Mar 27, 11:26 AM · Traffic, Product Safety and Integrity, User-Raine, WE4.2 Bot detection, Documentation
Raine updated the task description for T411131: hcaptcha proxy: update wikitech page.
Fri, Mar 27, 11:24 AM · Traffic, Product Safety and Integrity, User-Raine, WE4.2 Bot detection, Documentation

Thu, Mar 26

Raine closed T421208: Dead links at https://wikitech.wikimedia.org/wiki/RESTBase#Analytics_and_metrics, a subtask of T421203: Bad ATS config led to large volume of 5xx from RESTBase, as Resolved.
Thu, Mar 26, 5:37 PM · Incident Severity 3, Traffic, Wikimedia-Incident
Raine closed T421208: Dead links at https://wikitech.wikimedia.org/wiki/RESTBase#Analytics_and_metrics as Resolved.

It's not clear what the dead link pointed to originally, but my best guess is that it was a dashboard for the MW deployment serving the API traffic. During an incident, I would want to see where the external REST and action API traffic lands. So I changed the dead dashboard link to point to the mw-api-ext dashboard.

Thu, Mar 26, 5:37 PM · ServiceOps-Services-Oids, ServiceOps new, Documentation, Sustainability (Incident Followup), RESTBase

Wed, Mar 25

Raine changed the status of T411131: hcaptcha proxy: update wikitech page, a subtask of T410626: WE6.2.6: ☂️ hcaptcha-proxy Production Readiness Review, from Open to In Progress.
Wed, Mar 25, 2:02 PM · User-Raine, Epic, ServiceOps-Services-Oids, ServiceOps new
Raine changed the status of T411131: hcaptcha proxy: update wikitech page from Open to In Progress.

I have reorganized https://wikitech.wikimedia.org/wiki/HCaptcha and added placeholders for missing/outdated info.

Wed, Mar 25, 2:02 PM · Traffic, Product Safety and Integrity, User-Raine, WE4.2 Bot detection, Documentation

Tue, Mar 24

Raine added a project to T418148: Reroute Link Recommendation APIs through the REST gateway: User-Raine.
Tue, Mar 24, 5:16 PM · User-Raine, MW-Interfaces-Team, ServiceOps new, OKR-Work

Mon, Mar 23

Raine added a comment to T420748: CommRel support for ICU 72 upgrade.

@Raine Based on past iteration, do we have more details about the expected disruption, what do we mean by out of order display, is it expected for all users the same way, during the entire duration?

Mon, Mar 23, 1:17 PM · User-notice, MoveComms-Support, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a comment to T419980: ICU 72 upgrade: `categorylinks` table swap.

A mass deletion of categories has just been launched on ruwikinews, with ~1.5M categories (out of ~2M) slated for deletion. It's going to take more than a few weeks to complete. I'm not sure how exactly it could affect this task, but a warning probably couldn't hurt. See the ruwikinews forum for more info.

@Ladsgroup or @tstarling, you have a better understanding of the MW side than me, do you know if this affects the new process, and if yes, what to do about it? Thank you!

Mon, Mar 23, 12:48 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated subscribers of T419980: ICU 72 upgrade: `categorylinks` table swap.

A mass deletion of categories has just been launched on ruwikinews, with ~1.5M categories (out of ~2M) slated for deletion. It's going to take more than a few weeks to complete. I'm not sure how exactly it could affect this task, but a warning probably couldn't hurt. See the ruwikinews forum for more info.

Mon, Mar 23, 12:45 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki

Fri, Mar 20

Raine moved T420748: CommRel support for ICU 72 upgrade from Inbox to Radar (Pending) on the ServiceOps new board.
Fri, Mar 20, 4:05 PM · User-notice, MoveComms-Support, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine placed T420748: CommRel support for ICU 72 upgrade up for grabs.
Fri, Mar 20, 4:05 PM · User-notice, MoveComms-Support, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine created T420748: CommRel support for ICU 72 upgrade.
Fri, Mar 20, 4:04 PM · User-notice, MoveComms-Support, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a comment to T419980: ICU 72 upgrade: `categorylinks` table swap.

My suggestion has been to do it gradually across different wikis.

This is not really feasible, as we will need to deploy the image with the ICU72 libraries just prior to swapping the table, and currently there is no mapping of image to wiki (we're serving all wikis from the same image), so we're not able to roll out the new ICU libraries wiki by wiki.

The only option I can think of is to swap the table _before_ we upgrade the MW libraries, but I do not know what would happen in that case -- that would mean serving ICU72 keys with ICU67 libraries, and I have no idea whether it's forward-compatible and what would happen if it isn't.

Then maybe this time, we do swap mode on small wikis and then old method for large wikis

Fri, Mar 20, 3:19 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a comment to T419980: ICU 72 upgrade: `categorylinks` table swap.

[...] the script needs to populate the new table [...], I'm not sure enwiki would be done by then.

Fri, Mar 20, 2:19 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a comment to T419980: ICU 72 upgrade: `categorylinks` table swap.

My suggestion has been to do it gradually across different wikis.

Fri, Mar 20, 2:16 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine changed the status of T419242: Migrate collation data to ICU 72, a subtask of T419049: Upgrade the MediaWiki servers to ICU 72 ☂️, from Stalled to In Progress.
Fri, Mar 20, 12:49 PM · Patch-For-Review, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine changed the status of T419242: Migrate collation data to ICU 72 from Stalled to In Progress.
Fri, Mar 20, 12:49 PM · Patch-For-Review, MW-on-K8s, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated the task description for T419548: Create a temporary Shellbox instance for ICU 72 upgrade.
Fri, Mar 20, 12:47 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine closed T419548: Create a temporary Shellbox instance for ICU 72 upgrade, a subtask of T419242: Migrate collation data to ICU 72, as Resolved.
Fri, Mar 20, 12:47 PM · Patch-For-Review, MW-on-K8s, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine closed T419548: Create a temporary Shellbox instance for ICU 72 upgrade, a subtask of T419274: ICU 72 upgrade: enable remote ICU collation writes, as Resolved.
Fri, Mar 20, 12:47 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine closed T419548: Create a temporary Shellbox instance for ICU 72 upgrade as Resolved.

Looks like it's working! I'm getting a grumpy response from shellbox when I curl at it from another pod.

kamila@deploy2002:~/icu72$ kubectl exec -it mw-experimental.codfw.pinkllama-8578d75f87-jrkhp -c mediawiki-pinkllama-app -- php -r "`cat test-shellbox-conn.php`"
Connection successful, HTTP status: 500
{
    "__": "Shellbox server error",
    "class": "Shellbox\\ShellboxError",
    "message": "No action was specified",
    "log": [
        {
            "level": "ERROR",
            "message": "Exception of class Shellbox\\ShellboxError: No action was specified",
            "context": {
                "trace": "#0 /srv/app/src/Server.php(77): Shellbox\\Server->guardedExecute('/srv/app/config...')\n#1 /srv/app/src/Server.php(66): Shellbox\\Server->execute('/srv/app/config...')\n#2 /srv/app/index.php(3): Shellbox\\Server::main('/srv/app/config...')\n#3 {main}"
            }
        }
    ]
}
Fri, Mar 20, 12:47 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki

Thu, Mar 19

Raine added a comment to T419548: Create a temporary Shellbox instance for ICU 72 upgrade.

The new images look happy:

sh
kamila@deploy2002:~/icu72$ kubectl exec -it shellbox-icu72-c4f45f566-p2srn -c shellbox-icu72-app -- sh -c 'apt list | grep libicu'
libicu72/now 72.1-3+deb12u1~wmf11u1 amd64 [installed,local]
Thu, Mar 19, 12:56 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated the task description for T419548: Create a temporary Shellbox instance for ICU 72 upgrade.
Thu, Mar 19, 12:53 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine updated the task description for T419980: ICU 72 upgrade: `categorylinks` table swap.
Thu, Mar 19, 10:39 AM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki

Wed, Mar 18

Raine added a comment to T419980: ICU 72 upgrade: `categorylinks` table swap.

I won't be but Manuel will. OTOH, it'll be around the time of dc switchover, can it be moved to a bit later? e.g. first week of April.

I will discuss with Manuel (and schedule around data lake imports), thank you!

Wed, Mar 18, 6:37 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a comment to T419980: ICU 72 upgrade: `categorylinks` table swap.

Hi! FYI it would be probably be good to avoid early in a calendar month, as this is when data is sqooped into the Data Lake. I think things would break if a table rename happens while big selects are running on the analytics replicas.

Wed, Mar 18, 6:31 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine moved T419548: Create a temporary Shellbox instance for ICU 72 upgrade from Needs Info / Blocked to In Progress on the User-Raine board.
Wed, Mar 18, 5:16 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine changed the status of T419980: ICU 72 upgrade: `categorylinks` table swap, a subtask of T419274: ICU 72 upgrade: enable remote ICU collation writes, from Open to In Progress.
Wed, Mar 18, 5:13 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine changed the status of T419980: ICU 72 upgrade: `categorylinks` table swap from Open to In Progress.
Wed, Mar 18, 5:13 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine added a project to T411097: Deprecate low-traffic proxoid service and O:hcaptcha_proxy for the older hcaptcha proxy setup: User-Raine.
Wed, Mar 18, 5:10 PM · User-Raine, SRE, Traffic

Mar 17 2026

Raine added a project to T419980: ICU 72 upgrade: `categorylinks` table swap: DBA.
Mar 17 2026, 10:00 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine closed T419536: ServiceOps review of REST gateway deployment docs on wikitech, a subtask of T418735: Support for API rate limiting, as Resolved.
Mar 17 2026, 12:35 PM · User-Raine
Raine closed T419536: ServiceOps review of REST gateway deployment docs on wikitech as Resolved.

@daniel I was bold and edited the wikitech page. Ping me (and potentially reopen this) if you dislike anything, otherwise you can consider it reviewed.

Mar 17 2026, 12:35 PM · ServiceOps new, MediaWiki-Platform-Team, ServiceOps-Services-Oids, User-Raine
Raine added a comment to T419980: ICU 72 upgrade: `categorylinks` table swap.
  • I think this needs to be done gradually. We have never done this in production and it could be broken since it was introduced. So please do it on testwiki for example, then swap and if it's working fine. Move to mediawikiwiki (s3) and then rest of s3 and then wikis in s5 and so on.

Yes, definitely going to do that. Thanks!

Mar 17 2026, 12:16 PM · Data-Engineering-Radar, DBA, Data-Persistence, Data-Engineering, Schema-change, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki

Mar 13 2026

Raine updated the task description for T419548: Create a temporary Shellbox instance for ICU 72 upgrade.
Mar 13 2026, 5:43 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine moved T419548: Create a temporary Shellbox instance for ICU 72 upgrade from Scheduled (this Q) to In Progress on the ServiceOps new board.
Mar 13 2026, 1:23 PM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
Raine changed the status of T419548: Create a temporary Shellbox instance for ICU 72 upgrade, a subtask of T419242: Migrate collation data to ICU 72, from Stalled to In Progress.
Mar 13 2026, 1:23 PM · Patch-For-Review, MW-on-K8s, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki