
jijiki (effie mouzeli)
is an animal

Projects (18)

User Details

User Since
Aug 14 2018, 10:50 AM (400 w, 3 d)
Availability
Available
IRC Nick
effie
LDAP User
Effie Mouzeli
MediaWiki User
EMouzeli (WMF) [ Global Accounts ]

Recent Activity

Yesterday

jijiki added a comment to T405688: Support shell to mw-experimental pod.

I don’t see what reliability has to do with it? We wanted to know the impact of adding a bunch of namespace aliases in many different languages (would this “shadow” any existing pages?), and it seemed to me that the best way to do that would be to dry-run namespaceDupes on all wikis, with the patch experimentally applied, and then inspect the maintenance script’s output (and compare it with a “regular” dry-run without the patch).

Thu, Apr 16, 3:36 PM · ServiceOps-Mediawiki, Prod-Kubernetes, ServiceOps new, MW-on-K8s
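
The comparison described in the comment above (a regular dry run versus a dry run with the patch applied) can be sketched as a set difference over the two outputs. This is only an illustrative sketch: the function name `new_conflicts` and the sample lines are hypothetical, and the real output format of namespaceDupes is not assumed here.

```python
def new_conflicts(baseline_output: str, patched_output: str) -> list[str]:
    """Return the lines reported only by the patched dry run.

    Lines are compared after stripping whitespace; order follows the
    patched output, and repeated lines are reported once.
    """
    baseline = {line.strip() for line in baseline_output.splitlines() if line.strip()}
    seen: set[str] = set()
    only_in_patched: list[str] = []
    for raw in patched_output.splitlines():
        line = raw.strip()
        if line and line not in baseline and line not in seen:
            seen.add(line)
            only_in_patched.append(line)
    return only_in_patched


# Hypothetical sample text, not real maintenance-script output.
regular = "conflict: Foo\n"
patched = "conflict: Foo\nconflict: Bar\n"
print(new_conflicts(regular, patched))  # ['conflict: Bar']
```

Anything this returns would be a page that only the experimental aliases affect, which is the set worth inspecting by hand.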

Wed, Apr 15

jijiki claimed T423452: Wikimedia Hackathon 2026: Fifty Shades of Caching and How LLMs Paint It Black.
Wed, Apr 15, 4:12 PM · Wikimedia-Hackathon-2026
jijiki moved T423452: Wikimedia Hackathon 2026: Fifty Shades of Caching and How LLMs Paint It Black from Backlog to Proposed Unconference Session on the Wikimedia-Hackathon-2026 board.
Wed, Apr 15, 4:12 PM · Wikimedia-Hackathon-2026
jijiki moved T423333: Wikimedia Hackathon 2026: Pipeline for Moving Data Between Wikibase Instances and Wikidata from Proposed Unconference Session to Backlog on the Wikimedia-Hackathon-2026 board.
Wed, Apr 15, 4:12 PM · Wikimedia-Hackathon-2026, Data-Engineering, Data Pipelines, Wikidata, Wikibase (3rd party installations)
jijiki moved T423333: Wikimedia Hackathon 2026: Pipeline for Moving Data Between Wikibase Instances and Wikidata from Backlog to Proposed Unconference Session on the Wikimedia-Hackathon-2026 board.
Wed, Apr 15, 4:12 PM · Wikimedia-Hackathon-2026, Data-Engineering, Data Pipelines, Wikidata, Wikibase (3rd party installations)
jijiki renamed T423452: Wikimedia Hackathon 2026: Fifty Shades of Caching and How LLMs Paint It Black from Wikimedia Hackathon 2026: Fifty Shades of Caching and How LLMs Paint It Black (TBA) to Wikimedia Hackathon 2026: Fifty Shades of Caching and How LLMs Paint It Black.
Wed, Apr 15, 4:05 PM · Wikimedia-Hackathon-2026
jijiki renamed T423452: Wikimedia Hackathon 2026: Fifty Shades of Caching and How LLMs Paint It Black from Wikimedia Hackathon 2026: 50 Shades of Caching and How LLMs Paint It Black (TBA) to Wikimedia Hackathon 2026: Fifty Shades of Caching and How LLMs Paint It Black (TBA).
Wed, Apr 15, 4:04 PM · Wikimedia-Hackathon-2026
jijiki created T423452: Wikimedia Hackathon 2026: Fifty Shades of Caching and How LLMs Paint It Black.
Wed, Apr 15, 3:46 PM · Wikimedia-Hackathon-2026

Thu, Apr 9

jijiki claimed T422784: MediaWiki SquareOne Dashboards.
Thu, Apr 9, 9:20 AM · ServiceOps new
jijiki moved T422784: MediaWiki SquareOne Dashboards from Scheduled (this Q) to Backlog on the ServiceOps new board.
Thu, Apr 9, 9:19 AM · ServiceOps new
jijiki triaged T422784: MediaWiki SquareOne Dashboards as Medium priority.
Thu, Apr 9, 9:19 AM · ServiceOps new
jijiki moved T422424: Another blob upload invalid error when pushing to docker-registry from Inbox to Radar (Pending) on the ServiceOps new board.
Thu, Apr 9, 9:19 AM · Infrastructure-Foundations, ServiceOps new, SRE
jijiki moved T422678: MediaWiki periodic job update-special-pages-s5 failed from Inbox to In Progress on the ServiceOps new board.
Thu, Apr 9, 9:18 AM · ServiceOps new, Wikimedia-production-error, MediaWiki-Special-pages
jijiki moved T422784: MediaWiki SquareOne Dashboards from Inbox to Scheduled (this Q) on the ServiceOps new board.
Thu, Apr 9, 9:17 AM · ServiceOps new
jijiki changed the status of T411202: SquareOne Dashboards MVP and feedback from Stalled to In Progress.
Thu, Apr 9, 8:08 AM · ServiceOps new, Incident Tooling, ServiceOps-Mediawiki
jijiki changed the status of T411202: SquareOne Dashboards MVP and feedback, a subtask of T414663: SquareOne Dashboards: Guided Incident Response, from Stalled to In Progress.
Thu, Apr 9, 8:08 AM · Incident Tooling, Epic
jijiki renamed T411202: SquareOne Dashboards MVP and feedback from Proof of Concept: MediaWiki SquareOne Dashboards to SquareOne Dashboards MVP and feedback.
Thu, Apr 9, 8:08 AM · ServiceOps new, Incident Tooling, ServiceOps-Mediawiki
jijiki created T422784: MediaWiki SquareOne Dashboards.
Thu, Apr 9, 8:04 AM · ServiceOps new

Wed, Apr 8

jijiki added a project to T422486: MediaWiki periodic job failures due to timeouts: DBA.
Wed, Apr 8, 12:15 PM · ServiceOps new (Next quarter), DBA
jijiki moved T422489: rdbms errors in eqiad from Inbox to Radar (Awareness) on the ServiceOps new board.
Wed, Apr 8, 11:46 AM · DBA, ServiceOps new
jijiki attached a referenced file: F75319581: image.png.
Wed, Apr 8, 11:43 AM · ServiceOps new (Next quarter), DBA
jijiki added a comment to T420223: High (relatively) number of memcached errors in eqiad.

While investigating a different problem, I found that we have a similar(?) issue when MediaWiki contacts the DBs: T422489: rdbms errors in eqiad

Wed, Apr 8, 11:36 AM · Infrastructure-Foundations, ServiceOps new, ServiceOps-Datastores
jijiki added a comment to T422486: MediaWiki periodic job failures due to timeouts.

I can't help noticing that the MediaWiki periodic job update-special-pages-s5 failed twice for the same reason, which is either a very unfortunate coincidence related to T422489: rdbms errors in eqiad, or something worth investigating.

Wed, Apr 8, 11:35 AM · ServiceOps new (Next quarter), DBA
jijiki updated the task description for T422486: MediaWiki periodic job failures due to timeouts.
Wed, Apr 8, 11:32 AM · ServiceOps new (Next quarter), DBA
jijiki closed T422580: MediaWiki periodic job update-special-pages-s5 failed as Resolved.

I am closing this, since we are tracking it in T422486.

Wed, Apr 8, 11:16 AM · ServiceOps new, Wikimedia-production-error, MediaWiki-Special-pages
jijiki closed T422580: MediaWiki periodic job update-special-pages-s5 failed, a subtask of T422486: MediaWiki periodic job failures due to timeouts, as Resolved.
Wed, Apr 8, 11:16 AM · ServiceOps new (Next quarter), DBA
jijiki removed a subtask for T422489: rdbms errors in eqiad: T422580: MediaWiki periodic job update-special-pages-s5 failed.
Wed, Apr 8, 11:16 AM · DBA, ServiceOps new
jijiki edited parent tasks for T422580: MediaWiki periodic job update-special-pages-s5 failed, added: T422486: MediaWiki periodic job failures due to timeouts; removed: T422489: rdbms errors in eqiad.
Wed, Apr 8, 11:16 AM · ServiceOps new, Wikimedia-production-error, MediaWiki-Special-pages
jijiki added a subtask for T422486: MediaWiki periodic job failures due to timeouts: T422580: MediaWiki periodic job update-special-pages-s5 failed.
Wed, Apr 8, 11:16 AM · ServiceOps new (Next quarter), DBA
jijiki added a subtask for T422489: rdbms errors in eqiad: T422486: MediaWiki periodic job failures due to timeouts.
Wed, Apr 8, 11:14 AM · DBA, ServiceOps new
jijiki added a parent task for T422486: MediaWiki periodic job failures due to timeouts: T422489: rdbms errors in eqiad.
Wed, Apr 8, 11:14 AM · ServiceOps new (Next quarter), DBA
jijiki added a parent task for T422580: MediaWiki periodic job update-special-pages-s5 failed: T422489: rdbms errors in eqiad.
Wed, Apr 8, 11:13 AM · ServiceOps new, Wikimedia-production-error, MediaWiki-Special-pages
jijiki added a subtask for T422489: rdbms errors in eqiad: T422580: MediaWiki periodic job update-special-pages-s5 failed.
Wed, Apr 8, 11:13 AM · DBA, ServiceOps new
jijiki added a comment to T420336: mw-parsoid improvements.

Update on the status of parsoid testreduce for rt-testing:

  • In order not to break the current rt-testing infrastructure, which is critical for our weekly processes, I created a dev env in Cloud VPS.
  • Our testsuite is updated to use custom headers to access the parsoid instance in k8s
  • From a quick run, it looks like the basic functionality is there

Some things that need improvement:

  • I think that having 2 different envs, and trying to keep track of which DC is active, is getting complicated in our testsuite
    • Ideally we would like to have a single discovery instance that always points to the active DC
Wed, Apr 8, 10:43 AM · Content-Transform-Team (Work In Progress), User-jijiki, ServiceOps-Services-Oids, ServiceOps new, OKR-Work
jijiki added a project to T422489: rdbms errors in eqiad: DBA.
Wed, Apr 8, 10:02 AM · DBA, ServiceOps new
jijiki added a comment to T420223: High (relatively) number of memcached errors in eqiad.

@jijiki shall we deploy mcrouter 2023.07.17.00-2 and test the keep-alive options? I have the feeling that the TKOs will go down after that.

Wed, Apr 8, 9:45 AM · Infrastructure-Foundations, ServiceOps new, ServiceOps-Datastores
jijiki removed a subtask for T419049: Upgrade the MediaWiki servers to ICU 72 ☂️: T419212: Upgrade ServiceOps roles from Bullseye to Debian Trixie.
Wed, Apr 8, 9:42 AM · Patch-For-Review, User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware, ServiceOps-Mediawiki
jijiki removed a subtask for T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap): T419212: Upgrade ServiceOps roles from Bullseye to Debian Trixie.
Wed, Apr 8, 9:42 AM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
jijiki removed parent tasks for T419212: Upgrade ServiceOps roles from Bullseye to Debian Trixie: T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap), T419049: Upgrade the MediaWiki servers to ICU 72 ☂️.
Wed, Apr 8, 9:42 AM · User-Raine, ServiceOps new, ServiceOps-Upgrades-Hardware
jijiki removed a parent task for T419976: Upgrade redis_misc hosts to Debian Trixie (Redis 8.0): T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap).
Wed, Apr 8, 9:41 AM · Infrastructure-Foundations, ServiceOps new, MediaWiki-Platform-Team (Radar), ServiceOps-Datastores, MW-Interfaces-Team
jijiki removed a subtask for T422544: Upgrade the MediaWiki servers to ICU 72 (no table swap): T419976: Upgrade redis_misc hosts to Debian Trixie (Redis 8.0).
Wed, Apr 8, 9:41 AM · User-notice, User-Raine, ServiceOps-Upgrades-Hardware, ServiceOps new, ServiceOps-Mediawiki
jijiki added a comment to T419976: Upgrade redis_misc hosts to Debian Trixie (Redis 8.0).

MW-Interfaces-Team, the same questions for you regarding changeprop/cpjobqueue/api-gateway/Ratelimit:

Wed, Apr 8, 9:37 AM · Infrastructure-Foundations, ServiceOps new, MediaWiki-Platform-Team (Radar), ServiceOps-Datastores, MW-Interfaces-Team

Tue, Apr 7

jijiki added a comment to T422455: Massive increase in "EtcdConfig failed to fetch data: Timeout was reached" warnings and errors since March 17th.

The changes in https://gerrit.wikimedia.org/r/1268569 were applied to codfw and eqiad at 15:10 and 15:20 UTC today respectively.

Following those changes, the rate of errors has plummeted (logstash):

Tue, Apr 7, 7:15 PM · Patch-For-Review, MediaWiki-Engineering, ServiceOps new, Wikimedia-production-error
jijiki attached a referenced file: F75259445: image.png.
Tue, Apr 7, 7:12 PM · ServiceOps new (Next quarter), DBA
jijiki added a comment to T422486: MediaWiki periodic job failures due to timeouts.

Things look quite good so far for mw-cron (MediaWiki Periodic Jobs on k8s) after merging 1268569. More details in T422455#11795500.

Tue, Apr 7, 7:11 PM · ServiceOps new (Next quarter), DBA
jijiki added a comment to T419976: Upgrade redis_misc hosts to Debian Trixie (Redis 8.0).

Infrastructure-Foundations, two questions:

  • How will netbox behave if it loses connectivity to its redis and then starts with a cold cache?
  • Do we have any concerns about updating to redis 8?
Tue, Apr 7, 2:13 PM · Infrastructure-Foundations, ServiceOps new, MediaWiki-Platform-Team (Radar), ServiceOps-Datastores, MW-Interfaces-Team
jijiki added a project to T419976: Upgrade redis_misc hosts to Debian Trixie (Redis 8.0): Infrastructure-Foundations.
Tue, Apr 7, 2:11 PM · Infrastructure-Foundations, ServiceOps new, MediaWiki-Platform-Team (Radar), ServiceOps-Datastores, MW-Interfaces-Team
jijiki added a comment to T422455: Massive increase in "EtcdConfig failed to fetch data: Timeout was reached" warnings and errors since March 17th.

Take this with a grain of salt, but it seems like something did change during the week of March 17th, and if eqiad had not been producing all those errors, we wouldn't have noticed.

Tue, Apr 7, 1:59 PM · Patch-For-Review, MediaWiki-Engineering, ServiceOps new, Wikimedia-production-error
jijiki updated the task description for T422489: rdbms errors in eqiad.
Tue, Apr 7, 1:22 PM · DBA, ServiceOps new
jijiki renamed T422489: rdbms errors in eqiad from rdbms erros in eqiad to rdbms errors in eqiad.
Tue, Apr 7, 1:13 PM · DBA, ServiceOps new
jijiki renamed T422486: MediaWiki periodic job failures due to timeouts from MediaWiki periodic job failures to MediaWiki periodic job failures due to timeouts.
Tue, Apr 7, 1:09 PM · ServiceOps new (Next quarter), DBA
jijiki created T422489: rdbms errors in eqiad.
Tue, Apr 7, 1:08 PM · DBA, ServiceOps new
jijiki moved T422486: MediaWiki periodic job failures due to timeouts from Inbox to In Progress on the ServiceOps new board.
Tue, Apr 7, 12:55 PM · ServiceOps new (Next quarter), DBA
jijiki attached a referenced file: F75237382: image.png.
Tue, Apr 7, 12:51 PM · ServiceOps new (Next quarter), DBA
jijiki attached a referenced file: F75237448: image.png.
Tue, Apr 7, 12:51 PM · ServiceOps new (Next quarter), DBA
jijiki updated the task description for T422455: Massive increase in "EtcdConfig failed to fetch data: Timeout was reached" warnings and errors since March 17th.
Tue, Apr 7, 12:47 PM · Patch-For-Review, MediaWiki-Engineering, ServiceOps new, Wikimedia-production-error
jijiki changed the status of T422486: MediaWiki periodic job failures due to timeouts from Open to In Progress.
Tue, Apr 7, 12:45 PM · ServiceOps new (Next quarter), DBA
jijiki updated the task description for T422486: MediaWiki periodic job failures due to timeouts.
Tue, Apr 7, 12:41 PM · ServiceOps new (Next quarter), DBA
jijiki closed T422308: MediaWiki periodic job update-flaggedrev-stats failed as Resolved.

Failed jobs have been deleted; closing this too in favour of T422486.

Tue, Apr 7, 12:39 PM · ServiceOps new, Wikimedia-production-error, FlaggedRevs
jijiki closed T422410: MediaWiki periodic job refreshlinks-delete-from-nonexistent-s3 failed as Resolved.

Failed jobs have been deleted; closing this too in favour of T422486.

Tue, Apr 7, 12:39 PM · ServiceOps new, Wikimedia-production-error, MediaWiki-Page-derived-data
jijiki closed T422413: MediaWiki periodic job update-special-pages-s5 failed as Resolved.

Failed jobs have been deleted; closing this too in favour of T422486.

Tue, Apr 7, 12:39 PM · ServiceOps new, Wikimedia-production-error, MediaWiki-Special-pages
jijiki added a comment to T422486: MediaWiki periodic job failures due to timeouts.

I filtered timeouts from the mediamoderation-hourlyscan job in an attempt to establish whether we are seeing those timeouts more often since switching to eqiad.

Tue, Apr 7, 12:38 PM · ServiceOps new (Next quarter), DBA
jijiki created T422486: MediaWiki periodic job failures due to timeouts.
Tue, Apr 7, 12:28 PM · ServiceOps new (Next quarter), DBA
jijiki moved T422455: Massive increase in "EtcdConfig failed to fetch data: Timeout was reached" warnings and errors since March 17th from Inbox to In Progress on the ServiceOps new board.
Tue, Apr 7, 11:23 AM · Patch-For-Review, MediaWiki-Engineering, ServiceOps new, Wikimedia-production-error
jijiki changed the status of T422455: Massive increase in "EtcdConfig failed to fetch data: Timeout was reached" warnings and errors since March 17th from Open to In Progress.
Tue, Apr 7, 11:23 AM · Patch-For-Review, MediaWiki-Engineering, ServiceOps new, Wikimedia-production-error
jijiki added a comment to T422227: MediaWiki periodic job refreshlinks-delete-from-nonexistent-s3 failed.

@A_smart_kitten Thank you!

Tue, Apr 7, 10:08 AM · ServiceOps new, Wikimedia-production-error, MediaWiki-Page-derived-data
jijiki added a comment to T422455: Massive increase in "EtcdConfig failed to fetch data: Timeout was reached" warnings and errors since March 17th.

Setting aside any MediaWiki changes, the difference between the two DCs in the same time period (post codfw repooling) is alarming.

Tue, Apr 7, 9:54 AM · Patch-For-Review, MediaWiki-Engineering, ServiceOps new, Wikimedia-production-error
jijiki added a comment to T422455: Massive increase in "EtcdConfig failed to fetch data: Timeout was reached" warnings and errors since March 17th.

It seems like EtcdConfig failed to fetch data: (curl error: 28) Timeout was reached started sometime around March 17th, with eqiad exhibiting

Tue, Apr 7, 9:35 AM · Patch-For-Review, MediaWiki-Engineering, ServiceOps new, Wikimedia-production-error

Mon, Apr 6

jijiki closed T422227: MediaWiki periodic job refreshlinks-delete-from-nonexistent-s3 failed as Resolved.

Seems like a temporary network error

Mon, Apr 6, 4:46 PM · ServiceOps new, Wikimedia-production-error, MediaWiki-Page-derived-data
jijiki added a comment to T422308: MediaWiki periodic job update-flaggedrev-stats failed.

The nearest (timestamp-wise) log entry I found points to a temporary network problem. I don't know how long it takes @phaultfinder to create a task, so I can't be absolutely sure this is the one.

Mon, Apr 6, 4:45 PM · ServiceOps new, Wikimedia-production-error, FlaggedRevs
jijiki closed T422313: MediaWiki periodic job update-special-pages-s5 failed as Resolved.
Mon, Apr 6, 4:39 PM · ServiceOps new, Wikimedia-production-error, MediaWiki-Special-pages
jijiki added a comment to T422313: MediaWiki periodic job update-special-pages-s5 failed.

This was due to temporary connection errors to the DB: https://logstash.wikimedia.org/app/discover#/doc/logstash-*/logstash-k8s-1-7.0.0-1-2026.04.05?id=Z0EgXJ0BbI6kJ8WywbVL

Mon, Apr 6, 4:39 PM · ServiceOps new, Wikimedia-production-error, MediaWiki-Special-pages
jijiki added a comment to T420336: mw-parsoid improvements.

Update on the status of parsoid testreduce for rt-testing:

  • In order not to break the current rt-testing infrastructure, which is critical for our weekly processes, I created a dev env in Cloud VPS.
  • Our testsuite is updated to use custom headers to access the parsoid instance in k8s
  • From a quick run, it looks like the basic functionality is there

Some things that need improvement:

  • I think that having 2 different envs, and trying to keep track of which DC is active, is getting complicated in our testsuite
    • Ideally we would like to have a single discovery instance that always points to the active DC
  • We might need to bump up the resources on the k8s instance we use. Currently, if I crank up the concurrency of our testing even slightly, it fails.
  • Redirects wipe the headers, so the testsuite currently fails for pages that are redirects; we need to fix that too
Mon, Apr 6, 4:05 PM · Content-Transform-Team (Work In Progress), User-jijiki, ServiceOps-Services-Oids, ServiceOps new, OKR-Work
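
One possible way to handle the redirect problem mentioned in the comment above is to follow redirects by hand and re-send the same custom headers on every hop. The sketch below is an assumption, not the testsuite's actual code: `fetch_keeping_headers`, the injected `fetch` callable, and the `X-Example-Target` header are all hypothetical, and the response headers are assumed to expose the redirect target under the key `Location`.

```python
from typing import Callable, Mapping
from urllib.parse import urljoin

# (status code, response headers, body)
Response = tuple[int, Mapping[str, str], bytes]
# A caller-supplied function that performs ONE request without following redirects.
Fetch = Callable[[str, Mapping[str, str]], Response]

REDIRECT_CODES = {301, 302, 303, 307, 308}


def fetch_keeping_headers(
    url: str,
    headers: Mapping[str, str],
    fetch: Fetch,
    max_redirects: int = 5,
) -> tuple[str, int, bytes]:
    """Follow redirects manually, passing the same headers on every hop.

    Returns (final URL, status code, body). Raises RuntimeError when the
    redirect chain is longer than max_redirects.
    """
    for _ in range(max_redirects + 1):
        status, response_headers, body = fetch(url, headers)
        location = response_headers.get("Location")
        if status not in REDIRECT_CODES or location is None:
            return url, status, body
        # Location may be relative, so resolve it against the current URL.
        url = urljoin(url, location)
    raise RuntimeError("too many redirects")


# Usage with a stand-in fetch function (no network access needed).
def fake_fetch(url: str, headers: Mapping[str, str]) -> Response:
    if url.endswith("/Old"):
        return 302, {"Location": "/wiki/New"}, b""
    return 200, {}, b"ok"


result = fetch_keeping_headers(
    "https://parsoid.example/wiki/Old", {"X-Example-Target": "k8s"}, fake_fetch
)
print(result)  # ('https://parsoid.example/wiki/New', 200, b'ok')
```

Re-sending headers to whatever host a redirect names is only reasonable when every hop stays inside trusted infrastructure, so a real version would probably check the target host first.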
jijiki removed projects from T421168: Session store issues causing badtoken errors, session failures, logouts (late March–April 2026): ServiceOps new, SRE.
Mon, Apr 6, 3:06 PM · Data-Persistence, Datacenter-Switchover
jijiki updated subscribers of T421168: Session store issues causing badtoken errors, session failures, logouts (late March–April 2026).

I don’t see how that’s a reason to close the task? I don’t really care whether you think it’s related to the DC switchover or not, I care about the issue getting fixed. And to me the Grafana graph linked in the task description, looking at badtoken errors in the last 30 days, doesn’t really look like a return to normal yet (though the logarithmic scale makes it hard to interpret, and AFAICT my volunteer account doesn’t have sufficient privileges to preview an edited version with a linear scale):

image.png (1×3 px, 1 MB)

Mon, Apr 6, 3:05 PM · Data-Persistence, Datacenter-Switchover
jijiki added a project to T421168: Session store issues causing badtoken errors, session failures, logouts (late March–April 2026): Data-Persistence.
Mon, Apr 6, 2:57 PM · Data-Persistence, Datacenter-Switchover
jijiki changed the status of T422166: scap can’t deploy (blob upload unknown) after apus.discovery.wmnet is repooled in codfw from Open to In Progress.
Mon, Apr 6, 2:46 PM · Ceph, SRE-swift-storage, Patch-For-Review, ServiceOps new, Datacenter-Switchover, SRE
jijiki moved T422166: scap can’t deploy (blob upload unknown) after apus.discovery.wmnet is repooled in codfw from Inbox to In Progress on the ServiceOps new board.
Mon, Apr 6, 2:46 PM · Ceph, SRE-swift-storage, Patch-For-Review, ServiceOps new, Datacenter-Switchover, SRE
jijiki moved T422259: Create a wizard to support common combinations of patterns-actions from Inbox to Radar (Awareness) on the ServiceOps new board.
Mon, Apr 6, 2:45 PM · ServiceOps new, Hiddenparma
jijiki added a comment to T420223: High (relatively) number of memcached errors in eqiad.

Due to an unfortunate coincidence, this issue caused a paging event.

Mon, Apr 6, 11:05 AM · Infrastructure-Foundations, ServiceOps new, ServiceOps-Datastores
jijiki added a subtask for T422259: Create a wizard to support common combinations of patterns-actions: T409265: UI feature: tool for searching JA3N/JA4H.
Mon, Apr 6, 10:29 AM · ServiceOps new, Hiddenparma
jijiki added a parent task for T409265: UI feature: tool for searching JA3N/JA4H: T422259: Create a wizard to support common combinations of patterns-actions.
Mon, Apr 6, 10:29 AM · Hiddenparma

Fri, Apr 3

jijiki added a comment to T422259: Create a wizard to support common combinations of patterns-actions.

I just realised this is actually two tasks

Fri, Apr 3, 3:22 PM · ServiceOps new, Hiddenparma
jijiki created T422259: Create a wizard to support common combinations of patterns-actions.
Fri, Apr 3, 3:21 PM · ServiceOps new, Hiddenparma
jijiki changed the status of T421360: Upgrade mcrouter module to 1.3.5 from Open to In Progress.
Fri, Apr 3, 11:45 AM · Patch-For-Review, ServiceOps new, ServiceOps-Services-Oids
jijiki changed the status of T421504: mcrouter live config reload without pod restarts from Open to In Progress.
Fri, Apr 3, 11:45 AM · ServiceOps new, ServiceOps-Mediawiki, User-jijiki
jijiki changed the status of T421504: mcrouter live config reload without pod restarts, a subtask of T418263: mc10[55-72] implementation tracking, from Open to In Progress.
Fri, Apr 3, 11:45 AM · ServiceOps-Upgrades-Hardware, ServiceOps new
jijiki moved T421504: mcrouter live config reload without pod restarts from Scheduled (this Q) to In Progress on the ServiceOps new board.
Fri, Apr 3, 11:44 AM · ServiceOps new, ServiceOps-Mediawiki, User-jijiki
jijiki moved T421360: Upgrade mcrouter module to 1.3.5 from Scheduled (this Q) to In Progress on the ServiceOps new board.
Fri, Apr 3, 11:44 AM · Patch-For-Review, ServiceOps new, ServiceOps-Services-Oids
jijiki raised the priority of T411202: SquareOne Dashboards MVP and feedback from Medium to High.
Fri, Apr 3, 11:44 AM · ServiceOps new, Incident Tooling, ServiceOps-Mediawiki
jijiki closed T422178: Broken links on the memcached dashboard as Resolved.

Updated. I also sorted the dashboard's variables to be in line with the other ones.

Fri, Apr 3, 11:38 AM · ServiceOps new, Sustainability (Incident Followup)
jijiki updated the task description for T414663: SquareOne Dashboards: Guided Incident Response.
Fri, Apr 3, 10:24 AM · Incident Tooling, Epic
jijiki updated the task description for T414663: SquareOne Dashboards: Guided Incident Response.
Fri, Apr 3, 10:23 AM · Incident Tooling, Epic
jijiki created P90251 Grafana backport and train annotations.
Fri, Apr 3, 10:23 AM · Grafana
jijiki added a comment to T422130: Database servers in cluster(number) are overloaded.

FWIW, I'm still currently encountering this error on frwiki, and it prevents my local custom JS/CSS files from loading.

Unexpectedly not loaded:

  • Special:Mypage/common.css, Special:Mypage/common.js, Special:Mypage/vector.css, Special:Mypage/vector.js

Not impacted — loading as expected:

  • MediaWiki:Common.css, MediaWiki:Common.js, MediaWiki:Vector.css, MediaWiki:Vector.js
  • meta:Special:Mypage/global.css, meta:Special:Mypage/global.js
Fri, Apr 3, 9:56 AM · Wikimedia-Incident, SRE, DBA
jijiki changed the status of T422178: Broken links on the memcached dashboard from Open to In Progress.
Fri, Apr 3, 9:54 AM · ServiceOps new, Sustainability (Incident Followup)
jijiki triaged T422178: Broken links on the memcached dashboard as Medium priority.
Fri, Apr 3, 9:54 AM · ServiceOps new, Sustainability (Incident Followup)
jijiki moved T422178: Broken links on the memcached dashboard from Inbox to In Progress on the ServiceOps new board.
Fri, Apr 3, 9:53 AM · ServiceOps new, Sustainability (Incident Followup)

Thu, Apr 2

jijiki closed T420468: Retire mw-parsoid LVS service as Resolved.
Thu, Apr 2, 9:45 PM · ServiceOps-Services-Oids, ServiceOps new
jijiki added a comment to T414486: Upgrade AUX clusters to kubernetes 1.31.

Other services:

Thu, Apr 2, 10:11 AM · ServiceOps new, Infrastructure-Foundations, Kubernetes, Prod-Kubernetes