Page MenuHomePhabricator

dancy (Ahmon Dancy)
Staff Software Engineer, Release EngineeringAdministrator

Today

  • No visible events.

Tomorrow

  • No visible events.

Thursday

  • No visible events.

User Details

User Since
Jun 27 2020, 12:14 AM (298 w, 3 d)
Roles
Administrator
Availability
Available
IRC Nick
dancy
LDAP User
Ahmon Dancy
MediaWiki User
ADancy (WMF) [ Global Accounts ]

Recent Activity

Thu, Mar 12

dancy added a comment to T419819: QuickInstantCommons triggering "Database error".

I just noticed this while doing local catalyst-api development.

Thu, Mar 12, 3:24 PM · MediaWiki-extensions-QuickInstantCommons, Catalyst (PatchDemo)

Tue, Mar 10

dancy updated subscribers of T416627: Rearrange Patch Demo webroot.

@brennen I heard you were going to be looking into this stuff.

Tue, Mar 10, 8:27 PM · User-brennen, Catalyst (Luka Ijo Pimeja Jan)

Wed, Mar 4

dancy renamed T415284: Buildkit v0.28.0 released from Buildkit v0.27.1 released to Buildkit v0.28.0 released.
Wed, Mar 4, 8:24 PM · Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)
dancy assigned T417455: Blast of broken pipe errors from catalyst-api container log stream to jnuche.
Wed, Mar 4, 6:12 PM · Release-Engineering-Team (Doing 😎), Essential-Work, Catalyst (Luka Ijo Pimeja Jan)

Mon, Mar 2

dancy added a comment to T413809: 1.46.0-wmf.18 deployment blockers.

Notice to train operator @jeena: I merged https://gitlab.wikimedia.org/repos/releng/release/-/merge_requests/236 today. This removes the 5 minute sleep that occurs after a full image build (T390251) now that T412951 has been implemented. If this causes a problem, merge a revert commit for https://gitlab.wikimedia.org/repos/releng/release/-/merge_requests/236 to recover.

Mon, Mar 2, 5:37 PM · Release-Engineering-Team (Doing 😎), Essential-Work, Release, Train Deployments

Thu, Feb 26

dancy changed the status of T417427: Improve the patchdemo/catalyst-api development environment situation from Open to In Progress.
Thu, Feb 26, 8:04 PM · Catalyst (Luka Ijo Pimeja Jan), Patch-For-Review
dancy created T418529: Allow users to add unlisted wikimedia skins/extensions when creating a wiki.
Thu, Feb 26, 7:33 PM · Catalyst (PatchDemo)

Tue, Feb 24

dancy closed T418156: Adjustment of fpm pool settings is broken as Resolved.
Tue, Feb 24, 3:37 PM · Catalyst
dancy closed T417711: dev-images: bookworm-php-sury children fail to build on sury key as Resolved.

bookworm-php-sury and its dependent images have been updated.

Tue, Feb 24, 3:28 PM · MW-1.46-notes (1.46.0-wmf.19; 2026-03-10), Patch-For-Review, Catalyst (Radar), User-brennen, Release-Engineering-Team, dev-images

Mon, Feb 23

dancy changed the status of T417711: dev-images: bookworm-php-sury children fail to build on sury key from Open to In Progress.
Mon, Feb 23, 8:27 PM · MW-1.46-notes (1.46.0-wmf.19; 2026-03-10), Patch-For-Review, Catalyst (Radar), User-brennen, Release-Engineering-Team, dev-images
dancy claimed T417711: dev-images: bookworm-php-sury children fail to build on sury key.

I'm running into this problem today so I'll take this ticket.

Mon, Feb 23, 8:27 PM · MW-1.46-notes (1.46.0-wmf.19; 2026-03-10), Patch-For-Review, Catalyst (Radar), User-brennen, Release-Engineering-Team, dev-images
dancy created T418156: Adjustment of fpm pool settings is broken.
Mon, Feb 23, 4:43 PM · Catalyst

Fri, Feb 20

dancy closed T413807: 1.46.0-wmf.16 deployment blockers as Resolved.
Fri, Feb 20, 3:40 PM · Release-Engineering-Team (Priority Backlog 📥), Essential-Work, Release, Train Deployments

Thu, Feb 19

dancy merged T417904: MediaWiki\Revision\RevisionAccessException: Failed to load data blob from {address} for revision {revision} into T393237: Some en.wikipedia pageviews fatal "RevisionAccessException: Failed to load data blob from {address} for revision {revision}.".
Thu, Feb 19, 8:05 PM · Wikimedia-database-issue (Bad data), DBA, WMF-General-or-Unknown, Wikimedia-production-error
dancy merged task T417904: MediaWiki\Revision\RevisionAccessException: Failed to load data blob from {address} for revision {revision} into T393237: Some en.wikipedia pageviews fatal "RevisionAccessException: Failed to load data blob from {address} for revision {revision}.".
Thu, Feb 19, 8:05 PM · MediaWiki-Page-derived-data, Wikimedia-production-error
dancy updated the task description for T417904: MediaWiki\Revision\RevisionAccessException: Failed to load data blob from {address} for revision {revision}.
Thu, Feb 19, 8:05 PM · MediaWiki-Page-derived-data, Wikimedia-production-error
dancy updated the task description for T417933: Protect https://patchdemo.wmcloud.org/ from bots.
Thu, Feb 19, 8:01 PM · Catalyst (PatchDemo)
dancy updated the task description for T417933: Protect https://patchdemo.wmcloud.org/ from bots.
Thu, Feb 19, 7:59 PM · Catalyst (PatchDemo)
dancy created T417933: Protect https://patchdemo.wmcloud.org/ from bots.
Thu, Feb 19, 7:58 PM · Catalyst (PatchDemo)
dancy added a comment to T417820: TypeError: MediaWiki\Extension\OAuth\Entity\AccessTokenEntity::setUserIdentifier(): Argument #1 ($identifier) must be of type string, null given, called in AccessTokenEntity.php.

Thanks everyone!

Thu, Feb 19, 4:50 PM · MW-1.46-notes (1.46.0-wmf.16; 2026-02-17), MediaWiki-Platform-Team, MediaWiki-extensions-OAuth, Wikimedia-production-error
dancy created T417904: MediaWiki\Revision\RevisionAccessException: Failed to load data blob from {address} for revision {revision}.
Thu, Feb 19, 4:25 PM · MediaWiki-Page-derived-data, Wikimedia-production-error
dancy added a comment to T369186: PHP Warning: Cannot modify header information - headers already sent by includes/libs/http/MultiHttpClient.php.

A burst of these errors happened on Feb 16 between 13:35 and 13:40. Here's a fresh trace:

Thu, Feb 19, 4:16 PM · MW-Interfaces-Team, MediaWiki-libs-HTTP, SRE-swift-storage, Wikimedia-production-error

Wed, Feb 18

dancy added a comment to T417820: TypeError: MediaWiki\Extension\OAuth\Entity\AccessTokenEntity::setUserIdentifier(): Argument #1 ($identifier) must be of type string, null given, called in AccessTokenEntity.php.

Thanks for the work on this. I'm holding the train until the fix is backported.

Wed, Feb 18, 10:51 PM · MW-1.46-notes (1.46.0-wmf.16; 2026-02-17), MediaWiki-Platform-Team, MediaWiki-extensions-OAuth, Wikimedia-production-error
dancy added a parent task for T417820: TypeError: MediaWiki\Extension\OAuth\Entity\AccessTokenEntity::setUserIdentifier(): Argument #1 ($identifier) must be of type string, null given, called in AccessTokenEntity.php: T413807: 1.46.0-wmf.16 deployment blockers.
Wed, Feb 18, 10:49 PM · MW-1.46-notes (1.46.0-wmf.16; 2026-02-17), MediaWiki-Platform-Team, MediaWiki-extensions-OAuth, Wikimedia-production-error
dancy added a subtask for T413807: 1.46.0-wmf.16 deployment blockers: T417820: TypeError: MediaWiki\Extension\OAuth\Entity\AccessTokenEntity::setUserIdentifier(): Argument #1 ($identifier) must be of type string, null given, called in AccessTokenEntity.php.
Wed, Feb 18, 10:49 PM · Release-Engineering-Team (Priority Backlog 📥), Essential-Work, Release, Train Deployments
dancy added a comment to T417820: TypeError: MediaWiki\Extension\OAuth\Entity\AccessTokenEntity::setUserIdentifier(): Argument #1 ($identifier) must be of type string, null given, called in AccessTokenEntity.php.

Should probably do a quick fix rather than waiting for the client credentials issue to be fully worked out.

Wed, Feb 18, 8:16 PM · MW-1.46-notes (1.46.0-wmf.16; 2026-02-17), MediaWiki-Platform-Team, MediaWiki-extensions-OAuth, Wikimedia-production-error
dancy updated the task description for T417820: TypeError: MediaWiki\Extension\OAuth\Entity\AccessTokenEntity::setUserIdentifier(): Argument #1 ($identifier) must be of type string, null given, called in AccessTokenEntity.php.
Wed, Feb 18, 7:58 PM · MW-1.46-notes (1.46.0-wmf.16; 2026-02-17), MediaWiki-Platform-Team, MediaWiki-extensions-OAuth, Wikimedia-production-error
dancy added a project to T417820: TypeError: MediaWiki\Extension\OAuth\Entity\AccessTokenEntity::setUserIdentifier(): Argument #1 ($identifier) must be of type string, null given, called in AccessTokenEntity.php: MediaWiki-extensions-OAuth.
Wed, Feb 18, 7:44 PM · MW-1.46-notes (1.46.0-wmf.16; 2026-02-17), MediaWiki-Platform-Team, MediaWiki-extensions-OAuth, Wikimedia-production-error
dancy created T417820: TypeError: MediaWiki\Extension\OAuth\Entity\AccessTokenEntity::setUserIdentifier(): Argument #1 ($identifier) must be of type string, null given, called in AccessTokenEntity.php.
Wed, Feb 18, 7:43 PM · MW-1.46-notes (1.46.0-wmf.16; 2026-02-17), MediaWiki-Platform-Team, MediaWiki-extensions-OAuth, Wikimedia-production-error
dancy added a project to T417819: PHP Deprecated: Use of MediaWiki\Parser\ParserOutput::setOutputFlag with non-standard flag was deprecated in MediaWiki 1.45. [Called from MediaWiki\Parser\ParserOutput::initFromJson]: MediaWiki-Parser.
Wed, Feb 18, 7:38 PM · MediaWiki-Parser, Wikimedia-production-error
dancy created T417819: PHP Deprecated: Use of MediaWiki\Parser\ParserOutput::setOutputFlag with non-standard flag was deprecated in MediaWiki 1.45. [Called from MediaWiki\Parser\ParserOutput::initFromJson].
Wed, Feb 18, 7:36 PM · MediaWiki-Parser, Wikimedia-production-error

Feb 13 2026

dancy created T417455: Blast of broken pipe errors from catalyst-api container log stream.
Feb 13 2026, 9:59 PM · Release-Engineering-Team (Doing 😎), Essential-Work, Catalyst (Luka Ijo Pimeja Jan)
dancy created T417437: Configure Go build cache for catalyst-api.
Feb 13 2026, 5:45 PM · Catalyst
dancy created T417427: Improve the patchdemo/catalyst-api development environment situation.
Feb 13 2026, 4:19 PM · Catalyst (Luka Ijo Pimeja Jan), Patch-For-Review
dancy created T417426: Quota increase request for project catalyst-dev.
Feb 13 2026, 4:15 PM · Catalyst, Cloud-VPS (Quota-requests)
dancy closed T416465: Install K9s on main K3s host as Resolved.
Feb 13 2026, 4:06 PM · Catalyst (Luka Ijo Pimeja Jan), Essential-Work

Feb 11 2026

dancy added a comment to T417071: k3s.catalyst-dev.eqiad1.wikimedia.cloud DNS lookup not working.

Thanks @jnuche !

Feb 11 2026, 3:50 PM · Catalyst
dancy added a comment to T412951: Move the docker registry's /restricted prefix to Docker Distribution backed up by Ceph.

@elukey Congrats on the experiment!

Feb 11 2026, 3:41 PM · Epic, Kubernetes, ServiceOps new, Release-Engineering-Team (Radar), Ceph, SRE-swift-storage
dancy added a comment to T412951: Move the docker registry's /restricted prefix to Docker Distribution backed up by Ceph.

This includes two 5-mins wait time, so it could have potentially be completed in 27 mins.

elukey@deploy2002:~$ grep "300 seconds" /home/elukey/scap-image-build-and-push-log
11:25:42 [mediawiki-publish-83-next] Waiting 300 seconds for swift after full mediawiki image build (T390251)
11:32:11 [mediawiki-publish-83] Waiting 300 seconds for swift after full mediawiki image build (T390251)
Feb 11 2026, 3:41 PM · Epic, Kubernetes, ServiceOps new, Release-Engineering-Team (Radar), Ceph, SRE-swift-storage

Feb 10 2026

dancy added a comment to T417071: k3s.catalyst-dev.eqiad1.wikimedia.cloud DNS lookup not working.

I have bodged up an a record and ptr record for this host, so you should be able to access it now. Because I created the record by hand it will likely live on forever after the VM is deleted so I'm going to keep this task open for a while.

If someone has a story about what happened and/or if that VM worked in the past, I'm still interested!

Feb 10 2026, 10:03 PM · Catalyst
dancy added a comment to T417071: k3s.catalyst-dev.eqiad1.wikimedia.cloud DNS lookup not working.

ok! It would be useful to know if/when that VM was ever functional and reachable.

Feb 10 2026, 9:47 PM · Catalyst
dancy updated subscribers of T417071: k3s.catalyst-dev.eqiad1.wikimedia.cloud DNS lookup not working.
Feb 10 2026, 9:39 PM · Catalyst
dancy added a comment to T417071: k3s.catalyst-dev.eqiad1.wikimedia.cloud DNS lookup not working.

@dancy have you been able to access this VM in the past?

Feb 10 2026, 9:39 PM · Catalyst
dancy added a comment to T361724: scap should check if it is running within a tmux/screen.

Just to confirm, the terminal-multiplexer check feature works as expected, and all that remains is docs updates and comms to ops@lists before it's reenabled, correct? (i.e., the description remains accurate)

Feb 10 2026, 8:28 PM · ServiceOps new, Release-Engineering-Team (Priority Backlog 📥), Sustainability (Incident Followup), Scap
dancy created T417071: k3s.catalyst-dev.eqiad1.wikimedia.cloud DNS lookup not working.
Feb 10 2026, 8:26 PM · Catalyst

Feb 5 2026

dancy closed T406733: registry.cloud.releng.team returning 503s as Resolved.

I believe that the work done in T408740 has resolved this issue.

Feb 5 2026, 4:28 PM · collaboration-services, GitLab (CI & Job Runners)

Feb 2 2026

dancy updated subscribers of T412951: Move the docker registry's /restricted prefix to Docker Distribution backed up by Ceph.

@dancy @Scott_French I think we are ready to move forward with https://gerrit.wikimedia.org/r/c/operations/puppet/+/1229145, what do you think?

I am ok to proceed with the sync-world and full_image_rebuild during a MediaWiki infrastructure window (could be good event tomorrow's), but it will be during your night so you'll not be able to join (but if needed I can loop in Matthew to check the apus/Ceph side). The alternative is to schedule the upgrade during a MediaWiki infra window happening during your workday, totally fine for me. Lemme know what you prefer!

Feb 2 2026, 4:39 PM · Epic, Kubernetes, ServiceOps new, Release-Engineering-Team (Radar), Ceph, SRE-swift-storage

Jan 29 2026

dancy closed T415839: "The connection to the server kubemaster.svc.eqiad.wmnet:6443 was refused - did you specify the right host or port?" and freezing as Resolved.

@Reedy I deployed a change to the kubernetes deployment monitor code in scap. Now it will not give up monitoring when it gets an error from kubectl.

Jan 29 2026, 4:40 PM · Scap
dancy renamed T415284: Buildkit v0.28.0 released from Buildkit v0.27.0 released to Buildkit v0.27.1 released.
Jan 29 2026, 3:39 PM · Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)

Jan 28 2026

dancy closed T415827: scap help mentions PHP 7.x as Resolved.
Jan 28 2026, 9:26 PM · Documentation, Scap
dancy added a comment to T412941: Proposal: scap deploy-service.

Release-Engineering-Team could you please provide inputs on the Scap specific works in the description?

Jan 28 2026, 4:04 PM · User-jijiki, Epic, ServiceOps new, Scap, Release-Engineering-Team
dancy updated the task description for T412941: Proposal: scap deploy-service.
Jan 28 2026, 4:03 PM · User-jijiki, Epic, ServiceOps new, Scap, Release-Engineering-Team

Jan 27 2026

dancy closed T401197: Fix or remove `mediawiki-dev` from deployments charts as Resolved.

I have archived the documentation page so now we can remove the chart.

Jan 27 2026, 9:55 PM · ServiceOps new, MW-on-K8s

Jan 26 2026

dancy added a comment to T414864: Unblock running tests against Beta Cluster from Digital Ocean GitLab CI runners.

Here's a working .gitlab-ci.yml config using wmcs runners: https://gitlab.wikimedia.org/repos/m3api/tmp-m3api-oauth2/-/merge_requests/1/diffs

Jan 26 2026, 8:51 PM · GitLab, m3api, Beta-Cluster-Infrastructure
dancy added a comment to T414864: Unblock running tests against Beta Cluster from Digital Ocean GitLab CI runners.

@LucasWerkmeister Thanks for setting up https://gitlab.wikimedia.org/repos/m3api/tmp-m3api-oauth2 and adding me as a member. I tried to run the CI pipeline of that repo on the main branch (using the normal D.O. runners) but it fails: https://gitlab.wikimedia.org/repos/m3api/tmp-m3api-oauth2/-/jobs/727347. I need a working pipeline as a starting point for debugging.

Jan 26 2026, 5:03 PM · GitLab, m3api, Beta-Cluster-Infrastructure
dancy added a comment to T414273: SecurityPatchBot: Only change a task's status to 'open' if its current status is a 'closed status'.

Thanks @dancy :) FWIW, the issue (for me at least) would be less creating/submitting the actual patch, and more getting together a local environment that I'd be able to test it in...

Jan 26 2026, 3:52 PM · Scap

Jan 23 2026

dancy added a comment to T337570: Get GitLab to render `T{\d}+` in MR overviews, comments, etc. as links to Phabricator.

If a merge request description body consists of a single Bug: TXXXXX line, the link to Phabricator does not get created. I'm wondering if this is a new problem since the recent Gitlab UI changes. See the job link referenced in the task description for an example (https://gitlab.wikimedia.org/repos/abstract-wiki/wikifunctions/function-orchestrator/-/merge_requests/18).

Phabricator linking seems to be working on https://gitlab.wikimedia.org/toolforge-repos/versions/-/merge_requests/8. I wonder if there is something interesting going on in that wikifunctions MR description? Does the regex that is in use allow for trailing whitespace? Is there a newline requirement?

Jan 23 2026, 10:03 PM · Phabricator, GitLab (Integrations), User-brennen, Release-Engineering-Team (Priority Backlog 📥)
dancy closed T394302: One-click to retry a deployment that ended in error state as Resolved.

Deployed via scap 4.235.0.

Jan 23 2026, 4:55 PM · Essential-Work, Release-Engineering-Team, Scap (SpiderPig 🕸️)
dancy added a comment to T337570: Get GitLab to render `T{\d}+` in MR overviews, comments, etc. as links to Phabricator.

If a merge request description body consists of a single Bug: TXXXXX line, the link to Phabricator does not get created. I'm wondering if this is a new problem since the recent Gitlab UI changes. See the job link referenced in the task description for an example (https://gitlab.wikimedia.org/repos/abstract-wiki/wikifunctions/function-orchestrator/-/merge_requests/18).

Jan 23 2026, 4:53 PM · Phabricator, GitLab (Integrations), User-brennen, Release-Engineering-Team (Priority Backlog 📥)
dancy triaged T394302: One-click to retry a deployment that ended in error state as Low priority.
Jan 23 2026, 4:41 PM · Essential-Work, Release-Engineering-Team, Scap (SpiderPig 🕸️)

Jan 22 2026

dancy created T415284: Buildkit v0.28.0 released.
Jan 22 2026, 4:22 PM · Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)

Jan 21 2026

dancy closed T415062: scap --unlock-all asks for confirmation even with --bg flag, a subtask of T330996: Add scap lock/unlock steps to sre.switchdc.mediawiki cookbook, as Resolved.
Jan 21 2026, 4:47 PM · Essential-Work, ServiceOps-good-first-task, ServiceOps new, Release-Engineering-Team, Scap, Datacenter-Switchover, SRE
dancy closed T415062: scap --unlock-all asks for confirmation even with --bg flag as Resolved.
Jan 21 2026, 4:47 PM · Essential-Work, ServiceOps new, Release-Engineering-Team, Scap
dancy closed T414628: scap patch management commands should lock the staging directory as Resolved.

Change deployed via scap 4.234.0

Jan 21 2026, 4:28 PM · Patch-For-Review, Essential-Work, Scap, Release-Engineering-Team

Jan 20 2026

dancy added a comment to T412951: Move the docker registry's /restricted prefix to Docker Distribution backed up by Ceph.

Thanks! When you have a moment let me know if what I wrote above is sound, namely if it is ok to just start clean or not (basically if it is ok not to be able to rollback when we build/deploy the first time after the switch). If it is fine, I think that we can probably choose when to do the test after the SRE summit (that will happen next week). Ideally we could just rebuild all images and try to push/pull them, and see how it goes.

Jan 20 2026, 9:15 PM · Epic, Kubernetes, ServiceOps new, Release-Engineering-Team (Radar), Ceph, SRE-swift-storage
dancy added a comment to T415062: scap --unlock-all asks for confirmation even with --bg flag.

@Clement_Goubert Please use scap lock --unlock-all --yes <unlock reason> (note: --bg is not passed here).

Jan 20 2026, 5:01 PM · Essential-Work, ServiceOps new, Release-Engineering-Team, Scap
dancy added a comment to T414864: Unblock running tests against Beta Cluster from Digital Ocean GitLab CI runners.

The runners you would like unblocked are hosted on Digital Ocean. I do not think that it would be reasonable to open the Beta Cluster to the full DO IPv4 address space. If we have a fixed sub-space for egress we can unblock that. @dancy can you help me figure out if there is a restricted range that the runner egress through?

Jan 20 2026, 4:19 PM · GitLab, m3api, Beta-Cluster-Infrastructure
dancy added a comment to T414864: Unblock running tests against Beta Cluster from Digital Ocean GitLab CI runners.

FWIW, I’ve tried to get m3api-oauth2 CI running on the WMCS runners instead (wmcs tag), but so far haven’t managed to get Chrome/Chromium running there yet (latest job).

Jan 20 2026, 4:09 PM · GitLab, m3api, Beta-Cluster-Infrastructure
dancy added a comment to T412951: Move the docker registry's /restricted prefix to Docker Distribution backed up by Ceph.

Thanks for the report @elukey. This sounds very promising!

Jan 20 2026, 4:02 PM · Epic, Kubernetes, ServiceOps new, Release-Engineering-Team (Radar), Ceph, SRE-swift-storage

Jan 16 2026

dancy added a comment to T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd").

I confirmed that I did receive an email when I created a beta wiki account.

Jan 16 2026, 5:12 PM · Patch-For-Review, collaboration-services, Beta-Cluster-Infrastructure
dancy added a comment to T414826: Found non-revoked Puppet certificates for 1 deleted instances on deployment-puppetserver-1.

xref T412975

Jan 16 2026, 4:48 PM · Beta-Cluster-Infrastructure
dancy closed T414826: Found non-revoked Puppet certificates for 1 deleted instances on deployment-puppetserver-1 as Resolved.

I ran sudo /usr/local/sbin/clean-stale-puppet-certs --clean on deployment-puppetserver-1.deployment-prep to take care of this.

Jan 16 2026, 4:48 PM · Beta-Cluster-Infrastructure
dancy closed T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd"), a subtask of T401839: Migrate deployment-prep away from Debian Bullseye to Bookworm/Trixie, as Resolved.
Jan 16 2026, 4:37 PM · Epic, Release-Engineering-Team (Priority Backlog 📥), Cloud-VPS (Debian Bullseye Deprecation), Beta-Cluster-Infrastructure
dancy closed T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd") as Resolved.

Thanks for the work on this @thcipriani !

Jan 16 2026, 4:37 PM · Patch-For-Review, collaboration-services, Beta-Cluster-Infrastructure
dancy updated subscribers of T401197: Fix or remove `mediawiki-dev` from deployments charts.

@jeena @thcipriani What's the status of the mediawiki-dev deployment-chart? I found that it is referenced in https://wikitech.wikimedia.org/wiki/Deployment_pipeline/Migration/Tutorial

Jan 16 2026, 3:54 PM · ServiceOps new, MW-on-K8s

Jan 15 2026

dancy added a comment to T330996: Add scap lock/unlock steps to sre.switchdc.mediawiki cookbook.

@Blake I've installed a new release of scap on the deploy servers. You can now use scap lock --all --bg and scap lock --unlock-all to achieve your goal. I recommend testing things out on deployment.eqiad.wmnet well in advance of the datacenter switch. Let me know if you have any issues.

Jan 15 2026, 4:03 PM · Essential-Work, ServiceOps-good-first-task, ServiceOps new, Release-Engineering-Team, Scap, Datacenter-Switchover, SRE
dancy updated the task description for T330996: Add scap lock/unlock steps to sre.switchdc.mediawiki cookbook.
Jan 15 2026, 4:01 PM · Essential-Work, ServiceOps-good-first-task, ServiceOps new, Release-Engineering-Team, Scap, Datacenter-Switchover, SRE

Jan 14 2026

dancy renamed T414628: scap patch management commands should lock the staging directory from sacp patch management commands should lock the staging directory to scap patch management commands should lock the staging directory.
Jan 14 2026, 9:39 PM · Patch-For-Review, Essential-Work, Scap, Release-Engineering-Team
dancy created T414628: scap patch management commands should lock the staging directory.
Jan 14 2026, 8:34 PM · Patch-For-Review, Essential-Work, Scap, Release-Engineering-Team
dancy added a comment to T414273: SecurityPatchBot: Only change a task's status to 'open' if its current status is a 'closed status'.

@A_smart_kitten patches are welcome! Let me know if you need help with getting something together.

Jan 14 2026, 7:26 PM · Scap

Jan 13 2026

dancy updated the task description for T330996: Add scap lock/unlock steps to sre.switchdc.mediawiki cookbook.
Jan 13 2026, 10:17 PM · Essential-Work, ServiceOps-good-first-task, ServiceOps new, Release-Engineering-Team, Scap, Datacenter-Switchover, SRE
dancy added a project to T330996: Add scap lock/unlock steps to sre.switchdc.mediawiki cookbook: Release-Engineering-Team.
Jan 13 2026, 8:29 PM · Essential-Work, ServiceOps-good-first-task, ServiceOps new, Release-Engineering-Team, Scap, Datacenter-Switchover, SRE
dancy added a comment to T330996: Add scap lock/unlock steps to sre.switchdc.mediawiki cookbook.

Potential ideas:

  • Drop a lock file on the deployment server that scap detects, remove it at a later step
  • Add a switch to scap lock --all that puts the lock on and returns, add a scap unlock --all command
Jan 13 2026, 6:07 PM · Essential-Work, ServiceOps-good-first-task, ServiceOps new, Release-Engineering-Team, Scap, Datacenter-Switchover, SRE
dancy closed T414504: Puppet failure on etherpad-bookworm.devtools.eqiad1.wikimedia.cloud as Resolved.

Added

profile::tlsproxy::envoy::upstream_sni: null
profile::tlsproxy::envoy::upstream_tls: false

to devtools project puppet according to @bd808's recommendation.

Jan 13 2026, 5:18 PM · Essential-Work, Release-Engineering-Team
dancy added a comment to T414504: Puppet failure on etherpad-bookworm.devtools.eqiad1.wikimedia.cloud.

Looks similar to T414304

Jan 13 2026, 5:12 PM · Essential-Work, Release-Engineering-Team
dancy created T414504: Puppet failure on etherpad-bookworm.devtools.eqiad1.wikimedia.cloud.
Jan 13 2026, 5:10 PM · Essential-Work, Release-Engineering-Team
dancy closed T413193: git::clone can fail to checkout its remote branch, leading to unrecoverable failure as Resolved.
Jan 13 2026, 5:06 PM · Infrastructure-Foundations, SRE

Jan 12 2026

dancy closed T414032: Add yubikey ssh key for dancy as Resolved.
Jan 12 2026, 6:16 PM · SRE, Release-Engineering-Team, SRE-Access-Requests
dancy added a comment to T414366: Project deployment-prep instance deployment-mx03 is down.

I turned off this instance as part of T412975

Jan 12 2026, 5:42 PM · Beta-Cluster-Infrastructure
dancy added a comment to T399348: Wikifunctions function orchestrator and evaluator test suites failing on GitLab CI with OOM errors.

@dancy I tried changing the tags here and am getting the same error... maybe I'm still doing something wrong though?

Jan 12 2026, 5:41 PM · Abstract Wikipedia team, GitLab (CI & Job Runners), Essential-Work, collaboration-services, Release-Engineering-Team, Patch-For-Review, function-orchestrator, function-evaluator
dancy claimed T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd").
Jan 12 2026, 5:07 PM · Patch-For-Review, collaboration-services, Beta-Cluster-Infrastructure
dancy added a comment to T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd").
Jan 12 2026, 5:04 PM · Patch-For-Review, collaboration-services, Beta-Cluster-Infrastructure

Jan 9 2026

dancy added a comment to T399348: Wikifunctions function orchestrator and evaluator test suites failing on GitLab CI with OOM errors.

After reading the history of this ticket, it's not clear to me if the memory-optimized runners have been tried yet. Can someone try and confirm?

Heya @dancy, this is done by tags: [wmcs] correct (looking at T399348#11007932)? That tag been added here for the orchestrator. So if that tag looks correct, then it has been tried.

Jan 9 2026, 7:33 PM · Abstract Wikipedia team, GitLab (CI & Job Runners), Essential-Work, collaboration-services, Release-Engineering-Team, Patch-For-Review, function-orchestrator, function-evaluator
dancy added a comment to T399348: Wikifunctions function orchestrator and evaluator test suites failing on GitLab CI with OOM errors.

After reading the history of this ticket, it's not clear to me if the memory-optimized runners have been tried yet. Can someone try and confirm?

Jan 9 2026, 3:55 PM · Abstract Wikipedia team, GitLab (CI & Job Runners), Essential-Work, collaboration-services, Release-Engineering-Team, Patch-For-Review, function-orchestrator, function-evaluator

Jan 8 2026

dancy updated the task description for T414032: Add yubikey ssh key for dancy.
Jan 8 2026, 3:49 PM · SRE, Release-Engineering-Team, SRE-Access-Requests

Jan 7 2026

dancy updated the task description for T414032: Add yubikey ssh key for dancy.
Jan 7 2026, 10:19 PM · SRE, Release-Engineering-Team, SRE-Access-Requests
dancy created T414032: Add yubikey ssh key for dancy.
Jan 7 2026, 10:11 PM · SRE, Release-Engineering-Team, SRE-Access-Requests

Jan 6 2026

dancy closed T395097: Recent incidents of buildkitd's storage volume filling up as Resolved.

@Andrew has since changed where he performing the build of the wikitech-static container image, so the main cause of this ticket has been resolved.

Jan 6 2026, 8:15 PM · Essential-Work, cloud-services-team, Release-Engineering-Team (Priority Backlog 📥), GitLab (CI & Job Runners)
dancy closed T412869: Buildkit v0.26.3 released as Resolved.
Jan 6 2026, 3:43 PM · Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)

Jan 5 2026

dancy changed the status of T412869: Buildkit v0.26.3 released from Open to In Progress.
Jan 5 2026, 7:02 PM · Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)