
dancy (Ahmon Dancy)
Staff Software Engineer, Release Engineering · Administrator

User Details

User Since
Jun 27 2020, 12:14 AM (293 w, 2 d)
Roles
Administrator
Availability
Available
IRC Nick
dancy
LDAP User
Ahmon Dancy
MediaWiki User
ADancy (WMF) [ Global Accounts ]

Recent Activity

Thu, Feb 5

dancy closed T406733: registry.cloud.releng.team returning 503s as Resolved.

I believe that the work done in T408740 has resolved this issue.

Thu, Feb 5, 4:28 PM · collaboration-services, GitLab (CI & Job Runners)

Mon, Feb 2

dancy updated subscribers of T412951: Move the docker registry's /restricted prefix to Docker Distribution backed up by Ceph.

@dancy @Scott_French I think we are ready to move forward with https://gerrit.wikimedia.org/r/c/operations/puppet/+/1229145, what do you think?

I am ok to proceed with the sync-world and full_image_rebuild during a MediaWiki infrastructure window (even tomorrow's could be good), but it will be during your night so you will not be able to join (if needed, I can loop in Matthew to check the apus/Ceph side). The alternative is to schedule the upgrade during a MediaWiki infra window that falls within your workday, which is totally fine for me. Lemme know what you prefer!

Mon, Feb 2, 4:39 PM · Patch-For-Review, Epic, Kubernetes, ServiceOps new, Release-Engineering-Team (Radar), Ceph, SRE-swift-storage

Thu, Jan 29

dancy closed T415839: "The connection to the server kubemaster.svc.eqiad.wmnet:6443 was refused - did you specify the right host or port?" and freezing as Resolved.

@Reedy I deployed a change to the kubernetes deployment monitor code in scap. Now it will not give up monitoring when it gets an error from kubectl.
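
The behavior described above, continuing to monitor after a kubectl error instead of giving up, can be sketched roughly as follows. This is an illustration only, not scap's actual code: `monitor_until_ready`, `make_flaky_check`, and `check` are hypothetical names, and treating every exception as transient is an assumption.

```python
import time
from typing import Callable


def monitor_until_ready(check: Callable[[], bool], attempts: int, delay: float = 0.0) -> bool:
    """Poll check() until it returns True or attempts run out.

    An exception from check() is logged and treated as transient
    (assumption), so a single failed call does not end the monitoring.
    """
    for _ in range(attempts):
        try:
            if check():
                return True
        except Exception as exc:
            print(f"check failed ({exc}); still monitoring")
        time.sleep(delay)
    return False


def make_flaky_check(failures: int) -> Callable[[], bool]:
    """Build a stand-in check that raises `failures` times, then succeeds."""
    remaining = [failures]

    def check() -> bool:
        if remaining[0] > 0:
            remaining[0] -= 1
            raise ConnectionRefusedError("connection to the server was refused")
        return True

    return check


if __name__ == "__main__":
    # Two simulated connection errors, then the check succeeds.
    print(monitor_until_ready(make_flaky_check(2), attempts=5))
```

Bounding the loop with `attempts` keeps the sketch from hanging forever if the API server never comes back; how the real monitor bounds its wait is not stated in the comment.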

Thu, Jan 29, 4:40 PM · Scap
dancy renamed T415284: Buildkit v0.27.1 released from Buildkit v0.27.0 released to Buildkit v0.27.1 released.
Thu, Jan 29, 3:39 PM · Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)

Wed, Jan 28

dancy closed T415827: scap help mentions PHP 7.x as Resolved.
Wed, Jan 28, 9:26 PM · Documentation, Scap
dancy added a comment to T412941: Proposal: scap deploy-service.

Release-Engineering-Team could you please provide input on the Scap-specific work in the description?

Wed, Jan 28, 4:04 PM · User-jijiki, Epic, ServiceOps new, Scap, Release-Engineering-Team
dancy updated the task description for T412941: Proposal: scap deploy-service.
Wed, Jan 28, 4:03 PM · User-jijiki, Epic, ServiceOps new, Scap, Release-Engineering-Team

Tue, Jan 27

dancy closed T401197: Fix or remove `mediawiki-dev` from deployments charts as Resolved.

I have archived the documentation page so now we can remove the chart.

Tue, Jan 27, 9:55 PM · ServiceOps new, MW-on-K8s

Mon, Jan 26

dancy added a comment to T414864: Unblock running tests against Beta Cluster from Digital Ocean GitLab CI runners.

Here's a working .gitlab-ci.yml config using wmcs runners: https://gitlab.wikimedia.org/repos/m3api/tmp-m3api-oauth2/-/merge_requests/1/diffs

Mon, Jan 26, 8:51 PM · GitLab, m3api, Beta-Cluster-Infrastructure
dancy added a comment to T414864: Unblock running tests against Beta Cluster from Digital Ocean GitLab CI runners.

@LucasWerkmeister Thanks for setting up https://gitlab.wikimedia.org/repos/m3api/tmp-m3api-oauth2 and adding me as a member. I tried to run the CI pipeline of that repo on the main branch (using the normal D.O. runners) but it fails: https://gitlab.wikimedia.org/repos/m3api/tmp-m3api-oauth2/-/jobs/727347. I need a working pipeline as a starting point for debugging.

Mon, Jan 26, 5:03 PM · GitLab, m3api, Beta-Cluster-Infrastructure
dancy added a comment to T414273: SecurityPatchBot: Only change a task's status to 'open' if its current status is a 'closed status'.

Thanks @dancy :) FWIW, the issue (for me at least) would be less creating/submitting the actual patch, and more getting together a local environment that I'd be able to test it in...

Mon, Jan 26, 3:52 PM · Scap

Fri, Jan 23

dancy added a comment to T337570: Get GitLab to render `T{\d}+` in MR overviews, comments, etc. as links to Phabricator.

If a merge request description body consists of a single Bug: TXXXXX line, the link to Phabricator does not get created. I'm wondering if this is a new problem since the recent Gitlab UI changes. See the job link referenced in the task description for an example (https://gitlab.wikimedia.org/repos/abstract-wiki/wikifunctions/function-orchestrator/-/merge_requests/18).

Phabricator linking seems to be working on https://gitlab.wikimedia.org/toolforge-repos/versions/-/merge_requests/8. I wonder if there is something interesting going on in that wikifunctions MR description? Does the regex that is in use allow for trailing whitespace? Is there a newline requirement?
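
The whitespace and newline questions above can be probed with a small script. This is a sketch under assumptions: the pattern below is hypothetical and is not the regex GitLab is actually configured with, and `find_task_ids` is an illustrative helper.

```python
import re

# Hypothetical pattern, NOT the regex configured in GitLab.
# Matches a Phabricator task ID such as T337570 as a standalone token.
TASK_RE = re.compile(r"\bT(\d+)\b")


def find_task_ids(description: str) -> list[str]:
    """Return every task ID found in a merge request description."""
    return ["T" + digits for digits in TASK_RE.findall(description)]


if __name__ == "__main__":
    samples = [
        "Bug: T337570",                    # single line, no trailing newline
        "Bug: T337570   ",                 # trailing whitespace
        "Bug: T337570\n",                  # trailing newline
        "Fix thing\n\nBug: T337570\r\n",   # CRLF line ending
    ]
    for text in samples:
        print(repr(text), "->", find_task_ids(text))
```

A pattern anchored with `$` or requiring a following newline would behave differently on these inputs, which is the kind of difference the questions above are asking about.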

Fri, Jan 23, 10:03 PM · Phabricator, GitLab (Integrations), User-brennen, Release-Engineering-Team (Priority Backlog 📥)
dancy closed T394302: One-click to retry a deployment that ended in error state as Resolved.

Deployed via scap 4.235.0.

Fri, Jan 23, 4:55 PM · Release-Engineering-Team, Scap (SpiderPig 🕸️)
dancy added a comment to T337570: Get GitLab to render `T{\d}+` in MR overviews, comments, etc. as links to Phabricator.

If a merge request description body consists of a single Bug: TXXXXX line, the link to Phabricator does not get created. I'm wondering if this is a new problem since the recent Gitlab UI changes. See the job link referenced in the task description for an example (https://gitlab.wikimedia.org/repos/abstract-wiki/wikifunctions/function-orchestrator/-/merge_requests/18).

Fri, Jan 23, 4:53 PM · Phabricator, GitLab (Integrations), User-brennen, Release-Engineering-Team (Priority Backlog 📥)
dancy triaged T394302: One-click to retry a deployment that ended in error state as Low priority.
Fri, Jan 23, 4:41 PM · Release-Engineering-Team, Scap (SpiderPig 🕸️)

Thu, Jan 22

dancy created T415284: Buildkit v0.27.1 released.
Thu, Jan 22, 4:22 PM · Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)

Wed, Jan 21

dancy closed T415062: scap --unlock-all asks for confirmation even with --bg flag, a subtask of T330996: Add scap lock/unlock steps to sre.switchdc.mediawiki cookbook, as Resolved.
Wed, Jan 21, 4:47 PM · ServiceOps-good-first-task, ServiceOps new, Release-Engineering-Team, Scap, Datacenter-Switchover, SRE
dancy closed T415062: scap --unlock-all asks for confirmation even with --bg flag as Resolved.
Wed, Jan 21, 4:47 PM · ServiceOps new, Release-Engineering-Team, Scap
dancy closed T414628: scap patch management commands should lock the staging directory as Resolved.

Change deployed via scap 4.234.0

Wed, Jan 21, 4:28 PM · Patch-For-Review, Essential-Work, Scap, Release-Engineering-Team

Tue, Jan 20

dancy added a comment to T412951: Move the docker registry's /restricted prefix to Docker Distribution backed up by Ceph.

Thanks! When you have a moment let me know if what I wrote above is sound, namely if it is ok to just start clean or not (basically if it is ok not to be able to rollback when we build/deploy the first time after the switch). If it is fine, I think that we can probably choose when to do the test after the SRE summit (that will happen next week). Ideally we could just rebuild all images and try to push/pull them, and see how it goes.

Tue, Jan 20, 9:15 PM · Patch-For-Review, Epic, Kubernetes, ServiceOps new, Release-Engineering-Team (Radar), Ceph, SRE-swift-storage
dancy added a comment to T415062: scap --unlock-all asks for confirmation even with --bg flag.

@Clement_Goubert Please use scap lock --unlock-all --yes <unlock reason> (note: --bg is not passed here).

Tue, Jan 20, 5:01 PM · ServiceOps new, Release-Engineering-Team, Scap
dancy added a comment to T414864: Unblock running tests against Beta Cluster from Digital Ocean GitLab CI runners.

The runners you would like unblocked are hosted on Digital Ocean. I do not think that it would be reasonable to open the Beta Cluster to the full DO IPv4 address space. If we have a fixed sub-space for egress we can unblock that. @dancy can you help me figure out if there is a restricted range that the runners egress through?

Tue, Jan 20, 4:19 PM · GitLab, m3api, Beta-Cluster-Infrastructure
dancy added a comment to T414864: Unblock running tests against Beta Cluster from Digital Ocean GitLab CI runners.

FWIW, I’ve tried to get m3api-oauth2 CI running on the WMCS runners instead (wmcs tag), but so far haven’t managed to get Chrome/Chromium running there yet (latest job).

Tue, Jan 20, 4:09 PM · GitLab, m3api, Beta-Cluster-Infrastructure
dancy added a comment to T412951: Move the docker registry's /restricted prefix to Docker Distribution backed up by Ceph.

Thanks for the report @elukey. This sounds very promising!

Tue, Jan 20, 4:02 PM · Patch-For-Review, Epic, Kubernetes, ServiceOps new, Release-Engineering-Team (Radar), Ceph, SRE-swift-storage

Fri, Jan 16

dancy added a comment to T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd").

I confirmed that I did receive an email when I created a beta wiki account.

Fri, Jan 16, 5:12 PM · Patch-For-Review, collaboration-services, Beta-Cluster-Infrastructure
dancy added a comment to T414826: Found non-revoked Puppet certificates for 1 deleted instances on deployment-puppetserver-1.

xref T412975

Fri, Jan 16, 4:48 PM · Beta-Cluster-Infrastructure
dancy closed T414826: Found non-revoked Puppet certificates for 1 deleted instances on deployment-puppetserver-1 as Resolved.

I ran sudo /usr/local/sbin/clean-stale-puppet-certs --clean on deployment-puppetserver-1.deployment-prep to take care of this.

Fri, Jan 16, 4:48 PM · Beta-Cluster-Infrastructure
dancy closed T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd"), a subtask of T401839: Migrate deployment-prep away from Debian Bullseye to Bookworm/Trixie, as Resolved.
Fri, Jan 16, 4:37 PM · Epic, Release-Engineering-Team (Priority Backlog 📥), Cloud-VPS (Debian Bullseye Deprecation), Beta-Cluster-Infrastructure
dancy closed T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd") as Resolved.

Thanks for the work on this @thcipriani !

Fri, Jan 16, 4:37 PM · Patch-For-Review, collaboration-services, Beta-Cluster-Infrastructure
dancy updated subscribers of T401197: Fix or remove `mediawiki-dev` from deployments charts.

@jeena @thcipriani What's the status of the mediawiki-dev deployment-chart? I found that it is referenced in https://wikitech.wikimedia.org/wiki/Deployment_pipeline/Migration/Tutorial

Fri, Jan 16, 3:54 PM · ServiceOps new, MW-on-K8s

Thu, Jan 15

dancy added a comment to T330996: Add scap lock/unlock steps to sre.switchdc.mediawiki cookbook.

@Blake I've installed a new release of scap on the deploy servers. You can now use scap lock --all --bg and scap lock --unlock-all to achieve your goal. I recommend testing things out on deployment.eqiad.wmnet well in advance of the datacenter switch. Let me know if you have any issues.

Thu, Jan 15, 4:03 PM · ServiceOps-good-first-task, ServiceOps new, Release-Engineering-Team, Scap, Datacenter-Switchover, SRE
dancy updated the task description for T330996: Add scap lock/unlock steps to sre.switchdc.mediawiki cookbook.
Thu, Jan 15, 4:01 PM · ServiceOps-good-first-task, ServiceOps new, Release-Engineering-Team, Scap, Datacenter-Switchover, SRE

Wed, Jan 14

dancy renamed T414628: scap patch management commands should lock the staging directory from sacp patch management commands should lock the staging directory to scap patch management commands should lock the staging directory.
Wed, Jan 14, 9:39 PM · Patch-For-Review, Essential-Work, Scap, Release-Engineering-Team
dancy created T414628: scap patch management commands should lock the staging directory.
Wed, Jan 14, 8:34 PM · Patch-For-Review, Essential-Work, Scap, Release-Engineering-Team
dancy added a comment to T414273: SecurityPatchBot: Only change a task's status to 'open' if its current status is a 'closed status'.

@A_smart_kitten patches are welcome! Let me know if you need help with getting something together.

Wed, Jan 14, 7:26 PM · Scap

Tue, Jan 13

dancy updated the task description for T330996: Add scap lock/unlock steps to sre.switchdc.mediawiki cookbook.
Tue, Jan 13, 10:17 PM · ServiceOps-good-first-task, ServiceOps new, Release-Engineering-Team, Scap, Datacenter-Switchover, SRE
dancy added a project to T330996: Add scap lock/unlock steps to sre.switchdc.mediawiki cookbook: Release-Engineering-Team.
Tue, Jan 13, 8:29 PM · ServiceOps-good-first-task, ServiceOps new, Release-Engineering-Team, Scap, Datacenter-Switchover, SRE
dancy added a comment to T330996: Add scap lock/unlock steps to sre.switchdc.mediawiki cookbook.

Potential ideas:

  • Drop a lock file on the deployment server that scap detects, remove it at a later step
  • Add a switch to scap lock --all that puts the lock on and returns, add a scap unlock --all command
Tue, Jan 13, 6:07 PM · ServiceOps-good-first-task, ServiceOps new, Release-Engineering-Team, Scap, Datacenter-Switchover, SRE
dancy closed T414504: Puppet failure on etherpad-bookworm.devtools.eqiad1.wikimedia.cloud as Resolved.

Added

profile::tlsproxy::envoy::upstream_sni: null
profile::tlsproxy::envoy::upstream_tls: false

to devtools project puppet according to @bd808's recommendation.

Tue, Jan 13, 5:18 PM · Release-Engineering-Team
dancy added a comment to T414504: Puppet failure on etherpad-bookworm.devtools.eqiad1.wikimedia.cloud.

Looks similar to T414304

Tue, Jan 13, 5:12 PM · Release-Engineering-Team
dancy created T414504: Puppet failure on etherpad-bookworm.devtools.eqiad1.wikimedia.cloud.
Tue, Jan 13, 5:10 PM · Release-Engineering-Team
dancy closed T413193: git::clone can fail to checkout its remote branch, leading to unrecoverable failure as Resolved.
Tue, Jan 13, 5:06 PM · Infrastructure-Foundations, SRE

Mon, Jan 12

dancy closed T414032: Add yubikey ssh key for dancy as Resolved.
Mon, Jan 12, 6:16 PM · SRE, Release-Engineering-Team, SRE-Access-Requests
dancy added a comment to T414366: Project deployment-prep instance deployment-mx03 is down.

I turned off this instance as part of T412975

Mon, Jan 12, 5:42 PM · Beta-Cluster-Infrastructure
dancy added a comment to T399348: Wikifunctions function orchestrator and evaluator test suites failing on GitLab CI with OOM errors.

@dancy I tried changing the tags here and am getting the same error... maybe I'm still doing something wrong though?

Mon, Jan 12, 5:41 PM · Abstract Wikipedia team, GitLab (CI & Job Runners), Essential-Work, collaboration-services, Release-Engineering-Team, Patch-For-Review, function-orchestrator, function-evaluator
dancy claimed T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd").
Mon, Jan 12, 5:07 PM · Patch-For-Review, collaboration-services, Beta-Cluster-Infrastructure
dancy added a comment to T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd").
Mon, Jan 12, 5:04 PM · Patch-For-Review, collaboration-services, Beta-Cluster-Infrastructure

Jan 9 2026

dancy added a comment to T399348: Wikifunctions function orchestrator and evaluator test suites failing on GitLab CI with OOM errors.

After reading the history of this ticket, it's not clear to me if the memory-optimized runners have been tried yet. Can someone try and confirm?

Heya @dancy, this is done by tags: [wmcs], correct (looking at T399348#11007932)? That tag has been added here for the orchestrator. So if that tag looks correct, then it has been tried.

Jan 9 2026, 7:33 PM · Abstract Wikipedia team, GitLab (CI & Job Runners), Essential-Work, collaboration-services, Release-Engineering-Team, Patch-For-Review, function-orchestrator, function-evaluator
dancy added a comment to T399348: Wikifunctions function orchestrator and evaluator test suites failing on GitLab CI with OOM errors.

After reading the history of this ticket, it's not clear to me if the memory-optimized runners have been tried yet. Can someone try and confirm?

Jan 9 2026, 3:55 PM · Abstract Wikipedia team, GitLab (CI & Job Runners), Essential-Work, collaboration-services, Release-Engineering-Team, Patch-For-Review, function-orchestrator, function-evaluator

Jan 8 2026

dancy updated the task description for T414032: Add yubikey ssh key for dancy.
Jan 8 2026, 3:49 PM · SRE, Release-Engineering-Team, SRE-Access-Requests

Jan 7 2026

dancy updated the task description for T414032: Add yubikey ssh key for dancy.
Jan 7 2026, 10:19 PM · SRE, Release-Engineering-Team, SRE-Access-Requests
dancy created T414032: Add yubikey ssh key for dancy.
Jan 7 2026, 10:11 PM · SRE, Release-Engineering-Team, SRE-Access-Requests

Jan 6 2026

dancy closed T395097: Recent incidents of buildkitd's storage volume filling up as Resolved.

@Andrew has since changed where he is performing the build of the wikitech-static container image, so the main cause of this ticket has been resolved.

Jan 6 2026, 8:15 PM · cloud-services-team, Release-Engineering-Team (Priority Backlog 📥), GitLab (CI & Job Runners)
dancy closed T412869: Buildkit v0.26.3 released as Resolved.
Jan 6 2026, 3:43 PM · Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)

Jan 5 2026

dancy changed the status of T412869: Buildkit v0.26.3 released from Open to In Progress.
Jan 5 2026, 7:02 PM · Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)
dancy closed T412868: Act as if testwikis are not part of group0 as Resolved.
Jan 5 2026, 6:22 PM · Release-Engineering-Team (Doing 😎), Essential-Work, Scap (SpiderPig 🕸️)
dancy updated subscribers of T413701: ReleaseTaggerBot is down for three weeks.

@Jdforrester-WMF FYI

Jan 5 2026, 4:50 PM · Regression, ReleaseTaggerBot
dancy updated subscribers of T413701: ReleaseTaggerBot is down for three weeks.

@thcipriani Making sure you see this.

Jan 5 2026, 4:49 PM · Regression, ReleaseTaggerBot

Dec 19 2025

dancy added a comment to T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd").

@dancy do you have any clues as to the need for this MX server?

Dec 19 2025, 4:52 PM · Patch-For-Review, collaboration-services, Beta-Cluster-Infrastructure
dancy updated subscribers of T413193: git::clone can fail to checkout its remote branch, leading to unrecoverable failure.

@fnegri wrote in T373815:
For future reference, I suspect there was a failure in the bash command substitution that is used to determine the default branch. Below is the PCC output from my patch:

Dec 19 2025, 4:18 PM · Infrastructure-Foundations, SRE
dancy closed T408277: 1.46.0-wmf.7 deployment blockers as Resolved.
Dec 19 2025, 3:38 PM · Essential-Work, Release-Engineering-Team (Priority Backlog 📥), Release, Train Deployments

Dec 18 2025

dancy added a comment to T413102: Add scap wheels for Trixie.
dancy@deploy2002:~$ scap install-world --version 4.230.0 --limit 'ml-build1001*'
Scap version "4.230.0" will be installed on 1 host(s). Proceed? [y/N]: y
20:48:09 Installing scap version "4.230.0" for 1 host(s)
20:48:09 Downloading version "4.230.0" locally
INFO: Scap version "4.230.0" for distribution "bullseye" already exists locally. Nothing to retrieve
INFO: Scap version "4.230.0" for distribution "bookworm" already exists locally. Nothing to retrieve
INFO: Scap version "4.230.0" for distribution "trixie" successfully extracted at /var/lib/scap/scap-wheels/trixie/4.230.0
INFO: Distributions downloaded. Skipping local installation on primary as requested
20:48:13 Syncing masters
20:48:16 scap-sync-to-masters: 100% (in-flight: 0; ok: 1; fail: 0; left: 0)
20:48:16 Syncing installation material to 1 scap targets from "deploy1003.eqiad.wmnet"
20:48:18 scap-sync-wheels-to-targets: 100% (in-flight: 0; ok: 1; fail: 0; left: 0)
20:48:19 scap-sync-install-script-to-targets: 100% (in-flight: 0; ok: 1; fail: 0; left: 0)
20:48:19 Installing 1 scap targets
20:49:06 scap-install-to-targets: 100% (in-flight: 0; ok: 1; fail: 0; left: 0)
20:49:06 Installation of scap version "4.230.0" completed for 1 hosts
Dec 18 2025, 8:50 PM · Essential-Work, Release-Engineering-Team (Doing 😎), Scap
dancy changed the status of T413102: Add scap wheels for Trixie from Open to In Progress.
Dec 18 2025, 5:05 PM · Essential-Work, Release-Engineering-Team (Doing 😎), Scap

Dec 17 2025

dancy added a comment to T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd").
dancy@deployment-mx03:~$ lsb_release  -a
No LSB modules are available.
Distributor ID: Debian
Description:    Debian GNU/Linux 11 (bullseye)
Release:        11
Codename:       bullseye
Dec 17 2025, 4:03 PM · Patch-For-Review, collaboration-services, Beta-Cluster-Infrastructure
dancy updated subscribers of T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd").

Presumably caused by https://gerrit.wikimedia.org/r/c/operations/puppet/+/1219137
@Muehlenhoff @Jelto

Dec 17 2025, 4:02 PM · Patch-For-Review, collaboration-services, Beta-Cluster-Infrastructure
dancy updated the task description for T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd").
Dec 17 2025, 3:59 PM · Patch-For-Review, collaboration-services, Beta-Cluster-Infrastructure
dancy created T412975: Replace deployment-mx03 with a bookworm-based instance (was Puppet failure: "Unable to locate package spamd").
Dec 17 2025, 3:59 PM · Patch-For-Review, collaboration-services, Beta-Cluster-Infrastructure

Dec 16 2025

dancy created T412869: Buildkit v0.26.3 released.
Dec 16 2025, 10:01 PM · Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)
dancy renamed T412868: Act as if testwikis are not part of group0 from Act as if testwikis is not part of group0 to Act as if testwikis are not part of group0.
Dec 16 2025, 9:54 PM · Release-Engineering-Team (Doing 😎), Essential-Work, Scap (SpiderPig 🕸️)
dancy created T412868: Act as if testwikis are not part of group0.
Dec 16 2025, 9:50 PM · Release-Engineering-Team (Doing 😎), Essential-Work, Scap (SpiderPig 🕸️)
dancy added a comment to T408277: 1.46.0-wmf.7 deployment blockers.

@dancy 1.46.0-wmf.7 has been rolled to testwikis.

The pretrain Jenkins job failed due to an npm connection issue. I reran it; the image took some effort to get published but finally made it there. So I think you are all set to continue! 🚆

Dec 16 2025, 3:37 PM · Essential-Work, Release-Engineering-Team (Priority Backlog 📥), Release, Train Deployments

Dec 15 2025

dancy added a comment to T412265: Pushing to the docker registry fails with 500 Internal Server Error.

Here are pretrain image build and push times for the last several days:

Dec 15 2025, 4:56 PM · ServiceOps-SharedInfra, ServiceOps new, SRE, MW-on-K8s

Dec 3 2025

dancy added a comment to T411516: Add ability to ignore missing extensions in mergeMessageFileList's `--list-file` input.

Hmm now that I'm thinking about it a little more... do we run mergeMessageFileList for each wmf.N branch separately? If we run it only once and reuse the result across both branches, that would be a problem if an extension was missing in one branch but present in the other.

Dec 3 2025, 4:21 PM · User-bd808, Release-Engineering-Team (Doing 😎), Scap, MediaWiki-Internationalization
dancy closed T393128: SpiderPig UI: Add titles of backport patches to UI as Resolved.

Changes deployed via scap 4.229.0

Dec 3 2025, 4:19 PM · Essential-Work, Release-Engineering-Team (Doing 😎), Scap (SpiderPig 🕸️)

Dec 2 2025

dancy added a comment to T409519: WikibaseLexeme submodule is not available.

I'm currently blocked from updating my train-dev development environment because I can't clone https://phabricator.wikimedia.org/diffusion/NLSP/new-lexeme-special-page.git from offsite.

Dec 2 2025, 4:13 PM · Wikidata-Omega, collaboration-services, Regression, Wikidata Lexicographical data, Wikidata, Diffusion, Phabricator

Dec 1 2025

dancy added a comment to T410018: Initial group versions displayed by "scap train" can be wrong if previous scap attempt failed.

Does anyone know where https://noc.wikimedia.org sources its data?

I may be wrong but judging by https://github.com/wikimedia/operations-mediawiki-config/blob/a0f366b8b706c963ed9badac8ebe7a074a292549/src/Noc/utils.php#L9-L18, I believe it might read the wikiversions.json file that's local to the server it's running on.
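
To illustrate why reading a server-local wikiversions.json can show stale versions, here is a minimal sketch of summarizing such a file. It assumes the file is a flat JSON object mapping wiki database names to version directory names; the sample entries are made up, and `versions_in_use` is a hypothetical helper, not code from noc or scap.

```python
import json
from collections import Counter


def versions_in_use(wikiversions_text: str) -> Counter:
    """Count how many wikis point at each version in a wikiversions-style JSON document."""
    mapping = json.loads(wikiversions_text)
    return Counter(mapping.values())


if __name__ == "__main__":
    # Made-up sample data in the assumed format (dbname -> version directory).
    local_copy = json.dumps({
        "testwiki": "php-1.46.0-wmf.2",
        "enwiki": "php-1.46.0-wmf.1",
        "dewiki": "php-1.46.0-wmf.1",
    })
    for version, count in sorted(versions_in_use(local_copy).items()):
        print(version, count)
```

Whatever the summary reports reflects only the copy of the file it was given, so a host whose local file is out of date after a failed sync would report out-of-date versions.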

Dec 1 2025, 9:39 PM · Scap
dancy closed T371620: (some) Gitlab builds hanging as Resolved.

@dancy it's been a while now, but I think we just moved trafficserver to specify tags: [wmcs] for all its jobs and that worked around the issue.

Dec 1 2025, 6:11 PM · Essential-Work, Release-Engineering-Team (Doing 😎), GitLab (CI & Job Runners)
dancy added a comment to T410049: Buildkit v0.26.2 released.

@dancy Not quite. We still need the WMCS/trusted runner changes in puppet.

Dec 1 2025, 5:41 PM · Patch-For-Review, Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)
dancy added a comment to T410049: Buildkit v0.26.2 released.

@dduvall Is this ticket effectively resolved?

Dec 1 2025, 5:25 PM · Patch-For-Review, Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)
dancy added a comment to T371620: (some) Gitlab builds hanging.

@MatthewVernon Can you give me an update on the status of your gitlab builds? Have you managed to work around the original problems?

Dec 1 2025, 4:55 PM · Essential-Work, Release-Engineering-Team (Doing 😎), GitLab (CI & Job Runners)

Nov 21 2025

dancy closed T410653: Puppet failure on integration-agent-pkgbuilder-1003.integration.eqiad1.wikimedia.cloud as Resolved.

I didn't get an email notification this morning so I checked the host and it looks like puppet is working now.

Nov 21 2025, 3:49 PM · Continuous-Integration-Infrastructure
dancy changed the status of T410049: Buildkit v0.26.2 released from Open to In Progress.
Nov 21 2025, 3:48 PM · Patch-For-Review, Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)
dancy added a comment to T410680: scap publish-docs job failing.

Thanks @jnuche !

Nov 21 2025, 3:43 PM · Essential-Work, Release-Engineering-Team (Doing 😎), GitLab (CI & Job Runners)

Nov 20 2025

dancy renamed T408740: Set cpu limits and memory on gitlab-cloud-runner workloads from Set cpu limits on gitlab-cloud-runner workloads to Set cpu limits and memory on gitlab-cloud-runner workloads.
Nov 20 2025, 10:30 PM · Patch-For-Review, Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)
dancy added a comment to T410680: scap publish-docs job failing.

@jnuche Any ideas about this?

Nov 20 2025, 8:21 PM · Essential-Work, Release-Engineering-Team (Doing 😎), GitLab (CI & Job Runners)
dancy created T410680: scap publish-docs job failing.
Nov 20 2025, 5:50 PM · Essential-Work, Release-Engineering-Team (Doing 😎), GitLab (CI & Job Runners)
dancy added a comment to T409855: Document correct way to deploy security patches.

I think they're fine. I think it's more about just changing the way someone has done something for 7 or 8 years.

Nov 20 2025, 5:34 PM · Scap, Security
dancy added a comment to T409855: Document correct way to deploy security patches.

So what is the officially sanctioned or maintained way to deploy security patches, then?

I don't think there is one, but most of us on the Security-Team that deploy security patches typically do so manually.

Nov 20 2025, 4:33 PM · Scap, Security
dancy updated the task description for T410653: Puppet failure on integration-agent-pkgbuilder-1003.integration.eqiad1.wikimedia.cloud.
Nov 20 2025, 4:01 PM · Continuous-Integration-Infrastructure
dancy updated subscribers of T410653: Puppet failure on integration-agent-pkgbuilder-1003.integration.eqiad1.wikimedia.cloud.
Nov 20 2025, 4:00 PM · Continuous-Integration-Infrastructure
dancy created T410653: Puppet failure on integration-agent-pkgbuilder-1003.integration.eqiad1.wikimedia.cloud.
Nov 20 2025, 3:58 PM · Continuous-Integration-Infrastructure
dancy renamed T410049: Buildkit v0.26.2 released from Buildkit v0.26.1 released to Buildkit v0.26.2 released.
Nov 20 2025, 3:54 PM · Patch-For-Review, Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)

Nov 18 2025

dancy added a comment to T405921: "View in Alertmanager" button from PuppetAgentNoResources email does not work.

@dancy I don't believe this relates to Metrics Platform. Could I ask why you had thought it did?

Nov 18 2025, 4:41 PM · Observability-Alerting

Nov 17 2025

dancy added a comment to T396166: Are `php_fpm`/`php_version` inside `scap.cfg` used anymore?.

dancy merged https://gitlab.wikimedia.org/repos/releng/scap/-/merge_requests/864

Drop php_fpm_restart_script support

Nov 17 2025, 9:17 PM · Essential-Work, Release-Engineering-Team (Doing 😎), Patch-For-Review, Scap
dancy renamed T410049: Buildkit v0.26.2 released from Buildkit v0.26.0 released to Buildkit v0.26.1 released.
Nov 17 2025, 6:47 PM · Patch-For-Review, Essential-Work, Release-Engineering-Team, GitLab (CI & Job Runners)

Nov 14 2025

dancy added a comment to T410159: sync-gitlab-group-with-ldap - failed to create user / duplicate email.

Noting for the record that I had prior authorization from Mónica to delete the monica_wmde account and I'm syncing with her via email on the changes.

Nov 14 2025, 7:23 PM · collaboration-services, LDAP, GitLab
dancy added a comment to T410159: sync-gitlab-group-with-ldap - failed to create user / duplicate email.
Nov 14 19:06:33 gitlab1004 systemd[1]: Starting sync-gitlab-group-with-ldap.service - Sync various GitLab groups with their LDAP groups...
Nov 14 19:06:39 gitlab1004 sync-gitlab-group-with-ldap[2194617]: 2025-11-14 19:06:39,230 Collecting membership list of LDAP group wmde
Nov 14 19:06:39 gitlab1004 sync-gitlab-group-with-ldap[2194617]: 2025-11-14 19:06:39,264 Collecting member list of Gitlab group people/wmde
Nov 14 19:06:39 gitlab1004 sync-gitlab-group-with-ldap[2194617]: 2025-11-14 19:06:39,884 ldap user darthmon needs to be added to people/wmde.
Nov 14 19:06:39 gitlab1004 sync-gitlab-group-with-ldap[2194617]: 2025-11-14 19:06:39,938 user darthmon will be created in Gitlab.
Nov 14 19:06:39 gitlab1004 sync-gitlab-group-with-ldap[2194617]: 2025-11-14 19:06:39,938 There are 1 GitLab users to create.
Nov 14 19:06:39 gitlab1004 sync-gitlab-group-with-ldap[2194617]: 2025-11-14 19:06:39,938 There are 1 members to add to people/wmde.
Nov 14 19:06:39 gitlab1004 sync-gitlab-group-with-ldap[2194617]: 2025-11-14 19:06:39,938 There are 0 members to remove from people/wmde.
Nov 14 19:06:40 gitlab1004 sync-gitlab-group-with-ldap[2194617]: 2025-11-14 19:06:40,009 Creating gitlab user darthmon
Nov 14 19:06:41 gitlab1004 sync-gitlab-group-with-ldap[2194617]: 2025-11-14 19:06:41,153 Adding darthmon to group people/wmde
Nov 14 19:06:41 gitlab1004 sync-gitlab-group-with-ldap[2194617]: 2025-11-14 19:06:41,339 Sync completed.
Nov 14 2025, 7:08 PM · collaboration-services, LDAP, GitLab
dancy claimed T410159: sync-gitlab-group-with-ldap - failed to create user / duplicate email.

I'll take this one.

Nov 14 2025, 6:35 PM · collaboration-services, LDAP, GitLab
dancy added a comment to T410018: Initial group versions displayed by "scap train" can be wrong if previous scap attempt failed.

<hashar> my guess is scap thinks that because it reads the local /srv/mediawiki-staging/wikiversions.json

Nov 14 2025, 4:27 PM · Scap
dancy added a comment to T410018: Initial group versions displayed by "scap train" can be wrong if previous scap attempt failed.

https://versions.toolforge.org/ correctly showed still old 1.46.0-wmf.1 for group2 instead of 1.46.0-wmf.2.

Nov 14 2025, 4:26 PM · Scap