Page MenuHomePhabricator

Jelto (jwodstrcil)
User

Today

  • No visible events.

Tomorrow

  • No visible events.

Saturday

  • No visible events.

User Details

User Since
Jun 7 2021, 7:25 AM (257 w, 3 d)
Availability
Available
IRC Nick
jelto
LDAP User
Jelto
MediaWiki User
JWodstrcil (WMF) [ Global Accounts ]

Recent Activity

Yesterday

Jelto created T426164: Upgrade GitLab to major version 19.
Wed, May 13, 9:47 AM · Release-Engineering-Team, collaboration-services, GitLab (Infrastructure)
Jelto added a comment to T321316: Self-build and publish buildkit helper images.

Thank you very much for the detailed explanation!

Wed, May 13, 9:41 AM · Release-Engineering-Team, GitLab (CI & Job Runners), collaboration-services

Tue, May 12

Jelto added a comment to T329991: Upgrade Design/Strategy site to use Vitepress and Codex.

I manually removed the design-strategy helm release in all environments (which includes the ingress mapping for /strategy). The /strategy endpoint should serve new content from the landing page now. Let me know if you need anything else.

Tue, May 12, 3:49 PM · collaboration-services, Design-Research
Jelto moved T329991: Upgrade Design/Strategy site to use Vitepress and Codex from Incoming to Consultation on the collaboration-services board.
Tue, May 12, 3:40 PM · collaboration-services, Design-Research
Jelto added a project to T329991: Upgrade Design/Strategy site to use Vitepress and Codex: collaboration-services.
Tue, May 12, 3:29 PM · collaboration-services, Design-Research
Jelto moved T321316: Self-build and publish buildkit helper images from Backlog to Awaiting Input on the collaboration-services board.
Tue, May 12, 1:34 PM · Release-Engineering-Team, GitLab (CI & Job Runners), collaboration-services
Jelto added a comment to T321316: Self-build and publish buildkit helper images.

@dduvall blubbers buildkit frontend uses nativ llb directly with T345458: Refactor Blubber's BuildKit frontend gateway to use LLB directly. So my understanding is dockerfile-copy is no longer used/needed by blubber?

Tue, May 12, 9:29 AM · Release-Engineering-Team, GitLab (CI & Job Runners), collaboration-services
Jelto added a comment to T315877: Configure a default cleanup policy for GitLab package registry.

Yes, during the object storage migration the default deletion policy was lowered to 180 days in https://gitlab.wikimedia.org/repos/releng/gitlab-settings/-/commit/a34e01d2ca41cf3adbced35fa7e2fa20c98c384e. So "artifacts" (CI job logs) use a reasonable cleanup policy. Old CI jobs are not available anymore in GitLab for example.

Tue, May 12, 9:07 AM · GitLab (Administration, Settings & Policy), collaboration-services

Mon, May 11

Jelto claimed T315877: Configure a default cleanup policy for GitLab package registry.
Mon, May 11, 3:45 PM · GitLab (Administration, Settings & Policy), collaboration-services
Jelto claimed T321316: Self-build and publish buildkit helper images.
Mon, May 11, 3:28 PM · Release-Engineering-Team, GitLab (CI & Job Runners), collaboration-services
Jelto added a comment to T413871: Make sure GitLab does not exceed apus object storage quotas.

current quota usage:

Mon, May 11, 2:17 PM · collaboration-services, GitLab (Infrastructure)
Jelto closed T411240: SystemdUnitFailed - sync-gitlab-group-with-ldap.service on gitlab1004:9100, a subtask of T423984: Add retries, error handling and metrics to sync-gitlab-group-with-ldap, as Resolved.
Mon, May 11, 7:11 AM · collaboration-services
Jelto closed T411240: SystemdUnitFailed - sync-gitlab-group-with-ldap.service on gitlab1004:9100 as Resolved.

This issues should be addressed in T423984: Add retries, error handling and metrics to sync-gitlab-group-with-ldap

Mon, May 11, 7:11 AM · collaboration-services
Jelto added a parent task for T411240: SystemdUnitFailed - sync-gitlab-group-with-ldap.service on gitlab1004:9100: T423984: Add retries, error handling and metrics to sync-gitlab-group-with-ldap.
Mon, May 11, 7:10 AM · collaboration-services
Jelto added a subtask for T423984: Add retries, error handling and metrics to sync-gitlab-group-with-ldap: T411240: SystemdUnitFailed - sync-gitlab-group-with-ldap.service on gitlab1004:9100.
Mon, May 11, 7:10 AM · collaboration-services
Jelto closed T425875: SystemdUnitFailed (sync-gitlab-group-with-ldap.service ), a subtask of T423984: Add retries, error handling and metrics to sync-gitlab-group-with-ldap, as Resolved.
Mon, May 11, 7:10 AM · collaboration-services
Jelto closed T425875: SystemdUnitFailed (sync-gitlab-group-with-ldap.service ) as Resolved.
Mon, May 11, 7:10 AM · collaboration-services
Jelto added a subtask for T423984: Add retries, error handling and metrics to sync-gitlab-group-with-ldap: T425875: SystemdUnitFailed (sync-gitlab-group-with-ldap.service ).
Mon, May 11, 7:10 AM · collaboration-services
Jelto added a parent task for T425875: SystemdUnitFailed (sync-gitlab-group-with-ldap.service ): T423984: Add retries, error handling and metrics to sync-gitlab-group-with-ldap.
Mon, May 11, 7:10 AM · collaboration-services
Jelto renamed T425875: SystemdUnitFailed (sync-gitlab-group-with-ldap.service ) from SystemdUnitFailed to SystemdUnitFailed (sync-gitlab-group-with-ldap.service ).
Mon, May 11, 7:09 AM · collaboration-services
Jelto renamed T425872: ProbeDown (gitlab1004) from ProbeDown to ProbeDown (gitlab1004).
Mon, May 11, 7:09 AM · collaboration-services
Jelto added a comment to T425875: SystemdUnitFailed (sync-gitlab-group-with-ldap.service ).

The script failed because GitLab returned a 500 error:

Mon, May 11, 7:08 AM · collaboration-services
Jelto added a parent task for T425872: ProbeDown (gitlab1004): Unknown Object (Task).
Mon, May 11, 7:04 AM · collaboration-services
Jelto closed T425872: ProbeDown (gitlab1004) as Resolved.

There was a short request spike: https://grafana.wikimedia.org/d/R_1IvBZnz/gitlab-omnibus-overview?orgId=1&refresh=1m&from=2026-05-10T08:40:22.909Z&to=2026-05-10T16:48:59.965Z&timezone=utc&var-node=gitlab1004.

Mon, May 11, 7:04 AM · collaboration-services

Fri, May 8

Jelto closed T425650: Codesearch is down (2026-05-07) as Resolved.

@A_smart_kitten thank you for reporting the issue. I'll resolve the task, codesearch is back and works normally.

Fri, May 8, 6:55 AM · collaboration-services, VPS-project-Codesearch

Thu, May 7

Jelto added a project to T425650: Codesearch is down (2026-05-07): collaboration-services.
Thu, May 7, 1:39 PM · collaboration-services, VPS-project-Codesearch
Jelto created T425667: Investigate Gerrit root disk usage and logging.
Thu, May 7, 12:30 PM · Gerrit, collaboration-services

Wed, May 6

Jelto updated the task description for T425562: Requesting GitLab account activation for Lenap.
Wed, May 6, 2:58 PM · Essential-Work, GitLab (Account Approval), Release-Engineering-Team
Jelto added a comment to T425562: Requesting GitLab account activation for Lenap.

@Lenap is known to me and I can vouch for the user. I'll let @Aklapper or @brennen approve the account in GitLab.

Wed, May 6, 2:58 PM · Essential-Work, GitLab (Account Approval), Release-Engineering-Team
Jelto added a comment to T361090: Move k8s miscweb blackbox checks out of microsites puppet module.

^ this happened again in T425476

Wed, May 6, 8:42 AM · collaboration-services
Jelto closed T425166: ProbeDown - phab1004 as Resolved.

Alert resolved after 5 minutes. The traffic was a short spike from known sources (seen before):

Wed, May 6, 8:32 AM · collaboration-services

Thu, Apr 30

Jelto closed T333143: Move Gerrit data out of root partition, a subtask of T372804: setup gerrit2003 with gerrit service (gerrit on bookworm), as Resolved.
Thu, Apr 30, 8:06 AM · Patch-For-Review, SRE, collaboration-services
Jelto closed T333143: Move Gerrit data out of root partition, a subtask of T423027: 2026-04-12 Gerrit Outage (was: DiskSpace), as Resolved.
Thu, Apr 30, 8:06 AM · Wikimedia-Incident, Gerrit, collaboration-services
Jelto closed T333143: Move Gerrit data out of root partition as Resolved.

All hosts are migrated and cleanup has happened on all hosts, I'll resolve the task.

Thu, Apr 30, 8:06 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto updated the task description for T333143: Move Gerrit data out of root partition.
Thu, Apr 30, 8:04 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto closed T424924: GerritHAProxyBackendUnavailable, a subtask of T333143: Move Gerrit data out of root partition, as Resolved.
Thu, Apr 30, 8:03 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto closed T424924: GerritHAProxyBackendUnavailable as Resolved.

This happened because of a service restart in gerrit in T333143: Move Gerrit data out of root partition. Resolved after a few minutes.

Thu, Apr 30, 8:03 AM · collaboration-services
Jelto added a subtask for T333143: Move Gerrit data out of root partition: T424924: GerritHAProxyBackendUnavailable.
Thu, Apr 30, 7:59 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto added a parent task for T424924: GerritHAProxyBackendUnavailable: T333143: Move Gerrit data out of root partition.
Thu, Apr 30, 7:59 AM · collaboration-services

Wed, Apr 29

Jelto merged T424320: CertAlmostExpired - Certificate for service planet2003:443 is about to expire into T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.
Wed, Apr 29, 1:13 PM · collaboration-services
Jelto merged task T424320: CertAlmostExpired - Certificate for service planet2003:443 is about to expire into T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.
Wed, Apr 29, 1:12 PM · collaboration-services
Jelto updated the task description for T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.
Wed, Apr 29, 1:05 PM · collaboration-services
Jelto closed T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate, a subtask of T420993: Rotate discovery intermediate certificate (expires 2026-05-03), as Resolved.
Wed, Apr 29, 1:05 PM · ServiceOps new, Infrastructure-Foundations, Patch-For-Review
Jelto closed T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate as Resolved.

All hosts are done, I'll resolve this task. Thank you @MoritzMuehlenhoff for the detailed checklist!

Wed, Apr 29, 1:05 PM · collaboration-services
Jelto updated the task description for T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.
Wed, Apr 29, 12:54 PM · collaboration-services
Jelto added a comment to T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.

releases done and it's using the new discovery2026 cert:

Wed, Apr 29, 12:53 PM · collaboration-services
Jelto updated the task description for T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.
Wed, Apr 29, 12:32 PM · collaboration-services
Jelto added a comment to T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.

doc hosts done and it's using the new discovery2026 cert:

Wed, Apr 29, 12:32 PM · collaboration-services
Jelto closed T424833: ProbeDown (gitlab1004:443) as Resolved.

resolved after 5 minutes and GitLab was responding normally for me. I linked this to T423985.

Wed, Apr 29, 11:52 AM · collaboration-services
Jelto added a parent task for T424833: ProbeDown (gitlab1004:443): Unknown Object (Task).
Wed, Apr 29, 11:51 AM · collaboration-services
Jelto renamed T424833: ProbeDown (gitlab1004:443) from ProbeDown to ProbeDown (gitlab1004:443).
Wed, Apr 29, 11:51 AM · collaboration-services
Jelto updated the task description for T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.
Wed, Apr 29, 11:50 AM · collaboration-services
Jelto added a comment to T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.

peopeweb done and it's using the new discovery2026 cert:

Wed, Apr 29, 11:50 AM · collaboration-services
Jelto updated the task description for T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.
Wed, Apr 29, 11:36 AM · collaboration-services
Jelto added a comment to T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.

Phabricator done and it's using the new discovery2026 cert:

Wed, Apr 29, 11:35 AM · collaboration-services
Jelto added a comment to T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.

Aphlict done and it's using the new discovery2026 cert:

Wed, Apr 29, 11:20 AM · collaboration-services
Jelto updated the task description for T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.
Wed, Apr 29, 11:18 AM · collaboration-services
Jelto updated the task description for T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.
Wed, Apr 29, 11:09 AM · collaboration-services
Jelto added a comment to T424669: Migrate Collab Envoy TLS proxy services to the 2026 discovery intermediate.

Etherpad done and it's using the new discovery2026 cert:

Wed, Apr 29, 11:09 AM · collaboration-services
Jelto added a comment to T424239: SystemdUnitFailed - backup-restore.service on gitlab2002:9100.

The sre.gitlab.upgrade cookbook creates a downtime now for the gitlab-backup-restore.service:

Wed, Apr 29, 9:46 AM · collaboration-services

Thu, Apr 23

Jelto closed T423027: 2026-04-12 Gerrit Outage (was: DiskSpace) as Resolved.

I'll resolve this task. Sub-tasks have been open for the follow up action items. The successful migration in T333143: Move Gerrit data out of root partition will prevent the root disk from filling up again with Gerrit cache files. We are also looking into improving the existing documentation and alerting (thanks @ABran-WMF ).

Thu, Apr 23, 2:32 PM · Wikimedia-Incident, Gerrit, collaboration-services
Jelto added a comment to T333143: Move Gerrit data out of root partition.

All hosts are migrated and cleanup has happened on gerrit-replica and gerrit-spare. I'll wait until next week for the gerrit production host cleanup (removing the symlink and the temporary backup folder /srv/gerrit/var-lib-gerrit-backup).

Thu, Apr 23, 9:16 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto renamed T424165: ProbeDown (etherpad1004) from ProbeDown to ProbeDown (etherpad1004).
Thu, Apr 23, 9:10 AM · collaboration-services
Jelto closed T424165: ProbeDown (etherpad1004) as Resolved.

the alert resolved after 5 minutes, I'll not investigate this

Thu, Apr 23, 9:10 AM · collaboration-services
Jelto updated the task description for T333143: Move Gerrit data out of root partition.
Thu, Apr 23, 9:09 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto updated the task description for T333143: Move Gerrit data out of root partition.
Thu, Apr 23, 8:18 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto updated the task description for T333143: Move Gerrit data out of root partition.
Thu, Apr 23, 6:29 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto added a comment to T333143: Move Gerrit data out of root partition.

Data on gerrit2003 production gerrit has also been moved to /srv/gerrit.

Thu, Apr 23, 6:29 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto added a comment to T333143: Move Gerrit data out of root partition.

Runbook for the production host:

Thu, Apr 23, 5:45 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit

Tue, Apr 21

Jelto updated the task description for T333143: Move Gerrit data out of root partition.
Tue, Apr 21, 11:14 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto added a comment to T333143: Move Gerrit data out of root partition.

The replica host gerrit1003 has been migrated to /srv/gerrit.

Tue, Apr 21, 11:14 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto added a comment to T333143: Move Gerrit data out of root partition.

The alert mentioned above is resolved after removing the manually created systemd unit file in /etc/systemd/system/gerrit.service on gerrit2002. All gerrit hosts use the puppet-managed unit file again.

Tue, Apr 21, 8:03 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit

Mon, Apr 20

Jelto added a project to T403746: Support IPv6 on WMCS hosted runners: collaboration-services.
Mon, Apr 20, 3:51 PM · collaboration-services, IPv6, GitLab (CI & Job Runners)
Jelto added a comment to T421726: Wikimedia gerrit load management 429s break fresh-install.

I can't see the patches above and don't have the full context, but I think the user agent in https://requestctl.wikimedia.org/pattern/ua/gerrit_gitiles_legitimate_access_user_agent has to be escaped properly

Mon, Apr 20, 2:30 PM · collaboration-services, Gerrit, Fresh
Jelto added a comment to T333143: Move Gerrit data out of root partition.

There is an active incinga alert for the migrated spare host:

Mon, Apr 20, 12:50 PM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto updated the task description for T333143: Move Gerrit data out of root partition.
Mon, Apr 20, 12:44 PM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit

Thu, Apr 16

Jelto claimed T333143: Move Gerrit data out of root partition.

I successfully migrated the spare host gitlab2002 to /srv/gerrit.

Thu, Apr 16, 3:03 PM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto updated the task description for T423601: Update and improve operation runbooks and documentation for Gerrit.
Thu, Apr 16, 2:20 PM · Documentation, Sustainability (Incident Followup), Gerrit, collaboration-services
Jelto updated the task description for T423027: 2026-04-12 Gerrit Outage (was: DiskSpace).
Thu, Apr 16, 2:20 PM · Wikimedia-Incident, Gerrit, collaboration-services
Jelto created T423601: Update and improve operation runbooks and documentation for Gerrit.
Thu, Apr 16, 2:19 PM · Documentation, Sustainability (Incident Followup), Gerrit, collaboration-services
Jelto added a comment to T333143: Move Gerrit data out of root partition.

I did another migration attempt and puppet keeps re-creating the /var/lib/gerrit even when profile::gerrit::gerrit_site: /srv/gerrit/site_path is set. This happenes because it's hardcoded in init.pp:

Thu, Apr 16, 1:45 PM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto added a comment to T333143: Move Gerrit data out of root partition.

I've tested the migration snippet on the spare instance gerrit2002. Beside fixing a typo there is a problem with puppet re-creating the /var/lib/gerrit folder:

Thu, Apr 16, 10:29 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit

Apr 14 2026

Jelto added a comment to T333143: Move Gerrit data out of root partition.

Suggested run book to move remaining data:

prep:

  • mkdir /srv/gerrit/site_path (we need/should use some new directory and this is what that is, /var/lib/gerrit (or previously /var/lib/gerrit2 is the default Gerrit Site Path.
  • pre-rsync data from /var/lib/gerrit/ to /srv/gerrit/site_path/

migrate:

  • stop gerrit
  • rsync data from /var/lib/gerrit/ to /srv/gerrit/site_path/ one more time
  • mv /var/lib/gerrit /srv/gerrit/var-lib-gerrit-backup (just in case, but don't forget it forever, /var/lib/gerrit needs to move out of the way though, or we simply "mv /var/lib/gerrit /srv/gerrit/site_path" and forget rsync!?)
  • mount --bind /srv/gerrit/site_path /var/lib/gerrit (alternative: ln -s /srv/gerrit/site_path /var/lib/gerrit)
  • start gerrit

?

Apr 14 2026, 8:13 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto closed T423234: GitlabPackagePullerFailedOnPrepare as Resolved.

This happened because of permission errors in https://gitlab.wikimedia.org/repos/projects/wmf-navigator, related to T414405.

Apr 14 2026, 7:51 AM · collaboration-services

Apr 13 2026

Jelto moved T423027: 2026-04-12 Gerrit Outage (was: DiskSpace) from Incoming to Work in Progress on the collaboration-services board.
Apr 13 2026, 3:55 PM · Wikimedia-Incident, Gerrit, collaboration-services
Jelto closed T422954: SystemdUnitFailed (prometheus-ethtool-exporter.service on gerrit2003:9100) as Resolved.

This alert recovered, probably related to T423027 or previous problems. I'll resolve the task.

Apr 13 2026, 1:26 PM · collaboration-services
Jelto updated the task description for T423027: 2026-04-12 Gerrit Outage (was: DiskSpace).
Apr 13 2026, 12:38 PM · Wikimedia-Incident, Gerrit, collaboration-services
Jelto added a project to T423123: Alert when Gerrit CI (Zuul, Jenkins, Gearman) is down/stuck: Sustainability (Incident Followup).
Apr 13 2026, 12:38 PM · Sustainability (Incident Followup), Gerrit, Release-Engineering-Team, collaboration-services
Jelto created T423123: Alert when Gerrit CI (Zuul, Jenkins, Gearman) is down/stuck.
Apr 13 2026, 12:37 PM · Sustainability (Incident Followup), Gerrit, Release-Engineering-Team, collaboration-services
Jelto added a comment to T333143: Move Gerrit data out of root partition.

This has caused disk space issues again in T423027. This task is already tagged with Sustainability (Incident Followup) .

Apr 13 2026, 9:27 AM · Patch-For-Review, Sustainability (Incident Followup), Release-Engineering-Team, collaboration-services, Gerrit
Jelto updated the task description for T423027: 2026-04-12 Gerrit Outage (was: DiskSpace).
Apr 13 2026, 9:20 AM · Wikimedia-Incident, Gerrit, collaboration-services
Jelto claimed T423027: 2026-04-12 Gerrit Outage (was: DiskSpace).

Thank you to everyone who troubleshooted and mitigated this issue on a weekend.

Apr 13 2026, 9:12 AM · Wikimedia-Incident, Gerrit, collaboration-services
Jelto added a comment to T423035: Enable paging for Gerrit (was: Gerrit outage didn't page until 4.5 hours after the first alert).

thank you for opening this task. For additional context: paging alerts for Gerrit were also previously discussed in T365148.

Apr 13 2026, 7:17 AM · Sustainability (Incident Followup), observability, collaboration-services
Jelto closed T423034: ProbeDown (gerrit2003:443) as Resolved.

This issue is tracked in T423027, I'll close this task.

Apr 13 2026, 7:14 AM · Release-Engineering-Team, collaboration-services
Jelto renamed T423034: ProbeDown (gerrit2003:443) from ProbeDown to ProbeDown (gerrit2003:443).
Apr 13 2026, 7:12 AM · Release-Engineering-Team, collaboration-services
Jelto renamed T422954: SystemdUnitFailed (prometheus-ethtool-exporter.service on gerrit2003:9100) from SystemdUnitFailed to SystemdUnitFailed (prometheus-ethtool-exporter.service on gerrit2003:9100).
Apr 13 2026, 7:11 AM · collaboration-services
Jelto merged task T422983: ProbeDown into Restricted Task.
Apr 13 2026, 7:10 AM · collaboration-services
Jelto merged task T422982: ProbeDown into Restricted Task.
Apr 13 2026, 7:10 AM · collaboration-services

Apr 10 2026

Jelto closed T422858: ferm problem on gitlab test instance as Resolved.

Puppet is happy on the test instance now.

Apr 10 2026, 8:25 AM · collaboration-services
Jelto claimed T422858: ferm problem on gitlab test instance.

I think this is related to enabling QoS for rsync in production: https://gerrit.wikimedia.org/r/c/operations/puppet/+/1234984

Apr 10 2026, 8:07 AM · collaboration-services