JMeybohm
User

User Details

User Since
Apr 2 2020, 9:01 AM (305 w, 1 d)
Availability
Available
IRC Nick
jayme
LDAP User
JMeybohm
MediaWiki User
JMeybohm (WMF) [ Global Accounts ]

Recent Activity

Today

JMeybohm added a comment to T397685: helmfile/scap does not reliably bootstrap mediawiki.

Thank you all for untangling and documenting this!
I would like to suggest uncoupling this from the k8s upgrade procedure. It surfaced there, but it is actually a mediawiki bootstrapping problem that might bite us in disaster recovery or similar scenarios as well. I'm not totally sure about this, but if scap was capable of bootstrapping mediawiki in the past, shouldn't it still be able to do so? The comparison is probably bad since we were running support-releases outside of scap's reach in the past (like statsd-exporter, for example), but it also feels off to have to maintain knowledge about what to do when (like the list of mw namespaces and support releases) in multiple places (scap and wikitech/charlie/...).

Fri, Feb 6, 5:35 PM · ServiceOps-Mediawiki, ServiceOps new, MW-on-K8s, Release-Engineering-Team, Scap
JMeybohm added a comment to T303744: Keep track of teams responsible for namespaces inside kubernetes.

I don't think it makes much sense to maintain a list of namespaces where something is deployed inside the artifact that is being deployed.
My aim here was to add an annotation or label to the namespace objects in kubernetes. This could easily be done during namespace creation in https://gerrit.wikimedia.org/r/plugins/gitiles/operations/deployment-charts/+/refs/heads/master/helmfile.d/admin_ng/values/main.yaml#14 and https://gerrit.wikimedia.org/r/plugins/gitiles/operations/deployment-charts/+/refs/heads/master/helmfile.d/admin_ng/helmfile_namespaces.yaml. The hard part is to figure out the identifier to add (phab tag, team name, .. ideally not something that changes every quarter) and the group that is actually responsible (as usual).
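A minimal sketch of what a labelled namespace object could look like, written in Python for illustration only. The label key `example.org/owner` and the team name are hypothetical; the task has not settled on an identifier, and the real namespaces are created through the helmfile values linked above, not through code like this.

```python
import json

# Hypothetical label key; the actual identifier is still an open question.
OWNER_LABEL = "example.org/owner"


def namespace_manifest(name: str, owner: str) -> dict:
    """Return a Kubernetes Namespace manifest that carries an ownership label."""
    return {
        "apiVersion": "v1",
        "kind": "Namespace",
        "metadata": {
            "name": name,
            "labels": {OWNER_LABEL: owner},
        },
    }


if __name__ == "__main__":
    # Placeholder namespace and team names, for illustration only.
    print(json.dumps(namespace_manifest("example-service", "example-team"), indent=2))
```

Keeping the owner on the namespace object itself means the information lives next to the thing it describes rather than inside the deployed artifact.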

Fri, Feb 6, 4:32 PM · Serviceops-easywins, ServiceOps new, Prod-Kubernetes
JMeybohm added a comment to T416580: Kserve helm chart.

We have a rough documentation about our policy and process around adopting upstream helm charts which can be found here: https://wikitech.wikimedia.org/wiki/Kubernetes/Upstream_Helm_charts_policy

Fri, Feb 6, 3:57 PM · Charts, Kubernetes, SRE

Yesterday

JMeybohm added a comment to T412693: Ensure all Chart.yaml files include required metadata fields.

It would be great, but also something for the future. We have two options here, either put a dummy entry for the "orphaned" services, and add CI checks for this, or wait to do so after every chart has an owner.
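A rough sketch of what such a CI check might do, assuming the Chart.yaml content has already been parsed into a dictionary. The field names in `REQUIRED_FIELDS` and the "orphaned" dummy entry are assumptions made for illustration; the task does not list the actual required fields here.

```python
# Hypothetical set of required fields; the real list is defined by the task, not here.
REQUIRED_FIELDS = ("name", "version", "maintainers")


def missing_fields(chart: dict) -> list:
    """Return the required fields that are absent or empty in parsed Chart.yaml data."""
    return [field for field in REQUIRED_FIELDS if not chart.get(field)]


if __name__ == "__main__":
    # A chart without an owner fails the check ...
    print(missing_fields({"name": "example", "version": "0.1.0"}))
    # ... while a dummy entry for an "orphaned" service lets it pass.
    print(missing_fields({
        "name": "example",
        "version": "0.1.0",
        "maintainers": [{"name": "orphaned"}],
    }))
```

With the dummy-entry option the check can be enforced right away; with the other option it would only be enabled once every chart has an owner.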

Thu, Feb 5, 4:04 PM · ServiceOps-Services-Oids, Kubernetes, ServiceOps new
JMeybohm reopened T303744: Keep track of teams responsible for namespaces inside kubernetes as "Open".

I disagree with this being a duplicate of T412693: Ensure all Chart.yaml files include required metadata fields, as that one aims at chart ownership while this task aims at ownership for k8s namespaces (or groups of deployments if that makes more sense). Those might contain deployments of charts maintained by the same group/team - but that's not a requirement.

Thu, Feb 5, 4:02 PM · Serviceops-easywins, ServiceOps new, Prod-Kubernetes
JMeybohm moved T383553: Set cert-manager leader election namespace to cert-manager from Needs Info / Blocked to Radar on the ServiceOps new board.

This is done for all wikikube clusters, so we can move this to radar on our side. Still relevant for:

Thu, Feb 5, 10:06 AM · Machine-Learning-Team, Infrastructure-Foundations, ServiceOps new, Data-Platform-SRE, Kubernetes, Prod-Kubernetes

Tue, Jan 20

JMeybohm added a project to T399729: Clean up buster docker images: ServiceOps-SharedInfra.
Tue, Jan 20, 2:53 PM · ServiceOps-SharedInfra, ServiceOps new
JMeybohm renamed T399729: Clean up buster docker images from Clean up buster docker images to Clean up buster docker images.
Tue, Jan 20, 2:51 PM · ServiceOps-SharedInfra, ServiceOps new
JMeybohm triaged T399729: Clean up buster docker images as Medium priority.
Tue, Jan 20, 2:49 PM · ServiceOps-SharedInfra, ServiceOps new
JMeybohm edited projects for T399158: Alert in need of triage: OsmSynchronisationLag (instance maps-test2001:9100), added: SRE; removed serviceops.

Removing serviceops since we won't be working on this.

Tue, Jan 20, 2:48 PM · Infrastructure-Foundations, SRE, Maps, sre-alert-triage
JMeybohm added a comment to T352956: Handling inbound IPIP traffic on low traffic LVS k8s based realservers.

Summarizing the current state and our recent discussion about this:

Tue, Jan 20, 11:36 AM · ServiceOps new, Patch-For-Review, Prod-Kubernetes, Kubernetes, Traffic
JMeybohm added a comment to T415029: Library restart detection is very slow in Kubernetes workers.

I've already tried to make lsof exclude a bunch of mountpoints with:
lsof -nXd DEL $(findmnt -t tmpfs,nsfs,overlay,proc,^Csfs,cgroup2,devtmpfs,devpts,securityfs,pstore,bpf,hugetlbfs,mqueue,debugfs,tracefs,fusectl,configfs,ramfs, -o TARGET -n --list | sed 's/^/-e /')
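The same idea can be sketched in Python: list the mount targets with `findmnt` and turn each one into an `-e <path>` pair for `lsof`. The helper names and the filesystem types passed in are illustrative assumptions, not the tooling actually used for restart detection.

```python
import subprocess


def exclude_args(mount_targets):
    """Turn mount points into lsof '-e <path>' argument pairs."""
    args = []
    for target in mount_targets:
        args += ["-e", target]
    return args


def list_mount_targets(fstypes):
    """Ask findmnt for the mount targets of the given filesystem types."""
    result = subprocess.run(
        ["findmnt", "-t", ",".join(fstypes), "-o", "TARGET", "-n", "--list"],
        capture_output=True, text=True, check=True,
    )
    return [line for line in result.stdout.splitlines() if line]


def lsof_command(fstypes):
    """Build the lsof argument list, excluding mounts of the given types."""
    return ["lsof", "-nXd", "DEL"] + exclude_args(list_mount_targets(fstypes))


if __name__ == "__main__":
    # Only the argument construction is shown; no external command is run here.
    print(exclude_args(["/proc", "/run"]))
```

Passing the arguments as a list avoids the shell word splitting that the `$(...)` form is subject to when a mount path contains whitespace.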

Tue, Jan 20, 10:48 AM · Infrastructure-Foundations, ServiceOps new

Mon, Jan 19

JMeybohm moved T412805: Migrate ipblocks from fetch_external_clouds_vendors_nets.py to HIDDENPARMA from Scheduled (this Q) to In Progress on the ServiceOps new board.
Mon, Jan 19, 4:27 PM · ServiceOps new, Patch-For-Review, SRE
JMeybohm closed T341984: Update Kubernetes clusters to 1.31 as Resolved.

I'm resolving this since we have updated the wikikube clusters quite some time ago and remaining work, cleanups etc. will be handled in subtasks.

Mon, Jan 19, 10:38 AM · Data-Platform-SRE (2026.01.05 - 2026.01.23), Epic, ServiceOps new, Patch-For-Review, collaboration-services, Kubernetes, Prod-Kubernetes
JMeybohm edited projects for T341984: Update Kubernetes clusters to 1.31, added: ServiceOps new, Epic; removed serviceops.
Mon, Jan 19, 10:37 AM · Data-Platform-SRE (2026.01.05 - 2026.01.23), Epic, ServiceOps new, Patch-For-Review, collaboration-services, Kubernetes, Prod-Kubernetes
JMeybohm moved T374366: Race condition in iptables rules during puppet runs on k8s nodes from Inbox to Backlog on the ServiceOps new board.
Mon, Jan 19, 10:36 AM · ServiceOps new, Kubernetes, Prod-Kubernetes
JMeybohm edited projects for T374366: Race condition in iptables rules during puppet runs on k8s nodes, added: ServiceOps new; removed serviceops.
Mon, Jan 19, 10:36 AM · ServiceOps new, Kubernetes, Prod-Kubernetes
JMeybohm added projects to T400100: FY 25/26 WE 5.4.2: Known bots / clients: ServiceOps new, Epic.
Mon, Jan 19, 10:35 AM · Epic, ServiceOps new, SRE
JMeybohm closed T404591: requestctl support to enable/disable ipblocks as Resolved.
Mon, Jan 19, 10:33 AM · ServiceOps new, SRE
JMeybohm closed T404591: requestctl support to enable/disable ipblocks, a subtask of T400100: FY 25/26 WE 5.4.2: Known bots / clients, as Resolved.
Mon, Jan 19, 10:33 AM · Epic, ServiceOps new, SRE

Thu, Jan 15

JMeybohm moved T414576: Failing docker registry httpbb tests from Inbox to Radar on the ServiceOps new board.
Thu, Jan 15, 1:24 PM · Kubernetes, ServiceOps new, SRE
JMeybohm assigned T414576: Failing docker registry httpbb tests to DPogorzelski-WMF.

The X-Cache-Status failures are gone now:

jayme@cumin1003:~$ sudo httpbb /srv/deployment/httpbb-tests/docker-registry/test_docker-registry.yaml --hosts 'registry2004.codfw.wmnet'
Sending to registry2004.codfw.wmnet...
https://docker-registry.wikimedia.org/v2/ml/nonexistent/manifests/latest (/srv/deployment/httpbb-tests/docker-registry/test_docker-registry.yaml:106)
    Status code: expected 404, got 401.
https://docker-registry.wikimedia.org/v2/ml/nonexistent/blobs/upload (/srv/deployment/httpbb-tests/docker-registry/test_docker-registry.yaml:110)
    Status code: expected 404, got 401.
===
FAIL: 22 requests sent to registry2004.codfw.wmnet. 2 requests with failed assertions.
Thu, Jan 15, 1:23 PM · Kubernetes, ServiceOps new, SRE
JMeybohm closed T256762: Fix nginx config and caching for docker registry , a subtask of T209271: improve docker registry architecture, as Resolved.
Thu, Jan 15, 12:27 PM · User-fsero, serviceops, Prod-Kubernetes, Kubernetes, SRE
JMeybohm closed T256762: Fix nginx config and caching for docker registry as Resolved.

Since there is clearly no need for optimization here, I'll resolve this now.

Thu, Jan 15, 12:27 PM · serviceops, Kubernetes, SRE
JMeybohm renamed T414576: Failing docker registry httpbb tests from Failing docker registry tests to Failing docker registry httpbb tests.
Thu, Jan 15, 10:19 AM · Kubernetes, ServiceOps new, SRE
JMeybohm triaged T414576: Failing docker registry httpbb tests as Medium priority.

The 403 vs. 401 or 404 are the result of the tests being run against a read-only (profile::docker_registry::read_only_mode) instance of the registry. I have updated the wikitech page accordingly.

Thu, Jan 15, 10:19 AM · Kubernetes, ServiceOps new, SRE
JMeybohm added a project to T414375: Grant Access to analytics-privatedata-users for hmonroy: Data-Platform-SRE.

@JMeybohm Hi! I'm trying to query wmf.mediawiki_history in superset. I'm getting: mysql error: SELECT command denied to user 'research'@'10.67.28.86' for table `wmf`.`mediawiki_history`

What access level would I need in order to pull data from wmf.mediawiki_history ?

I would assume this is not related to your personal account (since the SQL query is clearly done as the research user). Data-Platform-SRE || Data-Engineering can you help with this?

Thu, Jan 15, 9:30 AM · Essential-Work, Data-Platform-SRE (2026.01.05 - 2026.01.23), SRE, SRE-Access-Requests, Data-Engineering
JMeybohm assigned T414619: Yubikey-SSH-FIDO access for dduvall to MoritzMuehlenhoff.

@MoritzMuehlenhoff assigning to you so the next clinic duty person knows you're working on this with Dan, thanks

Thu, Jan 15, 9:22 AM · SRE, SRE-Access-Requests

Wed, Jan 14

JMeybohm closed T414492: Grant Access to analytics-privatedata-users for HFanWMF as Resolved.

I have added you to the analytics-privatedata-users group. If that does not grant you the required privileges, please take a look at https://wikitech.wikimedia.org/wiki/Data_Platform/Data_access#Access_Levels and try to figure out (maybe with help from Data-Engineering folks) what access level you require.

Wed, Jan 14, 4:20 PM · SRE-Access-Requests, SRE
JMeybohm moved T412951: Move the docker registry's /restricted prefix to Docker Distribution backed up by Ceph from Inbox to Backlog on the ServiceOps new board.
Wed, Jan 14, 4:16 PM · Patch-For-Review, Epic, Kubernetes, ServiceOps new, Release-Engineering-Team (Radar), Ceph, SRE-swift-storage
JMeybohm triaged T412951: Move the docker registry's /restricted prefix to Docker Distribution backed up by Ceph as High priority.
Wed, Jan 14, 4:16 PM · Patch-For-Review, Epic, Kubernetes, ServiceOps new, Release-Engineering-Team (Radar), Ceph, SRE-swift-storage
JMeybohm moved T412947: Reduce cache miss noise in memcached due to hcaptcha health checks from Inbox to In Progress on the ServiceOps new board.
Wed, Jan 14, 4:13 PM · ServiceOps-Datastores, ServiceOps new, Product Safety and Integrity, ConfirmEdit (CAPTCHA extension), WE4.2 Bot detection (WE4.2 hCaptcha editing trial)
JMeybohm triaged T412947: Reduce cache miss noise in memcached due to hcaptcha health checks as Medium priority.
Wed, Jan 14, 4:12 PM · ServiceOps-Datastores, ServiceOps new, Product Safety and Integrity, ConfirmEdit (CAPTCHA extension), WE4.2 Bot detection (WE4.2 hCaptcha editing trial)
JMeybohm moved T412818: Add configuration for MESH_CHECK_SKIP in periodic job puppet definition from In Progress to Done on the ServiceOps new board.
Wed, Jan 14, 4:12 PM · ServiceOps-Mediawiki, ServiceOps new, Patch-For-Review
JMeybohm moved T412941: Proposal: scap deploy-service from Inbox to Needs Info / Blocked on the ServiceOps new board.
Wed, Jan 14, 4:06 PM · User-jijiki, Epic, ServiceOps new, Scap, Release-Engineering-Team
JMeybohm triaged T412941: Proposal: scap deploy-service as Low priority.
Wed, Jan 14, 4:06 PM · User-jijiki, Epic, ServiceOps new, Scap, Release-Engineering-Team
JMeybohm added a comment to T412818: Add configuration for MESH_CHECK_SKIP in periodic job puppet definition.

@Clement_Goubert this looks done, is it?

Wed, Jan 14, 4:03 PM · ServiceOps-Mediawiki, ServiceOps new, Patch-For-Review
JMeybohm moved T412818: Add configuration for MESH_CHECK_SKIP in periodic job puppet definition from Inbox to In Progress on the ServiceOps new board.
Wed, Jan 14, 4:02 PM · ServiceOps-Mediawiki, ServiceOps new, Patch-For-Review
JMeybohm edited projects for T412818: Add configuration for MESH_CHECK_SKIP in periodic job puppet definition, added: ServiceOps new, ServiceOps-Mediawiki; removed serviceops, MW-on-K8s.
Wed, Jan 14, 4:02 PM · ServiceOps-Mediawiki, ServiceOps new, Patch-For-Review
JMeybohm moved T412801: Proof of Concept: Train Health Dashboard from Inbox to Backlog on the ServiceOps new board.
Wed, Jan 14, 4:00 PM · Incident Tooling, ServiceOps new, ServiceOps-Mediawiki, Release-Engineering-Team
JMeybohm added a project to T412801: Proof of Concept: Train Health Dashboard: ServiceOps new.
Wed, Jan 14, 3:59 PM · Incident Tooling, ServiceOps new, ServiceOps-Mediawiki, Release-Engineering-Team
JMeybohm triaged T412801: Proof of Concept: Train Health Dashboard as Low priority.
Wed, Jan 14, 3:59 PM · Incident Tooling, ServiceOps new, ServiceOps-Mediawiki, Release-Engineering-Team
JMeybohm updated the task description for T413364: Requesting access to analytics-privatedata-users for kareid.
Wed, Jan 14, 1:59 PM · Data-Engineering, SRE-Access-Requests, SRE
JMeybohm renamed T414492: Grant Access to analytics-privatedata-users for HFanWMF from Grant Access to WMF(?) for HFanWMF to Grant Access to analytics-privatedata-users for HFanWMF.
Wed, Jan 14, 1:40 PM · SRE-Access-Requests, SRE
JMeybohm assigned T413364: Requesting access to analytics-privatedata-users for kareid to thcipriani.

@thcipriani this needs sign-off from you as the approver for the deployment group

Wed, Jan 14, 1:31 PM · Data-Engineering, SRE-Access-Requests, SRE
JMeybohm updated the task description for T413364: Requesting access to analytics-privatedata-users for kareid.
Wed, Jan 14, 1:27 PM · Data-Engineering, SRE-Access-Requests, SRE
JMeybohm placed T413364: Requesting access to analytics-privatedata-users for kareid up for grabs.
Wed, Jan 14, 1:20 PM · Data-Engineering, SRE-Access-Requests, SRE
JMeybohm moved T414375: Grant Access to analytics-privatedata-users for hmonroy from Untriaged to Awaiting User Input on the SRE-Access-Requests board.
Wed, Jan 14, 1:02 PM · Essential-Work, Data-Platform-SRE (2026.01.05 - 2026.01.23), SRE, SRE-Access-Requests, Data-Engineering

Tue, Jan 13

JMeybohm updated the task description for T414484: Upgrade DSE clusters to kubernetes 1.31.
Tue, Jan 13, 4:21 PM · Data-Platform-SRE (2026.01.23 - 2026.02.13), Essential-Work, Kubernetes, Prod-Kubernetes
JMeybohm updated the task description for T414485: Upgrade ML clusters to kubernetes 1.31.
Tue, Jan 13, 4:21 PM · Machine-Learning-Team, Kubernetes, Prod-Kubernetes
JMeybohm updated the task description for T414486: Upgrade AUX clusters to kubernetes 1.31.
Tue, Jan 13, 4:21 PM · Infrastructure-Foundations, Kubernetes, Prod-Kubernetes
JMeybohm created T414486: Upgrade AUX clusters to kubernetes 1.31.
Tue, Jan 13, 4:20 PM · Infrastructure-Foundations, Kubernetes, Prod-Kubernetes
JMeybohm created T414485: Upgrade ML clusters to kubernetes 1.31.
Tue, Jan 13, 4:19 PM · Machine-Learning-Team, Kubernetes, Prod-Kubernetes
JMeybohm updated the task description for T414484: Upgrade DSE clusters to kubernetes 1.31.
Tue, Jan 13, 4:19 PM · Data-Platform-SRE (2026.01.23 - 2026.02.13), Essential-Work, Kubernetes, Prod-Kubernetes
JMeybohm created T414484: Upgrade DSE clusters to kubernetes 1.31.
Tue, Jan 13, 4:18 PM · Data-Platform-SRE (2026.01.23 - 2026.02.13), Essential-Work, Kubernetes, Prod-Kubernetes
JMeybohm closed T414192: Requesting access to DataPlatform for trueg as Resolved.

Key has been verified and patch merged. You should have access after ~30min max.

Tue, Jan 13, 3:19 PM · SRE, SRE-Access-Requests
JMeybohm updated the task description for T414192: Requesting access to DataPlatform for trueg.
Tue, Jan 13, 3:18 PM · SRE, SRE-Access-Requests
JMeybohm added a comment to T414192: Requesting access to DataPlatform for trueg.

The kerberos principal has been created.
For out-of-band verification of the SSH key, please confirm the key by putting it on your wiki user page.

Tue, Jan 13, 2:48 PM · SRE, SRE-Access-Requests
JMeybohm triaged T414427: Increase capacity for Mercurius webvideoTranscode job (1080p) processing as Medium priority.
Tue, Jan 13, 2:29 PM · ServiceOps new, SRE, TimedMediaHandler-Transcode
JMeybohm updated the task description for T414192: Requesting access to DataPlatform for trueg.
Tue, Jan 13, 2:27 PM · SRE, SRE-Access-Requests
JMeybohm added a comment to T413364: Requesting access to analytics-privatedata-users for kareid.

Hi @Dzahn - the experimentation platform dashboards use private data, and as such I'll need to be part of the group to work on the dashboards. Level 1 access to the group should be sufficient.

Regarding shell access, from talking to the team, shell access is needed to deploy our service (documented in https://wikitech.wikimedia.org/wiki/Test_Kitchen/Test_Kitchen_UI/Administration#Deployment), so while I didn't initially ask for shell access, I will need it to be able to do deploys. If I should put in a separate request for that, I can do so, just let me know.

Tue, Jan 13, 2:11 PM · Data-Engineering, SRE-Access-Requests, SRE
JMeybohm renamed T413364: Requesting access to analytics-privatedata-users for kareid from Grant Access to analytics-privatedata-users for kareid to Requesting access to analytics-privatedata-users for kareid.
Tue, Jan 13, 2:09 PM · Data-Engineering, SRE-Access-Requests, SRE
JMeybohm updated the task description for T414347: Requesting deployment access for AKhatun.
Tue, Jan 13, 2:03 PM · Essential-Work, Data-Platform-SRE (2026.01.05 - 2026.01.23), SRE, SRE-Access-Requests
JMeybohm added a comment to T414375: Grant Access to analytics-privatedata-users for hmonroy.

Your account is already a member of the group (https://gerrit.wikimedia.org/r/plugins/gitiles/operations/puppet/+/refs/heads/production/modules/admin/data/data.yaml#459). Would you please take a look at https://wikitech.wikimedia.org/wiki/Data_Platform/Data_access#Access_Levels and try to describe what you are missing or what you are trying to do that does not work?

Tue, Jan 13, 2:02 PM · Essential-Work, Data-Platform-SRE (2026.01.05 - 2026.01.23), SRE, SRE-Access-Requests, Data-Engineering
JMeybohm assigned T414223: New SRE manager - Get emails sent to noc to MLechvien-WMF.
Tue, Jan 13, 12:54 PM · SRE
JMeybohm added a project to T407994: Move Druid realtime configuration out of Refinery into standalone repo on GitLab: Data-Platform-SRE.
Tue, Jan 13, 12:53 PM · Data-Platform-SRE, Data-Engineering (Q3 FY25/26 January 1st - March 31th), SRE
JMeybohm moved T414417: Add support for k8s 1.31 on trixie from Inbox to In Progress on the ServiceOps new board.
Tue, Jan 13, 11:17 AM · ServiceOps new, Kubernetes
JMeybohm triaged T414417: Add support for k8s 1.31 on trixie as High priority.
Tue, Jan 13, 11:17 AM · ServiceOps new, Kubernetes
JMeybohm added a comment to T414417: Add support for k8s 1.31 on trixie.

I have created the trixie components and copied the packages:

Tue, Jan 13, 11:16 AM · ServiceOps new, Kubernetes
JMeybohm created T414417: Add support for k8s 1.31 on trixie.
Tue, Jan 13, 10:27 AM · ServiceOps new, Kubernetes
JMeybohm added a comment to T400155: Reduce the chances of false positives on MSS clamping alerts.

We just got this as a red herring during a registry outage where nginx was failing to start (so nothing listening)

Tue, Jan 13, 9:48 AM · Liberica, Traffic

Mon, Jan 12

JMeybohm assigned T414032: Add yubikey ssh key for dancy to dancy.
Mon, Jan 12, 12:03 PM · SRE, Release-Engineering-Team, SRE-Access-Requests
JMeybohm reassigned T414192: Requesting access to DataPlatform for trueg from gmodena to DSantamaria.
Mon, Jan 12, 12:02 PM · SRE, SRE-Access-Requests
JMeybohm added a comment to T414187: Requesting access to Grafana and Logstash for trueg.

I am sorry, I do not know what this means: "Grafana access is granted by having an LDAP account."
Is the LDAP account not my dev account?

Mon, Jan 12, 8:47 AM · SRE, SRE-Access-Requests

Fri, Jan 9

JMeybohm added a comment to T414192: Requesting access to DataPlatform for trueg.

@trueg could you please specify what access level you're requesting/what you need access to (see https://wikitech.wikimedia.org/wiki/Data_Platform/Data_access#What_access_should_I_request?)?
I see that you currently don't have shell access, but given you provided an SSH key I assume you're requesting shell access and analytics-privatedata-user membership?

Fri, Jan 9, 1:19 PM · SRE, SRE-Access-Requests
JMeybohm updated the task description for T414192: Requesting access to DataPlatform for trueg.
Fri, Jan 9, 1:02 PM · SRE, SRE-Access-Requests
JMeybohm updated the task description for T414192: Requesting access to DataPlatform for trueg.
Fri, Jan 9, 12:49 PM · SRE, SRE-Access-Requests
JMeybohm closed T414187: Requesting access to Grafana and Logstash for trueg as Resolved.

Welcome!
Grafana access is granted by having an LDAP account. Please request access to logstash via Wikimedia IDM at https://idm.wikimedia.org.
Feel free to reopen this ticket in case you run into issues!

Fri, Jan 9, 12:48 PM · SRE, SRE-Access-Requests
JMeybohm updated the task description for T414061: Requesting access to analytics-privatedata-users for tgritschacher.
Fri, Jan 9, 12:39 PM · SRE, SRE-Access-Requests
JMeybohm closed T414061: Requesting access to analytics-privatedata-users for tgritschacher as Resolved.

Merged the patch prepared by @Dzahn (thanks).

Fri, Jan 9, 12:39 PM · SRE, SRE-Access-Requests
JMeybohm updated the task description for T413634: DannyS712 "offboarding".
Fri, Jan 9, 9:37 AM · Release-Engineering-Team, SecTeam-Processed, LDAP-Access-Requests, Security-Team, SRE-Access-Requests, SRE, User-DannyS712

Thu, Jan 8

JMeybohm updated the task description for T414061: Requesting access to analytics-privatedata-users for tgritschacher.
Thu, Jan 8, 4:44 PM · SRE, SRE-Access-Requests
JMeybohm updated subscribers of T414061: Requesting access to analytics-privatedata-users for tgritschacher.

@KFrancis could you please confirm NDA status?

Thu, Jan 8, 4:42 PM · SRE, SRE-Access-Requests
JMeybohm added a project to T413634: DannyS712 "offboarding": Release-Engineering-Team.

Release-Engineering-Team: Could you help with removing +2 ?

Thu, Jan 8, 4:41 PM · Release-Engineering-Team, SecTeam-Processed, LDAP-Access-Requests, Security-Team, SRE-Access-Requests, SRE, User-DannyS712
JMeybohm added a comment to T414102: Grant Access to wmf for aghirelli.

Access to the wmf group needs to be requested using the Wikimedia Identity Management System nowadays. If you run into issues, please feel free to reopen this task.

Thu, Jan 8, 4:34 PM · SRE, LDAP-Access-Requests
JMeybohm updated subscribers of T413994: Grant Access to wmde for martyn.ranyard.

@KFrancis could you please confirm NDA status?

Thu, Jan 8, 4:31 PM · SRE, LDAP-Access-Requests
JMeybohm updated subscribers of T413364: Requesting access to analytics-privatedata-users for kareid.

Do you currently have shell access (Yes/No): Not sure - how can I check?

Looking at our existing user file, I can see you are not set up for shell access yet. That is no problem; we can proceed.

If a manager can reply to approve, we will get the ball rolling. It seems to me kerberos access will be needed here, is that right?

Thu, Jan 8, 4:30 PM · Data-Engineering, SRE-Access-Requests, SRE
JMeybohm closed T413433: Unrecognised file under /srv/deployment-charts as Resolved.

I've moved the file out of the way to /root/See_T413433 in case someone lost a session.

Thu, Jan 8, 4:13 PM · Data-Engineering, Data Pipelines, SRE

Dec 19 2025

JMeybohm added a comment to T413179: deployment charts: automate testing on staging.

FWIW there is the concept of helm test (https://helm.sh/docs/topics/chart_tests/) that is totally unused for most of our services although we do create a test based on service-checker by default for all new charts: https://gerrit.wikimedia.org/r/plugins/gitiles/operations/deployment-charts/+/refs/heads/master/_scaffold/service/_skel/templates/tests/test-service-checker.yaml

Dec 19 2025, 12:22 PM · ServiceOps-Services-Oids, Kubernetes, ServiceOps new, Developer Productivity

Dec 18 2025

JMeybohm added a comment to T390861: wikikube-ctrl200[4-5] implementation tracking.

I think those would be the first wikikube nodes that we use UEFI on, so we may want to pay a little more attention than usual. No objections on the face of it though.

Dec 18 2025, 12:17 PM · ServiceOps-Upgrades-Hardware, ServiceOps new, Patch-For-Review

Dec 16 2025

JMeybohm closed T265357: Build envoy-build-tools image locally as Declined.

Since we package envoy binaries now, this is no longer required.

Dec 16 2025, 2:22 PM · serviceops
JMeybohm reassigned T360636: Phase out cergen for ServiceOps services from JMeybohm to MoritzMuehlenhoff.

Thanks for volunteering to remove the remaining certs and cergen config during your January cleanup

Dec 16 2025, 2:01 PM · serviceops, Epic, SRE
JMeybohm closed T360636: Phase out cergen for ServiceOps services, a subtask of T357750: Phase out cergen, as Resolved.
Dec 16 2025, 1:01 PM · Patch-For-Review, Puppet-Infrastructure, Puppet (Puppet 7.0), Infrastructure-Foundations, SRE
JMeybohm closed T360636: Phase out cergen for ServiceOps services as Resolved.

With T352245: Migrate the etcd main cluster to cfssl-based PKI resolved, this has now been completed.

Dec 16 2025, 1:01 PM · serviceops, Epic, SRE
JMeybohm closed T357616: Logs from containers sometimes not visible in logstash as Resolved.

Closing again because it seems to work fine mostly and we can't reproduce failures

Dec 16 2025, 12:55 PM · Patch-For-Review, Observability-Logging, serviceops
JMeybohm closed T402014: Add ipblock-source objects and logic as Resolved.

This is done. I've created T412805: Migrate ipblocks from fetch_external_clouds_vendors_nets.py to HIDDENPARMA for the follow up work.

Dec 16 2025, 12:53 PM · Patch-For-Review, Traffic, Hiddenparma
JMeybohm closed T402014: Add ipblock-source objects and logic, a subtask of T400100: FY 25/26 WE 5.4.2: Known bots / clients, as Resolved.
Dec 16 2025, 12:53 PM · Epic, ServiceOps new, SRE
JMeybohm created T412805: Migrate ipblocks from fetch_external_clouds_vendors_nets.py to HIDDENPARMA.
Dec 16 2025, 12:53 PM · ServiceOps new, Patch-For-Review, SRE
JMeybohm added a comment to T390861: wikikube-ctrl200[4-5] implementation tracking.

Two questions/suggestions in this regard:

  • I see that we also have wikikube-ctrl2006 racked (T406596), would it make sense to do all three at once?
  • Given we moved to UEFI as default (and wikikube-ctrl2006 seems to require it anyways) I would suggest to switch wikikube-ctrl200[4-5] to UEFI as well (so we don't have to do that later), see: https://wikitech.wikimedia.org/wiki/UEFI_Boot
Dec 16 2025, 12:03 PM · ServiceOps-Upgrades-Hardware, ServiceOps new, Patch-For-Review

Dec 15 2025

JMeybohm moved T387760: Migrate release template inheritance in helmfiles from YAML anchors to the inherit field from Incoming 🐫 to 🥋Good First Task on the serviceops board.
Dec 15 2025, 1:56 PM · Data-Platform-SRE, Kubernetes, Prod-Kubernetes, serviceops