SLyngshede-WMF (Simon Lyngshede)
User

User Details

User Since
May 2 2022, 11:51 AM (22 w, 2 d)
Availability
Available
LDAP User
Slyngshede
MediaWiki User
SLyngshede-WMF [ Global Accounts ]

Recent Activity

Today

SLyngshede-WMF added a comment to T319409: Pick a name for the IDM.

There's also a Norse version https://en.wikipedia.org/wiki/M%C3%B3%C3%B0gu%C3%B0r

Wed, Oct 5, 1:45 PM · Infrastructure-Foundations, SRE

Tue, Sep 13

SLyngshede-WMF added a comment to T311288: Implement Prometheus exporter for Ganeti capacity data.

The patch uses the oper_state of the instances, rather than just assuming that None should be 0. It's much the same result, but it feels more correct.

Tue, Sep 13, 12:41 PM · Patch-For-Review, Observability-Metrics, Ganeti, Infrastructure-Foundations, SRE
SLyngshede-WMF added a comment to T311288: Implement Prometheus exporter for Ganeti capacity data.

The problem is this host: dispatch-be1001.eqiad.wmnet which is configured to be down. It does in fact have no vCPUs allocated.

Tue, Sep 13, 12:16 PM · Patch-For-Review, Observability-Metrics, Ganeti, Infrastructure-Foundations, SRE
SLyngshede-WMF added a comment to T311288: Implement Prometheus exporter for Ganeti capacity data.

One of the hosts actually does report having "None" oper_vcpus, rather than 0.

Tue, Sep 13, 11:57 AM · Patch-For-Review, Observability-Metrics, Ganeti, Infrastructure-Foundations, SRE

Mon, Sep 12

SLyngshede-WMF added a comment to T317344: New RAID alerts (e.g. WARNING: unexpectedly checked no devices).

The servers with the PERC controllers are now happy and report no errors. The controllers are currently configured for JBOD, meaning that there are no virtual disks/RAID arrays, and the perccli tool then just happily removes all references to the VD LIST.

Mon, Sep 12, 7:46 AM · Patch-For-Review, cloud-services-team (Kanban)

Fri, Sep 9

SLyngshede-WMF claimed T317344: New RAID alerts (e.g. WARNING: unexpectedly checked no devices).
Fri, Sep 9, 12:06 PM · Patch-For-Review, cloud-services-team (Kanban)

Sep 2 2022

SLyngshede-WMF added a comment to T316903: vrts - spamassassin icinga alerts.

It seems weird if the script didn't also sometimes fail under cron. Perhaps we just didn't notice.
But yes, maybe we just need to modify the script a bit and have it send an email, and then notice alert as a failed service. It is just SpamAssassin definition updates; it's not super critical if they are a day behind.

Sep 2 2022, 6:34 AM · serviceops, serviceops-collab, vrts

Sep 1 2022

SLyngshede-WMF changed the status of T315537: IcingaHosts.wait_for_downtimed() does not honor dry_run from Open to In Progress.

Missing the last two bullet points:

Sep 1 2022, 11:08 AM · Infrastructure-Foundations, SRE-tools, Spicerack

Aug 2 2022

SLyngshede-WMF created T314371: Access request for Cumin hosts in WMCS.
Aug 2 2022, 11:16 AM · User-dcaro, Cloud-VPS, cloud-services-team (Kanban)

Jul 27 2022

SLyngshede-WMF added a comment to T313603: Business hours oncall implementation delays pages to batphone by 5 minutes when there are no oncallers.

A slightly weird way of handling the issue automatically could be using Selenium. It seems a bit overkill for a minor issue, but if it's something we want to automate, that could be a way to do it.

Jul 27 2022, 7:00 AM · Patch-For-Review, User-fgiunchedi, SRE Observability (FY2022/2023-Q1), observability, SRE-OnFire

Jun 24 2022

SLyngshede-WMF changed the status of T311288: Implement Prometheus exporter for Ganeti capacity data from Open to In Progress.
Jun 24 2022, 8:41 AM · Patch-For-Review, Observability-Metrics, Ganeti, Infrastructure-Foundations, SRE
SLyngshede-WMF created T311288: Implement Prometheus exporter for Ganeti capacity data.
Jun 24 2022, 8:41 AM · Patch-For-Review, Observability-Metrics, Ganeti, Infrastructure-Foundations, SRE

Jun 21 2022

SLyngshede-WMF closed T309375: Requesting access to contint-admins for taavi as Resolved.

@taavi You're now added to ciadmin, but let me know if something doesn't work.

Jun 21 2022, 7:04 AM · SRE, SRE-Access-Requests
SLyngshede-WMF added a comment to T309375: Requesting access to contint-admins for taavi.

@taavi Sorry, didn't spot that. I'll be right back :)

Jun 21 2022, 7:01 AM · SRE, SRE-Access-Requests
SLyngshede-WMF closed T309375: Requesting access to contint-admins for taavi as Resolved.
Jun 21 2022, 6:54 AM · SRE, SRE-Access-Requests

Jun 17 2022

SLyngshede-WMF closed T310227: Requesting access to Superset for Ricardo Baeza-Yates as Resolved.
Jun 17 2022, 10:59 AM · SRE, Product-Analytics, LDAP-Access-Requests
SLyngshede-WMF closed T302231: Requesting access to deployment for TheresNoTime as Resolved.
Jun 17 2022, 10:57 AM · SRE-Access-Requests, SRE
SLyngshede-WMF updated the task description for T302231: Requesting access to deployment for TheresNoTime.
Jun 17 2022, 10:56 AM · SRE-Access-Requests, SRE

Jun 16 2022

SLyngshede-WMF triaged T310721: eventstreams chart should use latest common_templates as Medium priority.
Jun 16 2022, 6:25 PM · Event-Platform Value Stream (Sprint 02), Patch-For-Review, Data-Engineering, SRE, serviceops
SLyngshede-WMF triaged T310761: Allow Wikimedia Maps usage on desciclopedia.org as High priority.
Jun 16 2022, 7:42 AM · Maps, SRE
SLyngshede-WMF triaged T310738: Setup redirect of policy.wikimedia.org to Advocacy portal on Foundation website as Medium priority.
Jun 16 2022, 7:40 AM · serviceops-collab, Traffic, wikimediafoundation.org, SRE, serviceops, DNS, WMF-Legal

Jun 15 2022

SLyngshede-WMF closed T283165: OpenSSL < 1.1.0 compatibility issues with new LE issuance chain as Resolved.
Jun 15 2022, 2:00 PM · Patch-For-Review, Infrastructure-Foundations, SRE, Traffic
SLyngshede-WMF closed T283165: OpenSSL < 1.1.0 compatibility issues with new LE issuance chain, a subtask of T283164: Let's Encrypt issuance chains update, as Resolved.
Jun 15 2022, 1:59 PM · SRE, Traffic
SLyngshede-WMF closed T268974: systemd.timer not executing on cumin2001 after command was modified as Resolved.
Jun 15 2022, 1:05 PM · Infrastructure-Foundations, Patch-For-Review, Puppet, SRE
SLyngshede-WMF triaged T310227: Requesting access to Superset for Ricardo Baeza-Yates as High priority.

@leila I've added Ricardo to the analytics-privatedata-users group and the NDA group in LDAP, rather than the WMF group, given that we only have Wikimedia employees in the WMF group. The result should be the same though.

Jun 15 2022, 12:03 PM · SRE, Product-Analytics, LDAP-Access-Requests
SLyngshede-WMF closed T310524: Grant Access to Superset and Tunilo for Caroline Myrick as Resolved.
Jun 15 2022, 11:01 AM · SRE, LDAP-Access-Requests
SLyngshede-WMF triaged T310524: Grant Access to Superset and Tunilo for Caroline Myrick as High priority.

@CMyrick-WMF I have added you to the WMF LDAP group, which should grant you access to Superset and Turnilo.

Jun 15 2022, 11:01 AM · SRE, LDAP-Access-Requests
SLyngshede-WMF added a member for WMF-NDA: CMyrick-WMF.
Jun 15 2022, 10:59 AM
SLyngshede-WMF closed T310385: Grant Access to wmf for Xcollazo as Resolved.
Jun 15 2022, 9:08 AM · SRE, LDAP-Access-Requests
SLyngshede-WMF closed T310055: Check access rights for GoranSMilovanovic as Resolved.

Email address is updated, everything else looks fine.

Jun 15 2022, 8:55 AM · SRE, SRE-Access-Requests, LDAP-Access-Requests
SLyngshede-WMF triaged T310608: (Re) evaluate effectiveness / usefulness of varnish/haproxy traffic drop alerts as Medium priority.
Jun 15 2022, 8:51 AM · SRE, SRE-OnFire, Sustainability (Incident Followup), Traffic
SLyngshede-WMF added a comment to T310620: Requesting SSH keypair for deployment server keyholder to push to Gerrit.

This doesn't appear to be an SRE-Access-Request. Adding the ServiceOps tags, as they are involved in the Kubernetes migration and it makes sense to loop them in.

Jun 15 2022, 8:51 AM · serviceops, SRE
SLyngshede-WMF triaged T310620: Requesting SSH keypair for deployment server keyholder to push to Gerrit as Medium priority.
Jun 15 2022, 8:50 AM · serviceops, SRE
SLyngshede-WMF added a comment to T310450: fawiki user reports getting 503 errors with message "upstream connect error or disconnect before headers".

Merged with an unrelated bug, and the relevant tags were dropped. I've re-added the correct tags.

Jun 15 2022, 8:42 AM · serviceops, Wikimedia-production-error
SLyngshede-WMF added a project to T310450: fawiki user reports getting 503 errors with message "upstream connect error or disconnect before headers": serviceops.
Jun 15 2022, 8:41 AM · serviceops, Wikimedia-production-error
SLyngshede-WMF edited projects for T310450: fawiki user reports getting 503 errors with message "upstream connect error or disconnect before headers", added: Discovery-Search, CirrusSearch, Wikimedia-production-error; removed serviceops, Traffic, SRE.
Jun 15 2022, 8:40 AM · serviceops, Wikimedia-production-error
SLyngshede-WMF triaged T310450: fawiki user reports getting 503 errors with message "upstream connect error or disconnect before headers" as Medium priority.
Jun 15 2022, 8:38 AM · serviceops, Wikimedia-production-error
SLyngshede-WMF removed a project from T310528: Thumbor URLs are too permissive: SRE.
Jun 15 2022, 8:36 AM · Traffic, Thumbor
SLyngshede-WMF triaged T310528: Thumbor URLs are too permissive as Medium priority.
Jun 15 2022, 8:36 AM · Traffic, Thumbor
SLyngshede-WMF raised the priority of T310062: Update conf1* servers from Medium to High.
Jun 15 2022, 8:35 AM · serviceops, SRE
SLyngshede-WMF triaged T310087: Advance declaration of query parameters as Medium priority.
Jun 15 2022, 7:58 AM · SRE, Traffic, MediaWiki-General
SLyngshede-WMF triaged T310062: Update conf1* servers as Medium priority.
Jun 15 2022, 7:57 AM · serviceops, SRE
SLyngshede-WMF closed T310654: Allow deployers to sudo -u mwpresync as Resolved.
Jun 15 2022, 7:54 AM · SRE, Infrastructure-Foundations, Release-Engineering-Team (Deployment Autopilot 🛩️), SRE-Access-Requests
SLyngshede-WMF closed T310654: Allow deployers to sudo -u mwpresync, a subtask of T310395: Automated Tuesday Train via a timer, as Resolved.
Jun 15 2022, 7:54 AM · Release-Engineering-Team (Priority Backlog 📥), Patch-For-Review, Scap
SLyngshede-WMF triaged T309885: cloudstore1008 - eno2 reporting no carrier as Medium priority.
Jun 15 2022, 7:46 AM · ops-eqiad, SRE
SLyngshede-WMF raised the priority of T310610: Degraded RAID on aqs2005 from Low to Medium.
Jun 15 2022, 7:44 AM · SRE, ops-codfw
SLyngshede-WMF triaged T310610: Degraded RAID on aqs2005 as Low priority.
Jun 15 2022, 7:43 AM · SRE, ops-codfw

Jun 14 2022

SLyngshede-WMF added a comment to T310055: Check access rights for GoranSMilovanovic.

Then let's not revoke that :-)

Jun 14 2022, 1:55 PM · SRE, SRE-Access-Requests, LDAP-Access-Requests
SLyngshede-WMF added a comment to T310055: Check access rights for GoranSMilovanovic.

I notice that Goran has access to analytics_privatedata_users, is that still required?

Jun 14 2022, 1:44 PM · SRE, SRE-Access-Requests, LDAP-Access-Requests
SLyngshede-WMF claimed T310555: Requesting access to Analytics for xcollazo.
Jun 14 2022, 9:37 AM · SRE, SRE-Access-Requests
SLyngshede-WMF updated subscribers of T310555: Requesting access to Analytics for xcollazo.

This needs approval from @odimitrijevic or @Ottomata in order to grant access to the analytics-privatedata-users group.

Jun 14 2022, 9:04 AM · SRE, SRE-Access-Requests
SLyngshede-WMF added a comment to T310055: Check access rights for GoranSMilovanovic.

We have removed Goran from the WMDE group, as that is only for WMDE staff.

Jun 14 2022, 7:40 AM · SRE, SRE-Access-Requests, LDAP-Access-Requests
SLyngshede-WMF added a comment to T310055: Check access rights for GoranSMilovanovic.

@KFrancis we just need to update Goran's email address; it's still listed as the wikimedia.de address. Could you please provide me with the updated address?

Jun 14 2022, 7:18 AM · SRE, SRE-Access-Requests, LDAP-Access-Requests

Jun 13 2022

SLyngshede-WMF triaged T310387: cp1089 memory errors on DIMM_B1 as Medium priority.
Jun 13 2022, 1:40 PM · ops-eqiad, SRE, DC-Ops, Traffic
SLyngshede-WMF closed T310465: Request to create new mailing lists for Chinese Wikipedia Administrators as Resolved.

The mailing list has been created, but please check that you have access via: https://lists.wikimedia.org
Also check the settings for the list. It is a private and currently unlisted mailing list, and I've disabled the archives, but feel free to re-enable them if needed.

Jun 13 2022, 12:48 PM · SRE, Wikimedia-Mailing-lists, Chinese-Sites
SLyngshede-WMF changed the status of T310465: Request to create new mailing lists for Chinese Wikipedia Administrators from Open to In Progress.
Jun 13 2022, 12:10 PM · SRE, Wikimedia-Mailing-lists, Chinese-Sites
SLyngshede-WMF triaged T310465: Request to create new mailing lists for Chinese Wikipedia Administrators as Low priority.

We just need to clarify if there's an approval process for requesting new mailing lists. I'll try to find out and let you know as soon as possible.

Jun 13 2022, 12:00 PM · SRE, Wikimedia-Mailing-lists, Chinese-Sites
SLyngshede-WMF closed T309886: an-tool1005 - memcached Connection refused as Resolved.

Memcache was restarted by @elukey on Mon 2022-06-06 06:30:42 UTC

Jun 13 2022, 11:56 AM · SRE
SLyngshede-WMF assigned T310044: Requesting access to ores-admin for ml-team-admins to elukey.
Jun 13 2022, 11:05 AM · SRE, SRE-Access-Requests
SLyngshede-WMF updated subscribers of T310455: thumbor2004 is down.
Jun 13 2022, 8:18 AM · ops-codfw, Thumbor, SRE
SLyngshede-WMF triaged T310455: thumbor2004 is down as Medium priority.
Jun 13 2022, 8:16 AM · ops-codfw, Thumbor, SRE

May 27 2022

SLyngshede-WMF closed T309371: Gerrit: all patches are being reported as merge conflicts as Resolved.
May 27 2022, 7:43 AM · Release-Engineering-Team, User-DannyS712, Continuous-Integration-Infrastructure
SLyngshede-WMF closed T309371: Gerrit: all patches are being reported as merge conflicts, a subtask of T309376: gerrit-bot holding open SSH sessions, as Resolved.
May 27 2022, 7:43 AM · Release-Engineering-Team, Continuous-Integration-Infrastructure
SLyngshede-WMF added a comment to T309371: Gerrit: all patches are being reported as merge conflicts.

I've restarted Zuul on contint2001, and that seems to have helped a bit.

May 27 2022, 7:27 AM · Release-Engineering-Team, User-DannyS712, Continuous-Integration-Infrastructure

May 10 2022

SLyngshede-WMF closed T307574: New VictorOps user request for slyngshede as Resolved.
May 10 2022, 12:20 PM · SRE Observability (FY2021/2022-Q4), observability
SLyngshede-WMF added a comment to T307574: New VictorOps user request for slyngshede.

@fgiunchedi It worked, thank you.

May 10 2022, 12:19 PM · SRE Observability (FY2021/2022-Q4), observability

May 9 2022

SLyngshede-WMF reopened T307574: New VictorOps user request for slyngshede as "Open".

I still haven't received the invite from VictorOps, can we try resending it?

May 9 2022, 6:53 AM · SRE Observability (FY2021/2022-Q4), observability

May 4 2022

SLyngshede-WMF created T307574: New VictorOps user request for slyngshede.
May 4 2022, 1:14 PM · SRE Observability (FY2021/2022-Q4), observability
SLyngshede-WMF created T307562: Security Issue Access Request for SLyngshede-WMF.
May 4 2022, 11:18 AM · SecTeam-Processed, Security
SLyngshede-WMF created T307548: Add SLyngshede to security@wikimedia.org mailinglist.
May 4 2022, 8:24 AM · SecTeam-Processed