Page MenuHomePhabricator

Clement_Goubert (claime)
Senior SRE

Projects (10)

Today

  • No visible events.

Tomorrow

  • No visible events.

Tuesday

  • No visible events.

User Details

User Since
Jul 26 2022, 2:11 PM (189 w, 4 d)
Availability
Available
IRC Nick
claime
LDAP User
Clément Goubert
MediaWiki User
CGoubert-WMF [ Global Accounts ]

Recent Activity

Fri, Mar 13

Clement_Goubert created T420041: db1253 depooled following host crash.
Fri, Mar 13, 6:09 PM · Patch-For-Review, DBA
Clement_Goubert closed T419866: CORS policy errors with any user script that calls a different subdomain's API as Resolved.
Fri, Mar 13, 12:12 PM · SecTeam-Processed, WMF-General-or-Unknown, Hackathon-Northwestern-Europe-2026, MediaWiki-Platform-Team, JavaScript
Clement_Goubert added a comment to T419866: CORS policy errors with any user script that calls a different subdomain's API.

We have removed the rate limits on OPTIONS requests and I don't see anymore 429 responses to them in logs. Resolving, please feel free to reopen if the issue presents itself again.

Fri, Mar 13, 12:12 PM · SecTeam-Processed, WMF-General-or-Unknown, Hackathon-Northwestern-Europe-2026, MediaWiki-Platform-Team, JavaScript

Thu, Mar 12

Clement_Goubert added a comment to T416390: Q3:rack/setup/install wikikube-worker137[3-4].

@Clement_Goubert Could you update site.pp it is missing. Also add these to preseed for efi booting also Thanks!

Thu, Mar 12, 5:39 PM · SRE, DC-Ops, ops-eqiad

Wed, Mar 11

Clement_Goubert moved T414434: [WE5.4.8] Media rate limiting from Radar (Awareness) to Radar (Pending) on the ServiceOps new board.
Wed, Mar 11, 3:16 PM · ServiceOps new, Epic
Clement_Goubert moved T249663: write some recording rules for queries used in the appserver RED k8s dashboard from Radar (Awareness) to Radar (Pending) on the ServiceOps new board.
Wed, Mar 11, 3:16 PM · Observability-Metrics, SRE Observability (FY2025/2026-Q3), Prod-Kubernetes, ServiceOps new, SRE
Clement_Goubert moved T410198: Determine the source of internal requests going through the API gateway. from Radar (Awareness) to Radar (Pending) on the ServiceOps new board.
Wed, Mar 11, 3:15 PM · ServiceOps-SharedInfra, ServiceOps new, MediaWiki-Platform-Team (Q3 Kanban Board), Content-Transform-Team (Work In Progress), Essential-Work, PageViewInfo, Growth-Team, OKR-Work
Clement_Goubert moved T411771: Migrate PageViewInfo calls away from rest-gateway from Needs Info / Blocked to Radar (Pending) on the ServiceOps new board.
Wed, Mar 11, 3:12 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), ServiceOps-SharedInfra, ServiceOps new, PageViewInfo
Clement_Goubert added a comment to T411771: Migrate PageViewInfo calls away from rest-gateway.

I'll leave the MediaWiki internals for you to decide, but as far as what we want to achieve, yes, that sounds right. Given pageviews and unique-devices are now two services, the PageViewInfo extension needs to be able to address both individually through two different service mesh listeners.

Wed, Mar 11, 3:09 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), ServiceOps-SharedInfra, ServiceOps new, PageViewInfo
Clement_Goubert added a comment to T417026: Create a rewrite for the GraphQL endpoint on wikidata.org.

Ack, thanks for following up.

Wed, Mar 11, 10:06 AM · ServiceOps new, Wikimedia-Apache-configuration, SRE, Wikidata, Wikibase GraphQL, Wikibase Reuse Team

Mon, Mar 9

Clement_Goubert moved T418494: Delete the API Portal wiki from Backlog to Next quarter on the ServiceOps new board.
Mon, Mar 9, 3:35 PM · ServiceOps new (Next quarter), ServiceOps-SharedInfra, Wiki-Setup (Close), API-Portal, Tech-Docs-Team
Clement_Goubert moved T418492: Redirect API Portal wiki URLs to www.mediawiki.org/wiki/Wikimedia_APIs from Backlog to Next quarter on the ServiceOps new board.
Mon, Mar 9, 3:34 PM · ServiceOps new (Next quarter), ServiceOps-SharedInfra, API-Portal, Tech-Docs-Team
Clement_Goubert added a comment to T390773: Split PrivateSettings into config and business logic.

[...]

  1. Update deployment-charts, not really sure what needs to be done here (here it seems to claim it's not used anymore)
Mon, Mar 9, 2:57 PM · SecTeam-Processed, Release-Engineering-Team, Security-Team, Security
Clement_Goubert added a comment to T419212: Upgrade ServiceOps roles to Debian Trixie.

@JMeybohm as discussed today:

Mon, Mar 9, 2:50 PM · ServiceOps-Upgrades-Hardware, ServiceOps new (Next quarter)
Clement_Goubert placed T418916: Q3:rack/setup/install rdb101[56] up for grabs.
Mon, Mar 9, 2:34 PM · Patch-For-Review, ServiceOps-Upgrades-Hardware, SRE, ServiceOps new, ops-eqiad, DC-Ops
Clement_Goubert placed T418925: Q3:rack/setup/install wikikube-worker23[57-74] up for grabs.
Mon, Mar 9, 2:34 PM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-eqiad, DC-Ops

Thu, Mar 5

Clement_Goubert added a comment to T418939: eno1 on wikikube-worker1162:9100 has the wrong speed: 1.25e+07..

Host back in the pool, thanks <3

Thu, Mar 5, 5:12 PM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-eqiad, DC-Ops
Clement_Goubert updated subscribers of T417026: Create a rewrite for the GraphQL endpoint on wikidata.org.

Depending on exactly what you want this rewrite to do, it may be that an apache rule isn't the right choice here.

Thu, Mar 5, 5:08 PM · ServiceOps new, Wikimedia-Apache-configuration, SRE, Wikidata, Wikibase GraphQL, Wikibase Reuse Team
Clement_Goubert moved T418939: eno1 on wikikube-worker1162:9100 has the wrong speed: 1.25e+07. from Inbox to Radar (Awareness) on the ServiceOps new board.
Thu, Mar 5, 12:59 PM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-eqiad, DC-Ops
Clement_Goubert added projects to T418939: eno1 on wikikube-worker1162:9100 has the wrong speed: 1.25e+07.: ServiceOps new, ServiceOps-Upgrades-Hardware.
Thu, Mar 5, 12:59 PM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-eqiad, DC-Ops
Clement_Goubert added a comment to T418939: eno1 on wikikube-worker1162:9100 has the wrong speed: 1.25e+07..

It's cordoned and depooled, fire away.

Thu, Mar 5, 12:57 PM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-eqiad, DC-Ops

Wed, Mar 4

Clement_Goubert updated the task description for T418925: Q3:rack/setup/install wikikube-worker23[57-74].
Wed, Mar 4, 1:21 PM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-eqiad, DC-Ops
Clement_Goubert added a comment to T418916: Q3:rack/setup/install rdb101[56].

Updated task description with racking details and OS. Waiting for review on the puppet patch.

Wed, Mar 4, 11:46 AM · Patch-For-Review, ServiceOps-Upgrades-Hardware, SRE, ServiceOps new, ops-eqiad, DC-Ops
Clement_Goubert updated the task description for T418916: Q3:rack/setup/install rdb101[56].
Wed, Mar 4, 11:44 AM · Patch-For-Review, ServiceOps-Upgrades-Hardware, SRE, ServiceOps new, ops-eqiad, DC-Ops
Clement_Goubert moved T418922: Q3:rack/setup/install rdb201[34] from In Progress to Radar (Awareness) on the ServiceOps new board.

Updated racking details, changed OS to Trixie, puppet patches merged. All yours.

Wed, Mar 4, 11:42 AM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-codfw, DC-Ops
Clement_Goubert placed T418922: Q3:rack/setup/install rdb201[34] up for grabs.
Wed, Mar 4, 11:41 AM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-codfw, DC-Ops
Clement_Goubert added a comment to T418922: Q3:rack/setup/install rdb201[34].

@Clement_Goubert The current rdb* hosts are on Bullseye, and you listed Bookworm as the designated OS, if we move to a new OS, let's directly move to Trixie?

Wed, Mar 4, 11:41 AM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-codfw, DC-Ops
Clement_Goubert placed T418919: Q3:rack/setup/install wikikube-ctrl100[56] up for grabs.

All yours.

Wed, Mar 4, 11:37 AM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-eqiad, DC-Ops
Clement_Goubert updated subscribers of T418919: Q3:rack/setup/install wikikube-ctrl100[56].

@Clement_Goubert,

I've assumed the racking details, please double check them for accuracy and provide racking preferences in addition to the boilerplate:

Please update the site.pp file with the insetup role for your team (detailed on https://wikitech.wikimedia.org/wiki/SRE/Dc-operations) and add the new servers to preseed.yml for partition info.

If possible, please reference this task number in your patch set, so it is clear when complete. Once complete, just un-assign yourself (leaving no assignee) for this task and once the hardware arrives on-site engineerss will claim this task for racking and setup. Please don't re-subscribe me to this task unless there is a direct question for me.

Thank you!

Wed, Mar 4, 11:19 AM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-eqiad, DC-Ops
Clement_Goubert renamed T418919: Q3:rack/setup/install wikikube-ctrl100[56] from Q3:rack/setup/install wikikube-ctrl100[45] to Q3:rack/setup/install wikikube-ctrl100[56].
Wed, Mar 4, 11:18 AM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-eqiad, DC-Ops
Clement_Goubert updated the task description for T418922: Q3:rack/setup/install rdb201[34].
Wed, Mar 4, 10:55 AM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-codfw, DC-Ops
Clement_Goubert moved T418916: Q3:rack/setup/install rdb101[56] from Inbox to In Progress on the ServiceOps new board.
Wed, Mar 4, 10:49 AM · Patch-For-Review, ServiceOps-Upgrades-Hardware, SRE, ServiceOps new, ops-eqiad, DC-Ops
Clement_Goubert moved T418919: Q3:rack/setup/install wikikube-ctrl100[56] from Inbox to In Progress on the ServiceOps new board.
Wed, Mar 4, 10:49 AM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-eqiad, DC-Ops
Clement_Goubert moved T418922: Q3:rack/setup/install rdb201[34] from Inbox to In Progress on the ServiceOps new board.
Wed, Mar 4, 10:49 AM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-codfw, DC-Ops
Clement_Goubert moved T418925: Q3:rack/setup/install wikikube-worker23[57-74] from Inbox to In Progress on the ServiceOps new board.
Wed, Mar 4, 10:49 AM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-eqiad, DC-Ops

Tue, Mar 3

Clement_Goubert moved T414446: Enforce upload rate limit from Scheduled (this Q) to Backlog on the ServiceOps new board.
Tue, Mar 3, 12:37 PM · ServiceOps new (Next quarter), ServiceOps-Services-Oids
Clement_Goubert moved T418145: Configure ATS to allow fractional routing for api.wikimedia.org from Scheduled (this Q) to In Progress on the ServiceOps new board.
Tue, Mar 3, 12:12 PM · MW-Interfaces-Team, Patch-For-Review, ServiceOps new, OKR-Work

Mon, Mar 2

Clement_Goubert changed the status of T417772: wikikube-worker23[32-56] implementation tracking, a subtask of T408757: Q2:rack/setup/install wikikube-worker2332-56, from Open to In Progress.
Mon, Mar 2, 3:06 PM · ServiceOps-Upgrades-Hardware, ServiceOps new, SRE, ops-codfw, DC-Ops
Clement_Goubert changed the status of T417772: wikikube-worker23[32-56] implementation tracking from Open to In Progress.

This should now be unblocked.

Mon, Mar 2, 3:06 PM · ServiceOps-Upgrades-Hardware, ServiceOps new

Thu, Feb 26

Clement_Goubert assigned T397075: Package Wikimedia's PHP 8.3 component for bookworm to Scott_French.

Assigning to Scott for reshaping.

Thu, Feb 26, 5:23 PM · ServiceOps new, ServiceOps-Mediawiki
Clement_Goubert triaged T397075: Package Wikimedia's PHP 8.3 component for bookworm as Low priority.
Thu, Feb 26, 5:22 PM · ServiceOps new, ServiceOps-Mediawiki
Clement_Goubert moved T348379: Migrate use of php-tideways_xhprof to php-xhprof from Inbox to Radar (Awareness) on the ServiceOps new board.
Thu, Feb 26, 5:18 PM · ServiceOps-Mediawiki, ServiceOps new, MW-1.45-notes (1.45.0-wmf.18; 2025-09-09), Patch-For-Review, Documentation, MediaWiki-Core-Profiler
Clement_Goubert edited projects for T348379: Migrate use of php-tideways_xhprof to php-xhprof, added: ServiceOps new, ServiceOps-Mediawiki; removed serviceops-deprecated.
Thu, Feb 26, 5:17 PM · ServiceOps-Mediawiki, ServiceOps new, MW-1.45-notes (1.45.0-wmf.18; 2025-09-09), Patch-For-Review, Documentation, MediaWiki-Core-Profiler
Clement_Goubert triaged T410273: api rate limiting: Assign ratelimit class based on IP range as High priority.
Thu, Feb 26, 5:15 PM · ServiceOps-SharedInfra, ServiceOps new, MW-Interfaces-Team, MediaWiki-Platform-Team, OKR-Work
Clement_Goubert moved T410273: api rate limiting: Assign ratelimit class based on IP range from Inbox to In Progress on the ServiceOps new board.
Thu, Feb 26, 5:15 PM · ServiceOps-SharedInfra, ServiceOps new, MW-Interfaces-Team, MediaWiki-Platform-Team, OKR-Work
Clement_Goubert edited projects for T410273: api rate limiting: Assign ratelimit class based on IP range, added: ServiceOps new, ServiceOps-SharedInfra; removed serviceops-deprecated.
Thu, Feb 26, 5:14 PM · ServiceOps-SharedInfra, ServiceOps new, MW-Interfaces-Team, MediaWiki-Platform-Team, OKR-Work
Clement_Goubert moved T391457: Move Wikikube services to Istio ingress (where possible) from Inbox to Backlog on the ServiceOps new board.
Thu, Feb 26, 5:07 PM · ServiceOps-SharedInfra, ServiceOps new
Clement_Goubert triaged T391457: Move Wikikube services to Istio ingress (where possible) as Low priority.
Thu, Feb 26, 5:07 PM · ServiceOps-SharedInfra, ServiceOps new
Clement_Goubert updated the task description for T418145: Configure ATS to allow fractional routing for api.wikimedia.org.
Thu, Feb 26, 3:22 PM · MW-Interfaces-Team, Patch-For-Review, ServiceOps new, OKR-Work
Clement_Goubert updated the task description for T418494: Delete the API Portal wiki.
Thu, Feb 26, 3:10 PM · ServiceOps new (Next quarter), ServiceOps-SharedInfra, Wiki-Setup (Close), API-Portal, Tech-Docs-Team
Clement_Goubert moved T415923: Plan shutdown process for the API Portal from Scheduled (this Q) to Radar (Awareness) on the ServiceOps new board.

Moving to serviceops Radar as we have provided the required information.

Thu, Feb 26, 3:09 PM · ServiceOps-SharedInfra, ServiceOps new, API-Portal
Clement_Goubert updated the task description for T415923: Plan shutdown process for the API Portal.
Thu, Feb 26, 3:08 PM · ServiceOps-SharedInfra, ServiceOps new, API-Portal
Clement_Goubert removed a subtask for T415293: Shut down the API Portal: T418492: Redirect API Portal wiki URLs to www.mediawiki.org/wiki/Wikimedia_APIs.
Thu, Feb 26, 3:08 PM · User-notice, Wiki-Setup (Close), API-Portal, Tech-Docs-Team
Clement_Goubert removed a parent task for T418492: Redirect API Portal wiki URLs to www.mediawiki.org/wiki/Wikimedia_APIs: T415293: Shut down the API Portal.
Thu, Feb 26, 3:07 PM · ServiceOps new (Next quarter), ServiceOps-SharedInfra, API-Portal, Tech-Docs-Team
Clement_Goubert added a subtask for T418494: Delete the API Portal wiki: T418492: Redirect API Portal wiki URLs to www.mediawiki.org/wiki/Wikimedia_APIs.
Thu, Feb 26, 3:07 PM · ServiceOps new (Next quarter), ServiceOps-SharedInfra, Wiki-Setup (Close), API-Portal, Tech-Docs-Team
Clement_Goubert added a parent task for T418492: Redirect API Portal wiki URLs to www.mediawiki.org/wiki/Wikimedia_APIs: T418494: Delete the API Portal wiki.
Thu, Feb 26, 3:06 PM · ServiceOps new (Next quarter), ServiceOps-SharedInfra, API-Portal, Tech-Docs-Team
Clement_Goubert changed the status of T418494: Delete the API Portal wiki, a subtask of T415293: Shut down the API Portal, from Open to Stalled.
Thu, Feb 26, 3:06 PM · User-notice, Wiki-Setup (Close), API-Portal, Tech-Docs-Team
Clement_Goubert changed the status of T418494: Delete the API Portal wiki from Open to Stalled.

Unstall once T418492: Redirect API Portal wiki URLs to www.mediawiki.org/wiki/Wikimedia_APIs is ready to do.

Thu, Feb 26, 3:05 PM · ServiceOps new (Next quarter), ServiceOps-SharedInfra, Wiki-Setup (Close), API-Portal, Tech-Docs-Team
Clement_Goubert created T418494: Delete the API Portal wiki.
Thu, Feb 26, 3:04 PM · ServiceOps new (Next quarter), ServiceOps-SharedInfra, Wiki-Setup (Close), API-Portal, Tech-Docs-Team
Clement_Goubert moved T418492: Redirect API Portal wiki URLs to www.mediawiki.org/wiki/Wikimedia_APIs from Inbox to Backlog on the ServiceOps new board.
Thu, Feb 26, 3:01 PM · ServiceOps new (Next quarter), ServiceOps-SharedInfra, API-Portal, Tech-Docs-Team
Clement_Goubert added projects to T418492: Redirect API Portal wiki URLs to www.mediawiki.org/wiki/Wikimedia_APIs: ServiceOps new, ServiceOps-SharedInfra.
Thu, Feb 26, 3:01 PM · ServiceOps new (Next quarter), ServiceOps-SharedInfra, API-Portal, Tech-Docs-Team
Clement_Goubert changed the status of T418492: Redirect API Portal wiki URLs to www.mediawiki.org/wiki/Wikimedia_APIs, a subtask of T415293: Shut down the API Portal, from Open to Stalled.
Thu, Feb 26, 3:00 PM · User-notice, Wiki-Setup (Close), API-Portal, Tech-Docs-Team
Clement_Goubert changed the status of T418492: Redirect API Portal wiki URLs to www.mediawiki.org/wiki/Wikimedia_APIs from Open to Stalled.

Setting stalled for @apaskulin to unstall when ready for us to implement.

Thu, Feb 26, 3:00 PM · ServiceOps new (Next quarter), ServiceOps-SharedInfra, API-Portal, Tech-Docs-Team
Clement_Goubert created T418492: Redirect API Portal wiki URLs to www.mediawiki.org/wiki/Wikimedia_APIs.
Thu, Feb 26, 2:58 PM · ServiceOps new (Next quarter), ServiceOps-SharedInfra, API-Portal, Tech-Docs-Team
Clement_Goubert added a comment to T418188: Simplify static Restbase json spec file configuration.

Change #1242576 merged by jenkins-bot:

[operations/deployment-charts@master] Simplify spec-json-wikimedia route and use meta.wikimedia.org

https://gerrit.wikimedia.org/r/1242576

Thu, Feb 26, 11:24 AM · MW-Interfaces-Team (MWI-Sprint-28 (2026-02-24 to 2026-03-10))
Clement_Goubert added a comment to T414112: Deploy instance of hoarde as linked-artifacts(?) in k8s.

I was misled by the namespace changes caused by the puppet patch, the namespace definition itself wasn't merged. I just merged and deployed it this morning.

Thu, Feb 26, 10:50 AM · ServiceOps-Services-Oids, ServiceOps new, User-Eevans, Patch-For-Review, Data-Persistence

Wed, Feb 25

Clement_Goubert added a comment to T418212: Automate the creation of implementation task from rack/setup/install tasks for Serviceops.

Thanks Blake. All those hosts were cross checked by Clement (thanks!) and now have corresponding tasks in our spreadsheet tracker.

DC-Ops could you double check if the following conditions would work for a Herald Rule:

When a task:

  • has rack/setup/install {hosts} in its task name
  • has the tags #DC-Ops and #serviceops
  • has its status becoming Resolved

Then:

  • create a task Implementation {hosts} with tags #serviceops and #serviceops-upgrades-hardware under the same parent

Once confirmed I'll request one of our Phab admin to create it

s/#serviceops/#serviceops-new/g

Please note I didn't know about #serviceops-new but I can adjust my workflow accordingly and when I create a racking task use serviceops-new rather than serviceops.

Wed, Feb 25, 5:37 PM · ServiceOps-Upgrades-Hardware, serviceops-tooling, ServiceOps new, DC-Ops
Clement_Goubert added a comment to T414112: Deploy instance of hoarde as linked-artifacts(?) in k8s.

Change #1243850 had a related patch set uploaded (by Eevans; author: Eevans):

[operations/deployment-charts@master] admin_ng: add namespace for linked-artifacts

https://gerrit.wikimedia.org/r/1243850

Wed, Feb 25, 5:35 PM · ServiceOps-Services-Oids, ServiceOps new, User-Eevans, Patch-For-Review, Data-Persistence
Clement_Goubert triaged T418383: Investigate mw-on-k8s statsd-exporter RAM usage pattern as Low priority.
Wed, Feb 25, 4:02 PM · MW-on-K8s, ServiceOps-Mediawiki, ServiceOps new
Clement_Goubert moved T418383: Investigate mw-on-k8s statsd-exporter RAM usage pattern from Inbox to In Progress on the ServiceOps new board.
Wed, Feb 25, 4:00 PM · MW-on-K8s, ServiceOps-Mediawiki, ServiceOps new
Clement_Goubert added a comment to T418383: Investigate mw-on-k8s statsd-exporter RAM usage pattern.

Since this behaviour was apparent on multiple mw-on-k8s deployments, and disappeared with more replicas, it's very likely that it's load related and not an actual memory leak. This should be confirmed if we get either longer time to oomkill or the behaviour disappears with more replicas. If the behaviour disappears, no more changes should be needed. If the time to oomkill is longer, we should raise the memory limits until the asymptote reached by the RAM usage curves doesn't hit the limit.

Wed, Feb 25, 3:05 PM · MW-on-K8s, ServiceOps-Mediawiki, ServiceOps new
Clement_Goubert created T418383: Investigate mw-on-k8s statsd-exporter RAM usage pattern.
Wed, Feb 25, 2:58 PM · MW-on-K8s, ServiceOps-Mediawiki, ServiceOps new
Clement_Goubert updated subscribers of T411771: Migrate PageViewInfo calls away from rest-gateway.

When rate limits on "internal" traffic (WME/WMCS/etc) start being enforced on the rest-gateway, PageViewInfo will get rate limited as well unless we add an exception just for this extension.

Wed, Feb 25, 12:53 PM · Data-Engineering (Q3 FY25/26 January 1st - March 31th), ServiceOps-SharedInfra, ServiceOps new, PageViewInfo
Clement_Goubert added a comment to T418262: deploy2003 implementation tracking.

I agree that the second option makes the most sense with the overall strategy, as well as waiting until after the switchover (while the implementation is very late, it's only one host that is not being decommissioned).

Wed, Feb 25, 12:08 PM · ServiceOps new (Next quarter), ServiceOps-Upgrades-Hardware
Clement_Goubert added a comment to T418212: Automate the creation of implementation task from rack/setup/install tasks for Serviceops.

Thanks Blake. All those hosts were cross checked by Clement (thanks!) and now have corresponding tasks in our spreadsheet tracker.

DC-Ops could you double check if the following conditions would work for a Herald Rule:

When a task:

  • has rack/setup/install {hosts} in its task name
  • has the tags #DC-Ops and #serviceops
  • has its status becoming Resolved

Then:

  • create a task Implementation {hosts} with tags #serviceops and #serviceops-upgrades-hardware under the same parent

Once confirmed I'll request one of our Phab admin to create it

Wed, Feb 25, 11:46 AM · ServiceOps-Upgrades-Hardware, serviceops-tooling, ServiceOps new, DC-Ops
Clement_Goubert added a comment to T364245: Recentchanges and cu_changes tables are occasionally missing revisions on multiple wikis.

It would be nice if we had a diagram or explanation of the whole request flow, including every TCP connection. I see Effie's diagrams at c:Category:Wikimedia servers diagrams with their "it's complicated" boxes, and Timo's backend-focused diagrams, but none of them are really what I need. I've looked into the Envoy service inside the k8s pod in the past, so I should probably have been able to answer my own question, but I do get rusty when I don't have to touch this stuff for a while.

I've made a quick one, omitting LVS, the rest-gateway for API calls, and non-request-flow sidecars. Hope it helps.

mw-on-k8s connection diagram.drawio.png (851×491 px, 65 KB)

This diagram is great. Would encourage you to upload it and insert it into a Wikitech page somewhere :)

Wed, Feb 25, 11:07 AM · ServiceOps new, MW-on-K8s, MediaWiki-Recent-changes

Tue, Feb 24

Clement_Goubert moved T418257: wikikube-worker13[28-34] implementation tracking from Inbox to Scheduled (this Q) on the ServiceOps new board.
Tue, Feb 24, 4:08 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert moved T418258: wikikube-worker13[60-72] implementation tracking from Inbox to Scheduled (this Q) on the ServiceOps new board.
Tue, Feb 24, 4:08 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert moved T418259: wikikube-worker13[35-59] implementation tracking from Inbox to Scheduled (this Q) on the ServiceOps new board.
Tue, Feb 24, 4:08 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert moved T418263: mc10[55-72] implementation tracking from Inbox to Needs Info / Blocked on the ServiceOps new board.
Tue, Feb 24, 4:07 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert moved T418261: rdb20[11-12] implementation tracking from Inbox to Scheduled (this Q) on the ServiceOps new board.
Tue, Feb 24, 4:07 PM · ServiceOps new (Next quarter), ServiceOps-Upgrades-Hardware
Clement_Goubert moved T418262: deploy2003 implementation tracking from Inbox to Scheduled (this Q) on the ServiceOps new board.
Tue, Feb 24, 4:07 PM · ServiceOps new (Next quarter), ServiceOps-Upgrades-Hardware
Clement_Goubert triaged T418257: wikikube-worker13[28-34] implementation tracking as Medium priority.
Tue, Feb 24, 4:07 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert triaged T418258: wikikube-worker13[60-72] implementation tracking as Medium priority.
Tue, Feb 24, 4:07 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert triaged T418259: wikikube-worker13[35-59] implementation tracking as Medium priority.
Tue, Feb 24, 4:07 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert triaged T418261: rdb20[11-12] implementation tracking as High priority.
Tue, Feb 24, 4:06 PM · ServiceOps new (Next quarter), ServiceOps-Upgrades-Hardware
Clement_Goubert changed the status of T418263: mc10[55-72] implementation tracking, a subtask of T412255: Q2:rack/setup/install mc1055-72, from Open to Stalled.
Tue, Feb 24, 4:06 PM · ServiceOps-Upgrades-Hardware, ServiceOps new, User-jijiki, DC-Ops, ops-eqiad
Clement_Goubert changed the status of T418263: mc10[55-72] implementation tracking from Open to Stalled.
Tue, Feb 24, 4:06 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert renamed T418262: deploy2003 implementation tracking from deploy2003 implentation tracking to deploy2003 implementation tracking.
Tue, Feb 24, 4:05 PM · ServiceOps new (Next quarter), ServiceOps-Upgrades-Hardware
Clement_Goubert created T418263: mc10[55-72] implementation tracking.
Tue, Feb 24, 3:54 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert created T418262: deploy2003 implementation tracking.
Tue, Feb 24, 3:52 PM · ServiceOps new (Next quarter), ServiceOps-Upgrades-Hardware
Clement_Goubert renamed T418261: rdb20[11-12] implementation tracking from rdb200[7-8] implementation tracking to rdb20[11-12] implementation tracking.
Tue, Feb 24, 3:45 PM · ServiceOps new (Next quarter), ServiceOps-Upgrades-Hardware
Clement_Goubert created T418261: rdb20[11-12] implementation tracking.
Tue, Feb 24, 3:45 PM · ServiceOps new (Next quarter), ServiceOps-Upgrades-Hardware
Clement_Goubert updated the task description for T418258: wikikube-worker13[60-72] implementation tracking.
Tue, Feb 24, 3:40 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert updated the task description for T418257: wikikube-worker13[28-34] implementation tracking.
Tue, Feb 24, 3:39 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert created T418259: wikikube-worker13[35-59] implementation tracking.
Tue, Feb 24, 3:36 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert renamed T418258: wikikube-worker13[60-72] implementation tracking from wikikube-worker13[35-59] implementation tracking to wikikube-worker13[60-72] implementation tracking.
Tue, Feb 24, 3:36 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert created T418258: wikikube-worker13[60-72] implementation tracking.
Tue, Feb 24, 3:35 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert created T418257: wikikube-worker13[28-34] implementation tracking.
Tue, Feb 24, 3:34 PM · ServiceOps-Upgrades-Hardware, ServiceOps new
Clement_Goubert closed T400871: Reimage sretest2009 as a wikikube worker and assess performance, a subtask of T396365: Q4:rack/setup/install sretest2009, as Declined.
Tue, Feb 24, 2:58 PM · SRE, DC-Ops, ops-codfw