Page MenuHomePhabricator

jijiki (effie mouzeli)
is an animal

Projects (16)

Today

  • No visible events.

Tomorrow

  • No visible events.

Monday

  • No visible events.

User Details

User Since
Aug 14 2018, 10:50 AM (381 w, 3 d)
Availability
Available
IRC Nick
effie
LDAP User
Effie Mouzeli
MediaWiki User
EMouzeli (WMF) [ Global Accounts ]

Recent Activity

Thu, Dec 4

jijiki updated the task description for T410626: WE6.2.6: ☂️ hcaptcha-proxy Production Readiness Review.
Thu, Dec 4, 4:17 PM · serviceops

Mon, Dec 1

jijiki updated the task description for T411202: Proof of Concept: SquareOne Dashboards.
Mon, Dec 1, 10:18 AM · serviceops

Fri, Nov 28

jijiki added a comment to T410696: Deploy enwiki edition of 2025 GRS.

Hello! Since 1212204 was backported, it has been producing thousands of error messages https://logstash.wikimedia.org/goto/103ab3d23a65b901740f79fc62e71e9a

Fri, Nov 28, 12:59 PM · Research (FY2025-26-Research-October-December)

Thu, Nov 27

jijiki changed the status of T411204: Draft Guided Dashboards Design Proposal from Open to In Progress.
Thu, Nov 27, 4:37 PM · SRE Observability, serviceops
jijiki updated the task description for T411202: Proof of Concept: SquareOne Dashboards.
Thu, Nov 27, 4:37 PM · serviceops
jijiki added a subtask for T411204: Draft Guided Dashboards Design Proposal: T411202: Proof of Concept: SquareOne Dashboards.
Thu, Nov 27, 4:36 PM · SRE Observability, serviceops
jijiki added a parent task for T411202: Proof of Concept: SquareOne Dashboards: T411204: Draft Guided Dashboards Design Proposal.
Thu, Nov 27, 4:36 PM · serviceops
jijiki created T411204: Draft Guided Dashboards Design Proposal.
Thu, Nov 27, 4:36 PM · SRE Observability, serviceops
jijiki changed the status of T411202: Proof of Concept: SquareOne Dashboards from Open to In Progress.
Thu, Nov 27, 4:23 PM · serviceops
jijiki created T411202: Proof of Concept: SquareOne Dashboards.
Thu, Nov 27, 4:22 PM · serviceops
jijiki renamed T410626: WE6.2.6: ☂️ hcaptcha-proxy Production Readiness Review from WE6.2.6: ☂️ Proxoid Production Readiness Review to WE6.2.6: ☂️ hcaptcha-proxy Production Readiness Review.
Thu, Nov 27, 8:42 AM · serviceops

Wed, Nov 26

jijiki closed T317340: Incident: 2022-09-08 codfw appservers degradation as Resolved.

Bluntly closing

Wed, Nov 26, 11:11 AM · serviceops

Tue, Nov 25

jijiki closed T408138: cxserver: Remove Yandex MT key from production, a subtask of T407345: cxserver: Yandex MT service failure, as Resolved.
Tue, Nov 25, 3:46 PM · Unplanned-Sprint-Work, LPL Projects (Other), LPL Essential (2025 Jul-Oct), CXServer
jijiki closed T408138: cxserver: Remove Yandex MT key from production as Resolved.

Key has been removed from puppet, please reopen if something is not right

Tue, Nov 25, 3:46 PM · serviceops, Unplanned-Sprint-Work, LPL Projects (Other), CXServer
jijiki added a comment to T408138: cxserver: Remove Yandex MT key from production.

@KartikMistry I will have a look, sorry for that

Tue, Nov 25, 10:53 AM · serviceops, Unplanned-Sprint-Work, LPL Projects (Other), CXServer

Fri, Nov 21

jijiki created T410722: Grey out submit buttons after submitting.
Fri, Nov 21, 12:13 PM · Hiddenparma

Thu, Nov 20

jijiki moved T410626: WE6.2.6: ☂️ hcaptcha-proxy Production Readiness Review from Incoming 🐫 to this.quarter 🍕 on the serviceops board.
Thu, Nov 20, 1:52 PM · serviceops
jijiki renamed T410626: WE6.2.6: ☂️ hcaptcha-proxy Production Readiness Review from ☂️ Proxoid Production Readiness Review to WE6.2.6: ☂️ Proxoid Production Readiness Review.
Thu, Nov 20, 1:25 PM · serviceops
jijiki created T410626: WE6.2.6: ☂️ hcaptcha-proxy Production Readiness Review.
Thu, Nov 20, 1:22 PM · serviceops
jijiki edited P85414 Production Readiness Feedback.
Thu, Nov 20, 1:21 PM · serviceops
jijiki created P85414 Production Readiness Feedback.
Thu, Nov 20, 1:21 PM · serviceops

Wed, Nov 19

jijiki closed T410506: New SSH keys for effie as Resolved.
Wed, Nov 19, 3:40 PM · SRE, SRE-Access-Requests
jijiki created T410506: New SSH keys for effie.
Wed, Nov 19, 1:18 PM · SRE, SRE-Access-Requests

Mon, Nov 17

jijiki added a comment to T409469: Enable ChangeProp to consume mediawiki.page_content_change.v1.

Thank you for the discussion everyone! Reading through, I would suggest proceeding with Option D for the time being. This approach not only unblocks the work without requiring any significant changes to Kafka, but also allows us to observe the workflow in practice and better understand its requirements.
That said, we can later define a set of performance expectations (eg for latency), which will then help us to assess whether any of the other options would provide sufficient benefit to justify any additional efforts.

Mon, Nov 17, 11:14 AM · Data-Engineering, serviceops, Machine-Learning-Team

Thu, Nov 13

jijiki renamed T410038: Alert Management Review and Improvement for ServiceOps from Update MediaWiki and ServiceOps alerts to Alert Management Review and Improvement for ServiceOps.
Thu, Nov 13, 2:07 PM · serviceops
jijiki created T410038: Alert Management Review and Improvement for ServiceOps.
Thu, Nov 13, 2:07 PM · serviceops

Tue, Nov 11

jijiki added a comment to T386371: Request capacity increase in preparation for MinT for wiki Readers experiment.

Looking at the memory usage

Tue, Nov 11, 1:00 PM · LPL Projects (Other), LPL Essential (FY2025-26 Q2), MinT
jijiki closed T409181: Memcached Cluster Capacity Planning for eqiad as Resolved.

Yesterday we decided to go for option B, and assess if an expansion will be needed sometime in the next quarters

Tue, Nov 11, 8:34 AM · serviceops

Nov 4 2025

jijiki created T409181: Memcached Cluster Capacity Planning for eqiad.
Nov 4 2025, 1:57 PM · serviceops
jijiki moved T408749: Q2:rack/setup/install wikikube-worker11XX from Incoming 🐫 to 🛠 Upgrades and Hardware on the serviceops board.
Nov 4 2025, 12:49 PM · SRE, ops-eqiad, serviceops, DC-Ops
jijiki moved T408752: Q2:rack/setup/install wikikube-worker1335-59 from Incoming 🐫 to 🛠 Upgrades and Hardware on the serviceops board.
Nov 4 2025, 12:49 PM · SRE, ops-eqiad, serviceops, DC-Ops
jijiki moved T408757: Q2:rack/setup/install wikikube-worker2332-56 from Incoming 🐫 to 🛠 Upgrades and Hardware on the serviceops board.
Nov 4 2025, 12:49 PM · SRE, ops-codfw, serviceops, DC-Ops
jijiki moved T408760: Q2:rack/setup/install wikikube-worker refresh from Incoming 🐫 to 🛠 Upgrades and Hardware on the serviceops board.
Nov 4 2025, 12:47 PM · ops-eqiad, serviceops, DC-Ops, SRE
jijiki claimed T397498: Deprecate mwdebugXXXX hosts.
Nov 4 2025, 12:44 PM · Release-Engineering-Team (Radar), Patch-For-Review, MediaWiki-Engineering, serviceops
jijiki closed T397498: Deprecate mwdebugXXXX hosts as Resolved.
Nov 4 2025, 12:44 PM · Release-Engineering-Team (Radar), Patch-For-Review, MediaWiki-Engineering, serviceops
jijiki triaged T408138: cxserver: Remove Yandex MT key from production as Low priority.
Nov 4 2025, 12:02 PM · serviceops, Unplanned-Sprint-Work, LPL Projects (Other), CXServer

Oct 31 2025

jijiki updated the task description for T327663: Create a visual representation of where each service is active from, any given time.
Oct 31 2025, 4:24 PM · serviceops, observability
jijiki updated the task description for T408925: Create a cookbook for memcached management.
Oct 31 2025, 3:31 PM · Patch-For-Review, serviceops
jijiki created T408925: Create a cookbook for memcached management.
Oct 31 2025, 3:29 PM · Patch-For-Review, serviceops
jijiki created T408916: Enable extstore to gutter pool cluster.
Oct 31 2025, 12:41 PM · serviceops

Oct 24 2025

jijiki updated subscribers of T407723: Sendmail network error (deployment).
Oct 24 2025, 3:58 PM · serviceops, Infrastructure-Foundations, Mail, SRE
jijiki claimed T408138: cxserver: Remove Yandex MT key from production.
Oct 24 2025, 3:46 PM · serviceops, Unplanned-Sprint-Work, LPL Projects (Other), CXServer

Oct 23 2025

jijiki added a comment to T405688: Support shell to mw-experimental pod.

@Krinkle Please have a go running P84273 on the deployment host, and let me know if it helps

Oct 23 2025, 10:35 AM · MW-on-K8s, serviceops
jijiki created P84273 mw-experimental-shell.
Oct 23 2025, 10:33 AM

Oct 21 2025

jijiki closed T407615: Alert in need of triage: ProbeDown (instance proxoid:4260) as Resolved.
Oct 21 2025, 12:07 PM · serviceops, sre-alert-triage
jijiki added a comment to T407094: Requesting access to analytics-privatedata-users for SKaram-WMF.

confirmed oob

Oct 21 2025, 8:39 AM · SRE, SRE-Access-Requests
jijiki added a comment to T406927: Requesting access to fr-tech-devs for lsandergreen.

confirmed oob

Oct 21 2025, 8:39 AM · SRE, SRE-Access-Requests

Oct 20 2025

jijiki closed T407732: ΤΒΑ as Invalid.
Oct 20 2025, 10:36 AM
jijiki created T407732: ΤΒΑ.
Oct 20 2025, 9:19 AM

Oct 16 2025

jijiki closed T405917: Requesting access to Superset for marialechnerwmde as Resolved.

This is sorted

Oct 16 2025, 11:15 AM · LDAP-Access-Requests, SRE, SRE-Access-Requests
jijiki added a comment to T406927: Requesting access to fr-tech-devs for lsandergreen.

pinged user for out of band key confirmation

Oct 16 2025, 9:52 AM · SRE, SRE-Access-Requests
jijiki added a comment to T407094: Requesting access to analytics-privatedata-users for SKaram-WMF.

Pinged user for out of band key verification

Oct 16 2025, 9:48 AM · SRE, SRE-Access-Requests
jijiki assigned T406243: Requesting access to deployment for VolkerE to Volker_E.
Oct 16 2025, 9:44 AM · SRE, SRE-Access-Requests
jijiki closed T406106: Grant Access to wmde and nda for Maria Lechner WMDE as Resolved.

Added to nda and wmde ldap groups.

Oct 16 2025, 9:34 AM · Patch-For-Review, SRE, LDAP-Access-Requests
jijiki changed the status of T406590: Requesting access to 'restricted' for neslihanturan from Open to In Progress.
Oct 16 2025, 9:05 AM · SRE-Access-Requests, SRE
jijiki changed the status of T406592: Requesting access to 'deployment' for seanleong-wmde from Open to In Progress.
Oct 16 2025, 9:05 AM · SRE, SRE-Access-Requests

Sep 29 2025

jijiki closed Restricted Task, a subtask of T402366: hCaptcha account creation trial deployment tracker, as Resolved.
Sep 29 2025, 7:44 AM · Product Safety and Integrity (Sprint Apfel Strudel (Sep 29 - Oct 17)), WE4.2 Bot detection (WE4.2 hCaptcha account creation trial)
jijiki added a subtask for T402366: hCaptcha account creation trial deployment tracker: Unknown Object (Task).
Sep 29 2025, 7:41 AM · Product Safety and Integrity (Sprint Apfel Strudel (Sep 29 - Oct 17)), WE4.2 Bot detection (WE4.2 hCaptcha account creation trial)
jijiki added a subtask for T402366: hCaptcha account creation trial deployment tracker: T404184: Investigate options for per-wiki, percentage-based rollout of hCaptcha.
Sep 29 2025, 7:41 AM · Product Safety and Integrity (Sprint Apfel Strudel (Sep 29 - Oct 17)), WE4.2 Bot detection (WE4.2 hCaptcha account creation trial)
jijiki edited parent tasks for T404184: Investigate options for per-wiki, percentage-based rollout of hCaptcha, added: T402366: hCaptcha account creation trial deployment tracker; removed: Unknown Object (Task).
Sep 29 2025, 7:41 AM · serviceops, Traffic, WE4.2 Bot detection (WE4.2 hCaptcha account creation trial)

Sep 25 2025

jijiki added a comment to T404204: Investigate options for automatic fallback to FancyCAPTCHA.

For this particular approach, I'd like to avoid mixing status updates from the maintenance script (scheduled, predictable) and end user requests (unscheduled / unpredictable). I would be happy to explore adding in production requests to the overall health check system, but think it would be easier to start with something simple, using just the maintenance script for setting status.

But it's already a pattern we use elsewhere, for example https://github.com/wikimedia/mediawiki-extensions-TorBlock/blob/master/includes/TorExitNodes.php#L70-L91 where we will wait on the request, while we load from an external service.

Yeah, I would like to avoid introducing a maintenance script dependency if possible. I'll have a look at handling this in the request path, then.

Sep 25 2025, 4:52 PM · MW-1.45-notes (1.45.0-wmf.22; 2025-10-07), Product Safety and Integrity (Sprint Apfel Strudel (Sep 29 - Oct 17)), SRE, WE4.2 Bot detection (WE4.2 hCaptcha account creation trial)
jijiki added a comment to T404204: Investigate options for automatic fallback to FancyCAPTCHA.

My concerns with the cronjob approach are the following:

  • Potentially the time it takes to spawn a pod to run the cronjob is longer than the time it takes to run a check
    • upstream latency appears to have a p50 of ~60ms and a p99 of ~500ms
  • 5m minutes is a very long time to either make an assumption that hcaptcha is up, but in a similar matter, that it is down
  • If the memcached node holding this key fails, the key will be lost
    • The cluster will failover pretty quickly to a spare memcached node, but this node will be cold
    • Until the next cronjob, we will be operating using the default (which one?)
Sep 25 2025, 9:11 AM · MW-1.45-notes (1.45.0-wmf.22; 2025-10-07), Product Safety and Integrity (Sprint Apfel Strudel (Sep 29 - Oct 17)), SRE, WE4.2 Bot detection (WE4.2 hCaptcha account creation trial)

Sep 23 2025

jijiki added a comment to T402366: hCaptcha account creation trial deployment tracker.

Please keep in mind that tomorrow we will be performing the T399891 Southward Datacenter Switchover @ 15:00 UTC. As Phase 2 has been scheduled during the UTC morning backport window, it should not affect this work.

Sep 23 2025, 11:09 AM · Product Safety and Integrity (Sprint Apfel Strudel (Sep 29 - Oct 17)), WE4.2 Bot detection (WE4.2 hCaptcha account creation trial)
jijiki added a subtask for T402366: hCaptcha account creation trial deployment tracker: Unknown Object (Task).
Sep 23 2025, 10:20 AM · Product Safety and Integrity (Sprint Apfel Strudel (Sep 29 - Oct 17)), WE4.2 Bot detection (WE4.2 hCaptcha account creation trial)

Sep 16 2025

jijiki added a comment to T403829: hCaptcha: Self-host secure-api.js code.

For licensing reasons, as discussed in 1185314 (and to my knowledge as well), we are unable to host any proprietary code to our repos. I am afraid, same principle applies to our puppet repo as well.

Sep 16 2025, 7:53 AM · ConfirmEdit (CAPTCHA extension)

Sep 10 2025

jijiki added a parent task for T404184: Investigate options for per-wiki, percentage-based rollout of hCaptcha: Unknown Object (Task).
Sep 10 2025, 3:40 PM · serviceops, Traffic, WE4.2 Bot detection (WE4.2 hCaptcha account creation trial)
jijiki added projects to T404184: Investigate options for per-wiki, percentage-based rollout of hCaptcha: Traffic, serviceops.
Sep 10 2025, 3:39 PM · serviceops, Traffic, WE4.2 Bot detection (WE4.2 hCaptcha account creation trial)

Sep 5 2025

jijiki updated the task description for T397841: Implement an hCaptcha IP blinding proxy prototype.
Sep 5 2025, 3:15 PM · Trust and Safety Product Sprint, SecTeam-Processed, Security-Team, Security, WE4.2 Bot detection (WE4.2 hCaptcha account creation trial)
jijiki added a parent task for T397841: Implement an hCaptcha IP blinding proxy prototype: Unknown Object (Task).
Sep 5 2025, 2:07 PM · Trust and Safety Product Sprint, SecTeam-Processed, Security-Team, Security, WE4.2 Bot detection (WE4.2 hCaptcha account creation trial)

Sep 4 2025

jijiki added a comment to T401425: Investigate memcache errors during wikidata and commons dumps runs.

In wikikube we run a mw-mcrouter daemonset to interact with the memcached cluster. The dse cluster does not run mcrouter as a daemonset (and it wouldnt make sense to do so). Using the default MCROUTER_SERVER environment variable (which was incorrect) led to multiple memcached errors and may (unsure) have contributed to delays in producing dumps.

Sep 4 2025, 5:32 PM · Essential-Work, Data-Platform-SRE (2025.09.05 - 2025.09.26), MW-on-K8s, Dumps-Generation, serviceops

Sep 3 2025

jijiki created T403608: Monitor Template and Module changes on large wikis and ship events to Logstash.
Sep 3 2025, 10:48 AM · serviceops
jijiki renamed T403606: Increased response times linked to Module:Citation/CS1 changes from Increased latency on mw-web, mw-api-int, mw-ext-api to Increased response times linked to Module:Citation/CS1 changes.
Sep 3 2025, 10:34 AM · serviceops
jijiki created T403606: Increased response times linked to Module:Citation/CS1 changes.
Sep 3 2025, 10:32 AM · serviceops

Sep 2 2025

jijiki lowered the priority of T402181: Deploy Temporary accounts to all remaining small-sized projects from Unbreak Now! to High.

After discussing with @Dreamy_Jazz @kostajh and @Ladsgroup, we have doubts it the above observations are caused by the change, so I am lowering the priority while we investigate.

Sep 2 2025, 4:04 PM · Trust and Safety Product Sprint (Sprint Princess Tarta (August 18 - September 5)), OKR-Work, Trust and Safety Product Team, Temporary accounts
jijiki changed the status of T396217: Document that groups with IP reveal rights must not be changed without making changes to the cache for Special:GlobalContributions from Open to In Progress.
Sep 2 2025, 3:43 PM · MW-1.45-notes (1.45.0-wmf.16; 2025-08-26), Trust and Safety Product Sprint (Sprint Princess Tarta (August 18 - September 5)), OKR-Work, Temporary accounts (Global wiki rollout), CheckUser-GlobalContributions, Trust and Safety Product Team
jijiki reopened T402181: Deploy Temporary accounts to all remaining small-sized projects, a subtask of T340001: [Epic] Deployment plan for Temporary Accounts, as In Progress.
Sep 2 2025, 3:43 PM · Patch-For-Review, Product Safety and Integrity, Epic, Temporary accounts
jijiki reopened T402181: Deploy Temporary accounts to all remaining small-sized projects as "In Progress".

Hey folks, after the deployment at ~13:15 UTC, I observed the following

  • Increase in memcached traffic

~40% in write traffic

image.png (548×1 px, 97 KB)

Sep 2 2025, 3:43 PM · Trust and Safety Product Sprint (Sprint Princess Tarta (August 18 - September 5)), OKR-Work, Trust and Safety Product Team, Temporary accounts
jijiki added a comment to T396217: Document that groups with IP reveal rights must not be changed without making changes to the cache for Special:GlobalContributions.

Sorry for the mixup, I will reopen T402181 and remove my comment!

Sep 2 2025, 3:39 PM · MW-1.45-notes (1.45.0-wmf.16; 2025-08-26), Trust and Safety Product Sprint (Sprint Princess Tarta (August 18 - September 5)), OKR-Work, Temporary accounts (Global wiki rollout), CheckUser-GlobalContributions, Trust and Safety Product Team
jijiki updated subscribers of T396217: Document that groups with IP reveal rights must not be changed without making changes to the cache for Special:GlobalContributions.
Sep 2 2025, 3:31 PM · MW-1.45-notes (1.45.0-wmf.16; 2025-08-26), Trust and Safety Product Sprint (Sprint Princess Tarta (August 18 - September 5)), OKR-Work, Temporary accounts (Global wiki rollout), CheckUser-GlobalContributions, Trust and Safety Product Team
jijiki raised the priority of T396217: Document that groups with IP reveal rights must not be changed without making changes to the cache for Special:GlobalContributions from High to Unbreak Now!.

<comment removed as it belongs to T402181>

Sep 2 2025, 3:25 PM · MW-1.45-notes (1.45.0-wmf.16; 2025-08-26), Trust and Safety Product Sprint (Sprint Princess Tarta (August 18 - September 5)), OKR-Work, Temporary accounts (Global wiki rollout), CheckUser-GlobalContributions, Trust and Safety Product Team

Aug 6 2025

jijiki claimed T397841: Implement an hCaptcha IP blinding proxy prototype.
Aug 6 2025, 1:56 PM · Trust and Safety Product Sprint, SecTeam-Processed, Security-Team, Security, WE4.2 Bot detection (WE4.2 hCaptcha account creation trial)
jijiki created T401307: add metadata.label.team to app.job module.
Aug 6 2025, 11:32 AM · serviceops
jijiki changed the status of T374350: Thumbor workers hang indefinitely when conducting some tiff operations, leading to user-facing error from Open to Stalled.

The bandaid works for now. We the timeout command first sends a SIGTERM (which is being ignored/blocked as we saw earlier), and 5s later we sends a SIGKILL which, well, can't be ignored. This by far is not addressing the underline issue, however, it improves the availability of the service.

Aug 6 2025, 10:14 AM · Structured-Data-Backlog, serviceops, Thumbor

Aug 4 2025

jijiki lowered the priority of T401107: etcdserver: mvcc: database space exceeded from Unbreak Now! to High.
Aug 4 2025, 5:23 PM · Patch-For-Review, serviceops, Prod-Kubernetes, Wikimedia-production-error
jijiki updated subscribers of T401107: etcdserver: mvcc: database space exceeded.

Digging deeper with @Clement_Goubert and @Scott_French, we found that a vast number of mw-script jobs was created (not on purpose) around the 30th and 31st of July. This lead to the creation of thousand objects, jobs, and pods, that were not cleaned up yet, occupying quite a lot of etcd space, leading to this production error.

Aug 4 2025, 5:14 PM · Patch-For-Review, serviceops, Prod-Kubernetes, Wikimedia-production-error
jijiki closed T338220: Page only if videoscalers are unavailable for longer than the default time as Invalid.

moved to k8s

Aug 4 2025, 2:56 PM · serviceops, Sustainability (Incident Followup)
jijiki moved T401107: etcdserver: mvcc: database space exceeded from Incoming 🐫 to Production Errors 🚜 on the serviceops board.
Aug 4 2025, 2:55 PM · Patch-For-Review, serviceops, Prod-Kubernetes, Wikimedia-production-error
jijiki created T401111: Apertium processes become zombies.
Aug 4 2025, 12:10 PM · Apertium
jijiki updated the task description for T401107: etcdserver: mvcc: database space exceeded.
Aug 4 2025, 11:22 AM · Patch-For-Review, serviceops, Prod-Kubernetes, Wikimedia-production-error
jijiki created T401107: etcdserver: mvcc: database space exceeded.
Aug 4 2025, 10:39 AM · Patch-For-Review, serviceops, Prod-Kubernetes, Wikimedia-production-error

Aug 1 2025

jijiki triaged T400969: Alert in need of triage: KubernetesWorkerUnschedulable as Low priority.
Aug 1 2025, 11:01 AM · serviceops, sre-alert-triage
jijiki changed the status of T400969: Alert in need of triage: KubernetesWorkerUnschedulable from Open to Stalled.

sorry folks, host's number is up for retirement, my bad. tx @Clement_Goubert

Aug 1 2025, 11:00 AM · serviceops, sre-alert-triage

Jul 31 2025

jijiki moved T397683: Make mw-mcrouter Pods use a higher priorityClass from Incoming 🐫 to this.quarter 🍕 on the serviceops board.
Jul 31 2025, 9:17 AM · Patch-For-Review, MW-on-K8s, Kubernetes, Prod-Kubernetes, serviceops
jijiki moved T400263: ☂️ [FY2025-26][Hypothesis] WE6.2.1 Production Readiness Checklist from Doing 😎 to this.quarter 🍕 on the serviceops board.
Jul 31 2025, 9:16 AM · serviceops
jijiki moved T400481: Create a template on wikitech from Incoming 🐫 to Doing 😎 on the serviceops board.
Jul 31 2025, 9:16 AM · serviceops
jijiki moved T400263: ☂️ [FY2025-26][Hypothesis] WE6.2.1 Production Readiness Checklist from Incoming 🐫 to Doing 😎 on the serviceops board.
Jul 31 2025, 9:16 AM · serviceops
jijiki removed projects from T397439: X-Wikimedia-Debug cookie not routed correctly in Kubernetes on POST requests: MW-on-K8s, serviceops.
Jul 31 2025, 9:15 AM · Traffic, MediaWiki-Platform-Team, WikimediaDebug
jijiki added a comment to T397439: X-Wikimedia-Debug cookie not routed correctly in Kubernetes on POST requests.

Given that the value of the X-Wikimedia-Debug header determines whether, and to which mw-debug or mw-experimental service, a request will be routed, I would guess that this POST request was missing this header.

Jul 31 2025, 9:14 AM · Traffic, MediaWiki-Platform-Team, WikimediaDebug

Jul 30 2025

jijiki added a comment to T374350: Thumbor workers hang indefinitely when conducting some tiff operations, leading to user-facing error.

Take the following with a grain of salt, attaching gdb on the hang convert process: It appears that one thread is stuck(?) during unlink(), assuming that it may be in the process of cleaning up. Additionally, during that operation it caught a signal, which again we could assume is it is the SIGTERM from timeout.

Jul 30 2025, 11:06 AM · Structured-Data-Backlog, serviceops, Thumbor

Jul 25 2025

jijiki updated the task description for T400481: Create a template on wikitech .
Jul 25 2025, 2:34 PM · serviceops