User Details
- User Since
- Jul 26 2022, 2:11 PM (189 w, 4 d)
- Availability
- Available
- IRC Nick
- claime
- LDAP User
- Clément Goubert
- MediaWiki User
- CGoubert-WMF [ Global Accounts ]
Fri, Mar 13
We have removed the rate limits on OPTIONS requests and I don't see anymore 429 responses to them in logs. Resolving, please feel free to reopen if the issue presents itself again.
Thu, Mar 12
Wed, Mar 11
I'll leave the MediaWiki internals for you to decide, but as far as what we want to achieve, yes, that sounds right. Given pageviews and unique-devices are now two services, the PageViewInfo extension needs to be able to address both individually through two different service mesh listeners.
Ack, thanks for following up.
Mon, Mar 9
Thu, Mar 5
Host back in the pool, thanks <3
Depending on exactly what you want this rewrite to do, it may be that an apache rule isn't the right choice here.
It's cordoned and depooled, fire away.
Wed, Mar 4
Updated task description with racking details and OS. Waiting for review on the puppet patch.
Updated racking details, changed OS to Trixie, puppet patches merged. All yours.
All yours.
Tue, Mar 3
Mon, Mar 2
This should now be unblocked.
Thu, Feb 26
Assigning to Scott for reshaping.
Moving to serviceops Radar as we have provided the required information.
Unstall once T418492: Redirect API Portal wiki URLs to www.mediawiki.org/wiki/Wikimedia_APIs is ready to do.
Setting stalled for @apaskulin to unstall when ready for us to implement.
I was misled by the namespace changes caused by the puppet patch, the namespace definition itself wasn't merged. I just merged and deployed it this morning.
Wed, Feb 25
Since this behaviour was apparent on multiple mw-on-k8s deployments, and disappeared with more replicas, it's very likely that it's load related and not an actual memory leak. This should be confirmed if we get either longer time to oomkill or the behaviour disappears with more replicas. If the behaviour disappears, no more changes should be needed. If the time to oomkill is longer, we should raise the memory limits until the asymptote reached by the RAM usage curves doesn't hit the limit.
When rate limits on "internal" traffic (WME/WMCS/etc) start being enforced on the rest-gateway, PageViewInfo will get rate limited as well unless we add an exception just for this extension.
I agree that the second option makes the most sense with the overall strategy, as well as waiting until after the switchover (while the implementation is very late, it's only one host that is not being decommissioned).
