User Details
- User Since
- May 15 2023, 9:41 AM (133 w, 6 d)
- Availability
- Available
- IRC Nick
- fabfur
- LDAP User
- Fabfur
- MediaWiki User
- FFurnari-WMF [ Global Accounts ]
Wed, Dec 3
Ranked bots paste has been superseded by the shared doc: https://docs.google.com/spreadsheets/d/1PKfAhcc2jXl72CbF73JXTeZMTbw_RtQnYJ6YZ4Fozyk/edit?gid=0#gid=0
Tue, Dec 2
Nov 5 2025
I don't think regexes should be automatically escaped: the user must always be in charge of deciding that (and automatically detecting what is a regex and what is not in an input box could be a pain). I agree that a big bold warning about escaped/unescaped regexes could be added to some fields.
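To illustrate why the escaped/unescaped distinction deserves a warning, here is a minimal Python sketch. The input value is hypothetical and this is not the tool's actual code; it only shows how the same string behaves when taken as a regex versus as a literal.

```python
import re

user_input = "10.0.0.1"  # hypothetical value typed into an input box

# Taken as a regex: "." matches any character, so look-alike strings match too.
as_regex = re.compile(user_input)

# Taken as a literal: re.escape() neutralizes the metacharacters first.
as_literal = re.compile(re.escape(user_input))

lookalike = "10a0b0c1"
regex_hits_lookalike = as_regex.fullmatch(lookalike) is not None
literal_hits_lookalike = as_literal.fullmatch(lookalike) is not None
literal_hits_exact = as_literal.fullmatch(user_input) is not None

print(regex_hits_lookalike, literal_hits_lookalike, literal_hits_exact)
```

Only the user knows which of the two behaviors was intended, which is the argument for leaving the choice explicit.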
Nov 4 2025
haproxy configuration deployed everywhere
Oct 27 2025
Oct 24 2025
Password has been reset by @ssingh for me, thanks anyway
A not-so-refined search on Turnilo produced this paste: P84293
We can even refine it later
Oct 23 2025
@Joe so, just to get a clearer picture of some points: some of these steps must be performed manually, while others should happen in an automated fashion, like:
Oct 1 2025
This has been superseded by more refined actions to exclude a broader class of "invalid" requests. As HTTP/1.0 requests are not invalid per se, we can consider this declined.
Sep 30 2025
The problem is that PyBal (twisted) defaults to an HTTP/1.0 client, so healthchecks in eqiad|codfw will fail after this. Either we patch PyBal to support HTTP/1.1 requests, or we have to wait for Liberica to be deployed here too (or we make an exception in the HAProxy configuration for healthcheck requests, but that doesn't seem like a good long-term solution to me).
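The "exception for healthcheck requests" option can be sketched as follows. This is a minimal Python illustration of the decision logic only, not the actual HAProxy configuration; the `/healthcheck` path and the choice to reject malformed request lines are assumptions made for the example.

```python
# Hypothetical healthcheck path; the real one is not stated in this thread.
HEALTHCHECK_PATHS = {"/healthcheck"}


def should_reject(request_line: str) -> bool:
    """Return True if the request would be rejected under the sketched policy."""
    parts = request_line.strip().split()
    if len(parts) != 3:
        # Assumption: malformed request lines are rejected outright.
        return True
    _method, path, version = parts
    if version != "HTTP/1.0":
        return False
    # HTTP/1.0 is rejected unless it targets an allow-listed healthcheck path.
    return path not in HEALTHCHECK_PATHS


print(should_reject("GET /wiki/Main_Page HTTP/1.0"))
print(should_reject("GET /healthcheck HTTP/1.0"))
print(should_reject("GET /wiki/Main_Page HTTP/1.1"))
```

The allow-list is what makes this a poor long-term fit: any client can send HTTP/1.0 to the exempted path, so the exception has to be kept narrow and maintained separately.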
Sep 29 2025
This has been migrated to Gitlab in the meantime
Yep, definitely, thanks for reminding
Sep 25 2025
This has been reverted due to issues with load balancers checks
Done rejecting all HTTP_1.0 requests
Sep 22 2025
Hi @BTullis, sorry for the late answer. I think this fired correctly: being depooled, the host produced no haproxykafka messages, so IMHO alerting is the right thing to do. In this case we usually both depool and silence the affected host (if the depool lasts longer than a few minutes). IIRC varnishkafka had the same behavior.
Sep 16 2025
We deployed a change in HAProxy logging (see T403176) to avoid sending non-utf8 encoded headers to DLQ. This *could* also affect this issue, as we're now logging these messages to Kafka (through the usual HaproxyKafka pipeline).
@Antoine_Quhen could you check all is good on your side?
Sep 11 2025
Sep 9 2025
I'm also taking care of this with some experiments to check when HAProxy (or HaproxyKafka) actually skips these messages
Sep 1 2025
With https://gerrit.wikimedia.org/r/c/operations/puppet/+/1183081 I think we can consider this closed. Newly reimaged cache hosts won't have varnishkafka references (except for statsv)
Aug 29 2025
Aug 28 2025
Aug 22 2025
Aug 14 2025
Abandoned for T400244
Aug 13 2025
Aug 12 2025
Declined. Better to upgrade purged to golang-1.23
Renamed to HttpHound
Aug 1 2025
Closing as per T400199 and opening a new ticket dedicated only to watchdog feature
This has been addressed with the following changes:
Jul 31 2025
Jul 30 2025
This has already been fixed upstream in docker-registry.wikimedia.org/bullseye:20250723; the problem is that docker (or podman, FWIW) doesn't fetch the latest tag automatically, so it needs to be pulled explicitly.
Jul 29 2025
Jul 28 2025
Jul 25 2025
Note that this happened again ~2025-07-24 14:43 on cp3071, same host
Thanks a lot @Scott_French ! This weekend is fine, I'll retry on Monday!
The image used for debci building (bullseye) is still affected by the bullseye-backports issue:
Closing as we can use the HaproxyKafka component to keep track of related tasks
Jul 24 2025
Jul 23 2025
Adding the DE team too, considering that two hosts didn't send messages for a long time and this could be impacting the analytics data.
