Page MenuHomePhabricator

Shutdown varnishkafka webrequest instances
Closed, ResolvedPublic

Description

As the webrequest dataset source has been migrated to haproxykafka (on 1st of April), we're now ready to switch off varnishkafka instances that are still producing and sending data to the "old" kafka topics.

If everyone involved agrees, I would define the following roadmap:

  • On Monday (12 May 2025) I'll disable and stop the varnishkafka service on both text and upload cluster in MAGRU DC, to test eventual and unforeseen issues with monitoring, configuration or performances.
  • On the next week (19-23 May 2025) we can switch off varnishkafka instances on all cache hosts, on all datacenters. This is a good time to check that everything on the DE side works as expected and (eventually) do some fixes.
  • On June the 19th we will remove entirely varnishkafka, and all related configurations, from production hosts.

Let me know if this sounds like a feasible roadmap for all involved people

  • Shutdown varnishkafka (webrequest) in MAGRU DC
  • Shutdown varnishkafka (webrequest) instances on all cp hosts
  • Remove varnishkafka (webrequest) instances and related configuration from all cp hosts

Event Timeline

Please note that you should only shutdown the webrequest instance

@JAllemandou please comment on proposed timeline, looks to me its aligned with our plan / data retention.

Hey folks,
I'm sorry I was OoO last week and missed the timeline. I'm happy for VK to be shut-down when you wish. we don't gather data anymore on the cluster and everything has been moved, so you can clean up on your side. I've created T394011 about cleaning up on our side too.
Thank you!

Sorry the first date has been postponed to tomorrow (May 13th)

Change #1145282 had a related patch set uploaded (by Fabfur; author: Fabfur):

[operations/puppet@production] cache: add option to enable or disable varnishkafka instance

https://gerrit.wikimedia.org/r/1145282

Change #1145282 merged by Fabfur:

[operations/puppet@production] cache: add option to enable or disable varnishkafka instance

https://gerrit.wikimedia.org/r/1145282

Change #1145948 had a related patch set uploaded (by Fabfur; author: Fabfur):

[operations/puppet@production] hiera: disable varnishkafka in magru text|upload

https://gerrit.wikimedia.org/r/1145948

Change #1145948 merged by Fabfur:

[operations/puppet@production] hiera: disable varnishkafka (webrequest) in magru text|upload

https://gerrit.wikimedia.org/r/1145948

Mentioned in SAL (#wikimedia-operations) [2025-05-14T15:25:07Z] <fabfur> removing varnishkafka from magru (T393772)

Change #1146008 had a related patch set uploaded (by Fabfur; author: Fabfur):

[operations/puppet@production] varnishkafka: disable webrequest monitoring if ensure => absent

https://gerrit.wikimedia.org/r/1146008

Change #1146021 had a related patch set uploaded (by Fabfur; author: Fabfur):

[operations/puppet@production] hiera: enable vk monitoring in magru to actually remove it

https://gerrit.wikimedia.org/r/1146021

Change #1146008 merged by Fabfur:

[operations/puppet@production] varnishkafka: disable webrequest monitoring if ensure => absent

https://gerrit.wikimedia.org/r/1146008

Change #1146021 merged by Fabfur:

[operations/puppet@production] hiera: enable vk monitoring in magru to actually remove it

https://gerrit.wikimedia.org/r/1146021

Change #1147783 had a related patch set uploaded (by Fabfur; author: Fabfur):

[operations/puppet@production] hiera: disable vk on A:cp hosts

https://gerrit.wikimedia.org/r/1147783

Fabfur renamed this task from Shutdown varnishkafka instances to Shutdown varnishkafka webrequest instances.May 20 2025, 7:26 AM

Change #1147783 merged by Fabfur:

[operations/puppet@production] hiera: disable vk (webrequest) on A:cp hosts

https://gerrit.wikimedia.org/r/1147783

Fabfur changed the task status from Open to In Progress.May 20 2025, 7:39 AM
Fabfur updated the task description. (Show Details)

varnishkafka webrequest has been shut down on all cache hosts

@Fabfur: I think this all done and the alerts have been removed as well. Confirming: can we close this ticket as resolved? If yes, please do so. Thanks!

@Fabfur: I think this all done and the alerts have been removed as well. Confirming: can we close this ticket as resolved? If yes, please do so. Thanks!

The very last task (removing varnishkafka from puppet) is still pending, I'll work on that before closing this

Change #1183081 had a related patch set uploaded (by Fabfur; author: Fabfur):

[operations/puppet@production] profile:cache: remove varnishkafka (webrequest) from cp hosts

https://gerrit.wikimedia.org/r/1183081

Change #1183081 merged by Fabfur:

[operations/puppet@production] profile:cache: remove varnishkafka (webrequest) from cp hosts

https://gerrit.wikimedia.org/r/1183081

With https://gerrit.wikimedia.org/r/c/operations/puppet/+/1183081 I think we can consider this as closed. New cache hosts reimaged won't have varnishkafka references (except for statsv)