IPVS issues with UDP services, pybal depooling strategy
Open, MediumPublic
Actions

Assigned To

None

Authored By

	• ema
	Jul 31 2017, 11:56 AM

Description

Last week's reboot of hydrogen, one of the two recdns in eqiad, caused a bunch of issues.

Currently, pybal depools servers by removing them from the virtual service (ipvsadm -d). IPVS has known packet loss issues when removing servers from UDP virtual services.

We should update pybal to do the following in case of planned maintenance:

set weight to zero
schedule server removal after a certain amount of time (if still under maintenance)

Similarly, in case of service failure:

set weight to zero
if failure persists, remove server

We should also consider enabling expire_nodest_conn. From ipvs-sysctl.txt:

expire_nodest_conn - BOOLEAN
        0 - disabled (default)
        not 0 - enabled

        The default value is 0, the load balancer will silently drop
        packets when its destination server is not available. It may
        be useful, when user-space monitoring program deletes the
        destination server (because of server overload or wrong
        detection) and add back the server later, and the connections
        to the server can continue.

        If this feature is enabled, the load balancer will expire the
        connection immediately when a packet arrives and its
        destination server is not available, then the client program
        will be notified that the connection is closed. This is
        equivalent to the feature some people requires to flush
        connections when its destination is not available.

Related Objects
Search...

Status	Assigned	Task
Open	None	T172103 IPVS issues with UDP services, pybal depooling strategy
Declined	None	T172124 PyBal Feature: progressive depooling strategy for monitored failures
Declined	None	T86650 Add support for setting weight=0 when depooling
Invalid	None	T171850 Backport ipvsadm

Event Timeline

• ema created this task.Jul 31 2017, 11:56 AM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptJul 31 2017, 11:56 AM

• ema triaged this task as Medium priority.Jul 31 2017, 11:56 AM

• ema added a project: PyBal.

+1. There are a number of tricky things here to get to these simple goals, though, and since the sysctls affect all services, we have to have the TCP cases in mind as well:

We need to set the related ipvs conn_reuse sysctl to 2 before any of this. It's easy, it's an improvement today, and probably more of an improvement with everything else below.
Pybal + ipvsadm need fixups and/or deployed version updates as appropriate:
1. Current jessie ipvsadm doesn't support weight=0
2. Current jessie ipvsadm doesn't support setting the sh scheduler flag we need to make weight=0 work sanely
3. Current PyBal doesn't support weight=0
Pybal needs to update its failure-monitoring depooling strategy before we turn on expire_nodest_conn - some of the monitors are too flappy, and flapping to a full backend-delete with expire_nodest_conn=1 has a lot more impact than without it. So before we turn on the sysctl, PyBal first has to get smarter about "weight=0 first, then remove later when failure persists".
Our maintenance tooling needs to get smarter about weight=0 periods as well, but turning on expire_nodest_conn before these are all fixed is ok. Since maintenance doesn't really flap pointlessly, and almost always the service ends up shutting off at least briefly and losing all TCP connections anyways, either setting of the sysctl without a weight=0 period has about the same effect.

BBlack added a subtask: T171850: Backport ipvsadm.Jul 31 2017, 1:51 PM

BBlack added a subtask: T86650: Add support for setting weight=0 when depooling.Jul 31 2017, 3:17 PM

BBlack created subtask T172124: PyBal Feature: progressive depooling strategy for monitored failures.Jul 31 2017, 3:27 PM

• ema moved this task from Backlog to LoadBalancer on the Traffic board.Aug 1 2017, 12:44 PM

• Phabricator_maintenance moved this task from Backlog to Acknowledged on the SRE board.Jan 26 2019, 9:24 PM

The swap of Traffic for Traffic-Icebox in this ticket's set of tags was based on a bulk action for all such tickets that haven't been updated in 6 months or more. This does not imply any human judgement about the validity or importance of the task, and is simply the first step in a larger task cleanup effort. Further manual triage and/or requests for updates will happen this month for all such tickets. For more detail, have a look at the extended explanation on the main page of Traffic-Icebox . Thank you!

BCornwall closed subtask T171850: Backport ipvsadm as Invalid.Sep 19 2022, 7:34 PM

BCornwall closed subtask T172124: PyBal Feature: progressive depooling strategy for monitored failures as Declined.May 2 2023, 8:03 PM

BCornwall closed subtask T86650: Add support for setting weight=0 when depooling as Declined.May 2 2023, 8:14 PM

IPVS issues with UDP services, pybal depooling strategyOpen, MediumPublicActions

Description

Related ObjectsSearch...

Event Timeline

IPVS issues with UDP services, pybal depooling strategy
Open, MediumPublic
Actions

Related Objects
Search...