Page MenuHomePhabricator

openstack: prevent puppet from restarting neutron-openvswitch-agent
Closed, ResolvedPublic

Description

We have experienced an outage when puppet restart neutron-openvswitch-agent in the fleet.

Event Timeline

aborrero changed the task status from Open to In Progress.
aborrero triaged this task as High priority.
aborrero moved this task from Backlog to Doing on the User-aborrero board.

Change #1098498 had a related patch set uploaded (by Arturo Borrero Gonzalez; author: Arturo Borrero Gonzalez):

[operations/puppet@production] openstack: neutron-openvswitch-agent: prevent puppet from restarting the service

https://gerrit.wikimedia.org/r/1098498

Change #1098498 merged by Arturo Borrero Gonzalez:

[operations/puppet@production] openstack: neutron-openvswitch-agent: prevent puppet from restarting the service

https://gerrit.wikimedia.org/r/1098498

aborrero claimed this task.

Is there any theory about why restarting openvswitch-agent is more delicate than restarting the old linuxbridge agent?

I'm in favor of avoiding outages, but because the agent runs in many places (cloudvirts), decoupling it from puppet can result in agent state being out of sync with config which also seems bad.

Is there any theory about why restarting openvswitch-agent is more delicate than restarting the old linuxbridge agent?

I'm in favor of avoiding outages, but because the agent runs in many places (cloudvirts), decoupling it from puppet can result in agent state being out of sync with config which also seems bad.

my current theory is that the linuxbridge agent was stateless, whereas openvswitch is stateful.