[cloudvirt] Enable jumbo frames on cloud-hosts/cloud-private interfaces
Closed, Resolved (Public)

Description

Our ceph osd hosts are currently set up to use jumbo frames, but the cloudvirts are configured with a 1500-byte MTU.

This is not optimal as jumbo frames will allow for higher throughput to the VM storage layer (ceph).
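The throughput claim can be put into rough numbers. The following is a minimal sketch, assuming untagged Ethernet carrying IPv4 and TCP with no options; the constant and function names are illustrative and not taken from any WMCS tooling.

```python
# Assumed per-frame costs: untagged Ethernet, IPv4 + TCP without options.
ETH_OVERHEAD = 8 + 14 + 4 + 12   # preamble+SFD, Ethernet header, FCS, inter-frame gap
IP_TCP_HEADERS = 20 + 20         # IPv4 header + TCP header


def tcp_payload_efficiency(mtu: int) -> float:
    """Fraction of on-wire bytes that is TCP payload at a given IP MTU."""
    payload = mtu - IP_TCP_HEADERS
    on_wire = mtu + ETH_OVERHEAD
    return payload / on_wire


for mtu in (1500, 9000):
    print(f"MTU {mtu}: {tcp_payload_efficiency(mtu):.2%} of wire bytes are payload")
```

Under these assumptions the payload share rises from roughly 95% to roughly 99%. In practice much of the benefit may come instead from handling about six times fewer packets for the same amount of data, which this sketch does not model.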

All the switches (cloudsw*) are configured with jumbo frames too.

This task is to enable jumbo frames on the cloudvirts, making sure nothing breaks in between.

Related: T329711: [ceph] Add monitoring for inter-osd/mon/cloudvirt connectivity, T329778: [ceph] Investigate if there's a way to degrade instead of failing when jumbo frames are being dropped in the network

Event Timeline

dcaro triaged this task as High priority.

Question for NetOps: the cloudvirts live in the cloud-hosts VLAN. Is it OK if some hosts attached to that VLAN use a high MTU and others don't?

The TCP handshake carries each side's MSS, so each side sends segments no larger than what it can send itself and what the other side has said it can receive.
The sizes can also be asymmetric: host A can send 9000-byte packets to host B while host B sends 1500-byte packets to host A.

UDP datagrams are also below 1500 bytes by default.

So for UDP it comes down to the application, and TCP should do the right thing.
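The TCP behaviour described above can be sketched as follows. This is a simplified model, assuming IPv4 with no TCP options and no smaller MTU anywhere on the path; the `Endpoint` class, the host names and the `advertised_mss` override are hypothetical and only illustrate the negotiation.

```python
from dataclasses import dataclass
from typing import Optional

IPV4_TCP_HEADERS = 40  # 20-byte IPv4 header + 20-byte TCP header, no options


@dataclass
class Endpoint:
    name: str
    mtu: int                               # MTU of the interface used for the connection
    advertised_mss: Optional[int] = None   # hypothetical override of the advertised value

    def send_limit(self) -> int:
        """Largest TCP payload this host can put in one packet."""
        return self.mtu - IPV4_TCP_HEADERS

    def mss_option(self) -> int:
        """MSS value this host announces in its SYN or SYN-ACK."""
        if self.advertised_mss is not None:
            return self.advertised_mss
        return self.send_limit()


def max_segment(sender: Endpoint, receiver: Endpoint) -> int:
    """Segment size used from sender to receiver: the smaller of both limits."""
    return min(sender.send_limit(), receiver.mss_option())


jumbo = Endpoint("jumbo-host", mtu=9000)
legacy = Endpoint("legacy-host", mtu=1500)
print(max_segment(jumbo, legacy), max_segment(legacy, jumbo))

# Asymmetric case: a jumbo host that announces a smaller MSS than it can send.
clamped = Endpoint("clamped-host", mtu=9000, advertised_mss=1460)
print(max_segment(clamped, jumbo), max_segment(jumbo, clamped))
```

In this model a jumbo host talking to a 1500-MTU host ends up with 1460-byte segments in both directions, which is why mixed MTUs on the same VLAN should still work for TCP.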

Ok, thanks!

Next question would be:

Because of T319184: Move WMCS servers to 1 single NIC, some cloudvirts have a single NIC trunked with 1 native + 1 tagged VLAN. I wonder if there is any problem if each has a different MTU set on either the Linux side or the switch port side.

Switch side is already at 9192.

The server's TCP stack will define the MSS based on the interface used to reach the other endpoint (so based on the routing table).

So here too it should just work.
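To illustrate the point about the routing table, here is a small sketch of a longest-prefix lookup that picks the egress interface and derives the MSS from that interface's MTU. The interface names, prefixes and MTU values are made up for the example (the prefixes are documentation ranges), and IPv4 without TCP options is assumed.

```python
import ipaddress

IPV4_TCP_HEADERS = 40  # IPv4 + TCP headers, no options

# Hypothetical host: one NIC with a native VLAN and a tagged VLAN sub-interface.
INTERFACE_MTU = {"native0": 1500, "tagged0": 9000}
ROUTES = [
    ("0.0.0.0/0", "native0"),      # default route via the native VLAN
    ("192.0.2.0/24", "tagged0"),   # jumbo-enabled subnet on the tagged VLAN
]


def egress_interface(dst: str) -> str:
    """Return the interface of the most specific route matching dst."""
    addr = ipaddress.ip_address(dst)
    matches = []
    for prefix, dev in ROUTES:
        net = ipaddress.ip_network(prefix)
        if addr in net:
            matches.append((net.prefixlen, dev))
    if not matches:
        raise LookupError(f"no route to {dst}")
    return max(matches)[1]


def mss_towards(dst: str) -> int:
    """MSS derived from the MTU of the interface used to reach dst."""
    return INTERFACE_MTU[egress_interface(dst)] - IPV4_TCP_HEADERS


print(mss_towards("192.0.2.10"))    # peer on the jumbo subnet
print(mss_towards("198.51.100.7"))  # peer reached via the default route
```

Each destination gets the MSS of its own egress interface, so the two VLANs on one NIC can carry different MTUs without affecting each other.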

dcaro removed dcaro as the assignee of this task. May 22 2025, 7:41 AM

Question for NetOps: the cloudvirts live in the cloud-hosts VLAN. Is it OK if some hosts attached to that VLAN use a high MTU and others don't?

I would say tread very carefully here and fully test any implementation in codfw before going live. As Arzhel says, for TCP the MSS negotiation should largely keep things working OK, but MTU problems can be tricky.
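One possible way to test this kind of change is to send full-size packets with fragmentation prohibited and check that they arrive. The helper below is only a sketch, assuming Linux iputils `ping` and IPv4; the function name and the peer host name are hypothetical, and this is not the procedure that was used on this task.

```python
def df_ping_command(host: str, mtu: int, count: int = 3) -> list:
    """Build a ping command that only succeeds if mtu-sized IPv4 packets fit the path."""
    payload = mtu - 20 - 8  # subtract the IPv4 header and the ICMP echo header
    return [
        "ping",
        "-M", "do",           # prohibit fragmentation (sets the DF bit)
        "-s", str(payload),   # ICMP payload size in bytes
        "-c", str(count),     # number of probes
        host,
    ]


cmd = df_ping_command("peer.example", mtu=9000)
print(" ".join(cmd))
# To actually run the probe on a test host:
#   import subprocess
#   subprocess.run(cmd, check=True)
```

If a device on the path only passes 1500-byte frames, the probe either fails or gets no reply, while the same command built with `mtu=1500` still succeeds.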

taavi renamed this task from [cloudvirt] Move to jumbo frames to [cloudvirt] Enable jumbo frames on cloud-hosts/cloud-private interfaces. Nov 7 2025, 12:06 PM
taavi claimed this task.
taavi added subscribers: taavi, RhinosF1, Aklapper.

Change #1203003 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] P:wmcs::cloud_private_subnet: Support enabling jumbo frames

https://gerrit.wikimedia.org/r/1203003

Change #1203003 merged by Majavah:

[operations/puppet@production] P:wmcs::cloud_private_subnet: Support enabling jumbo frames

https://gerrit.wikimedia.org/r/1203003

Change #1203398 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] hieradata: Enable jumbo frames on all codwf1dev cloudvirts

https://gerrit.wikimedia.org/r/1203398

Change #1203399 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] hieradata: Enable jumbo frames on codfw1dev cloudnets

https://gerrit.wikimedia.org/r/1203399

Change #1203400 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] hieradata: Enable jumbo frames on remaining codfw1dev nodes

https://gerrit.wikimedia.org/r/1203400

Change #1203398 merged by Majavah:

[operations/puppet@production] hieradata: Enable jumbo frames on all codwf1dev cloudvirts

https://gerrit.wikimedia.org/r/1203398

Change #1203399 merged by Majavah:

[operations/puppet@production] hieradata: Enable jumbo frames on codfw1dev cloudnets

https://gerrit.wikimedia.org/r/1203399

Change #1203400 merged by Majavah:

[operations/puppet@production] hieradata: Enable jumbo frames on remaining codfw1dev nodes

https://gerrit.wikimedia.org/r/1203400

Change #1203760 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] hieradata: Enable jumbo frames on cloudvirt1062

https://gerrit.wikimedia.org/r/1203760

Change #1203760 merged by Majavah:

[operations/puppet@production] hieradata: Enable jumbo frames on cloudvirt1062

https://gerrit.wikimedia.org/r/1203760

Change #1204623 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] hieradata: Enable jumbo frames on all eqiad1 cloudvirts

https://gerrit.wikimedia.org/r/1204623

Change #1204624 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] hieradata: Enable jumbo frames on eqiad1 cloudnets

https://gerrit.wikimedia.org/r/1204624

Change #1204625 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] hieradata: Enable jumbo frames on remaining eqiad1 nodes

https://gerrit.wikimedia.org/r/1204625

Change #1204626 had a related patch set uploaded (by Majavah; author: Majavah):

[operations/puppet@production] P:wmcs::cloud_private_subnet: Cleanup feature flag for jumbo frames

https://gerrit.wikimedia.org/r/1204626

Change #1204623 merged by Majavah:

[operations/puppet@production] hieradata: Enable jumbo frames on all eqiad1 cloudvirts

https://gerrit.wikimedia.org/r/1204623

Change #1204624 merged by Majavah:

[operations/puppet@production] hieradata: Enable jumbo frames on eqiad1 cloudnets

https://gerrit.wikimedia.org/r/1204624

Change #1204625 merged by Majavah:

[operations/puppet@production] hieradata: Enable jumbo frames on remaining eqiad1 nodes

https://gerrit.wikimedia.org/r/1204625

Change #1204626 merged by Majavah:

[operations/puppet@production] P:wmcs::cloud_private_subnet: Cleanup feature flag for jumbo frames

https://gerrit.wikimedia.org/r/1204626