We detected some network problems when introducing the new dualstack and IPv4-only networks.
This ticket is to track the work to identify and fix them.
| Title | Reference | Author | Source Branch | Dest Branch | |
|---|---|---|---|---|---|
| codfw1dev: flavors: fix project access for g4.cores1.ram1.disk4 | repos/cloud/cloud-vps/tofu-infra!171 | aborrero | arturo-322-codfw1dev-flavors-f | main | |
| codfw1dev: create network tests instances | repos/cloud/cloud-vps/tofu-infra!170 | aborrero | arturo-276-codfw1dev-create-ne | main | |
| codfw1dev: add vxlan-ipv4-only.cloudinstances2b-gw.svc.codfw1dev.wikimedia.cloud FQDN | repos/cloud/cloud-vps/tofu-infra!169 | aborrero | arturo-169-codfw1dev-add-vxlan | main | |
| codfw1dev: tools-codfw1dev: manage default security group | repos/cloud/cloud-vps/tofu-infra!140 | aborrero | arturo-118-codfw1dev-tools-cod | main | |
aborrero opened https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/140
codfw1dev: tools-codfw1dev: manage default security group
aborrero merged https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/140
codfw1dev: tools-codfw1dev: manage default security group
Change #1097370 had a related patch set uploaded (by Arturo Borrero Gonzalez; author: Arturo Borrero Gonzalez):
[operations/puppet@production] openstack: networktests: refresh for latest network changes
Change #1097370 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] openstack: networktests: refresh for latest network changes
I detected a few inconsistencies in the network testing scripts and will fix them.
Among other things, I will use the vlanX120.cloudgwYYYY.<deploy>.wikimediacloud.org scheme for the IP addresses, like we do for vlan1107/vlan2107.
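For illustration, a minimal sketch of how names under that scheme could be generated. The helper name `cloudgw_vlan_fqdn` and the example host `cloudgw2002-dev` are assumptions made for this example, not taken from the puppet patches.

```python
def cloudgw_vlan_fqdn(vlan: int, host: str, deploy: str) -> str:
    """Build a name of the form vlanX120.cloudgwYYYY.<deploy>.wikimediacloud.org.

    `vlan` is the full vlan number (e.g. 1120 or 2120), `host` is the cloudgw
    hostname and `deploy` is the deployment name (e.g. codfw1dev).
    """
    return f"vlan{vlan}.{host}.{deploy}.wikimediacloud.org"


# Hypothetical host name, used only to show the resulting shape.
example = cloudgw_vlan_fqdn(2120, "cloudgw2002-dev", "codfw1dev")
```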
Change #1097380 had a related patch set uploaded (by Arturo Borrero Gonzalez; author: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: use vlan1120/vlan2120 prefix for FQDN
Change #1097380 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: use vlan1120/vlan2120 prefix for FQDN
Change #1097440 had a related patch set uploaded (by Arturo Borrero Gonzalez; author: Arturo Borrero Gonzalez):
[operations/puppet@production] openstack: networktests: support IPv6 and IPv4-only networks
Today @cmooney reported that this may have been caused by an inconsistency in the edge routing configuration for the cloudsw devices.
I'd forgotten about this task, apologies.
The reason the problems occurred last time is that the cloud switches had not yet been configured with IPv6 addressing on the various host-facing vlans, nor on their interconnects or their link to the WMF core routers. This meant IPv6 routing in and out of the cloud network was not functioning (indeed, not even configured), and the cloud systems were sending traffic into a black hole.
We're now working to add the IPv6 interfaces, routing protocols, public BGP announcements for the assigned ranges, etc., after which we should be able to re-attempt enabling IPv6 on the host side.
The only caveat is that I don't think any IPv4 networks should have been affected by the lack of v6 configuration on the physical infra, so perhaps there is some other issue here. Either way, we should ensure the v6 infra is fully configured before we re-attempt.
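To tell an IPv6-only black hole apart from a problem that also affects IPv4, each address family can be probed separately. The following is a minimal sketch using only the Python standard library; the function name `can_connect` is an assumption for this example, and it is not the networktests code.

```python
import socket


def can_connect(host: str, port: int, family: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection succeeds using only the given address family."""
    try:
        infos = socket.getaddrinfo(host, port, family, socket.SOCK_STREAM)
    except socket.gaierror:
        # The host has no address for this family (e.g. no AAAA record).
        return False
    for fam, socktype, proto, _canonname, sockaddr in infos:
        try:
            with socket.socket(fam, socktype, proto) as sock:
                sock.settimeout(timeout)
                sock.connect(sockaddr)
                return True
        except OSError:
            # Covers refusals and timeouts; black-holed traffic shows up as a timeout.
            continue
    return False
```

Calling it once with `socket.AF_INET` and once with `socket.AF_INET6` against the same target shows whether only the v6 path fails.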
Change #1097440 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] openstack: networktests: support IPv6 and IPv4-only networks
aborrero opened https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/169
codfw1dev: add vxlan-ipv4-only.cloudinstances2b-gw.svc.codfw1dev.wikimedia.cloud FQDN
aborrero merged https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/169
codfw1dev: add vxlan-ipv4-only.cloudinstances2b-gw.svc.codfw1dev.wikimedia.cloud FQDN
aborrero opened https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/170
codfw1dev: create network tests instances
aborrero merged https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/170
codfw1dev: create network tests instances
aborrero opened https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/171
codfw1dev: flavors: fix project access for g4.cores1.ram1.disk4
aborrero merged https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/171
codfw1dev: flavors: fix project access for g4.cores1.ram1.disk4
Mentioned in SAL (#wikimedia-cloud) [2025-04-07T15:30:30Z] <arturo> create a bunch of VMs by hand, like networktests-vlan-legacy-floating T380728
Mentioned in SAL (#wikimedia-cloud) [2025-04-07T15:30:48Z] <arturo> [codfw1dev] testlabs create a bunch of VMs by hand, like networktests-vlan-legacy-floating T380728
We think all problems have been addressed. Among other things: