Page MenuHomePhabricator

cloudgw: evaluate / validate setup in codfw1dev
Closed, ResolvedPublic

Description

This task is to track the work of evaluating the cloudgw setup in codfw1dev, as described in the 2020 network refresh project:

https://wikitech.wikimedia.org/wiki/Wikimedia_Cloud_Services_team/EnhancementProposals/2020_Network_refresh

  • introduce basic puppet support
  • reimage labtestvirt2003 as Debian Buster
  • enable new puppet code in labtestvirt2003
  • test basic network bits
  • test relocating neutron functions to cloudgw
  • repurpose/rename labtestvirt2003 as cloudgw2001-dev,

Details

SubjectRepoBranchLines +/-
operations/dnsmaster+1 -0
operations/dnsmaster+8 -5
operations/puppetproduction+3 -1
operations/puppetproduction+9 -2
operations/puppetproduction+1 -0
operations/puppetproduction+19 -4
operations/puppetproduction+2 -2
operations/puppetproduction+23 -15
operations/puppetproduction+4 -0
operations/puppetproduction+11 -1
operations/puppetproduction+1 -0
operations/puppetproduction+13 -6
operations/puppetproduction+12 -4
operations/puppetproduction+7 -7
operations/puppetproduction+0 -2
operations/puppetproduction+14 -15
operations/puppetproduction+2 -2
operations/puppetproduction+4 -12
operations/puppetproduction+25 -17
operations/puppetproduction+0 -2
operations/puppetproduction+3 -3
operations/puppetproduction+3 -2
operations/puppetproduction+1 -1
operations/puppetproduction+3 -1
operations/puppetproduction+0 -2
operations/puppetproduction+1 -1
operations/puppetproduction+61 -27
operations/puppetproduction+8 -7
operations/puppetproduction+4 -1
operations/puppetproduction+88 -8
operations/puppetproduction+28 -0
operations/puppetproduction+7 -1
operations/puppetproduction+6 -0
operations/puppetproduction+16 -2
operations/puppetproduction+0 -1
operations/puppetproduction+3 -6
operations/puppetproduction+13 -13
operations/puppetproduction+9 -9
operations/puppetproduction+9 -0
operations/puppetproduction+2 -0
operations/puppetproduction+1 -0
operations/puppetproduction+56 -2
Show related patches Customize query in gerrit

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

Change 629143 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: rt_table: fix newline at the end of file

https://gerrit.wikimedia.org/r/629143

Change 629143 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: rt_table: fix newline at the end of file

https://gerrit.wikimedia.org/r/629143

Change 629146 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: interfaces: avoid augeas loop

https://gerrit.wikimedia.org/r/629146

Change 629146 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: interfaces: avoid augeas loop

https://gerrit.wikimedia.org/r/629146

Change 629150 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: interfaces: more template fixes

https://gerrit.wikimedia.org/r/629150

Change 629150 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: interfaces: more template fixes

https://gerrit.wikimedia.org/r/629150

Change 629151 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: interfaces: template fixes

https://gerrit.wikimedia.org/r/629151

Change 629151 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: interfaces: template fixes

https://gerrit.wikimedia.org/r/629151

Change 629164 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: hardcode mapped IPv6 interface

https://gerrit.wikimedia.org/r/629164

Change 629164 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: hardcode mapped IPv6 interface

https://gerrit.wikimedia.org/r/629164

Change 629323 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] hiera: labtestvirt2003: fix hiera key names

https://gerrit.wikimedia.org/r/629323

Change 629323 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] hiera: labtestvirt2003: fix hiera key names

https://gerrit.wikimedia.org/r/629323

Change 630778 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] openstack: cloudgw: drop unused orig_nic hiera key

https://gerrit.wikimedia.org/r/630778

Change 630778 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] openstack: cloudgw: drop unused orig_nic hiera key

https://gerrit.wikimedia.org/r/630778

Change 630809 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] openstack: cloudgw: add hiera keys for basic network bits

https://gerrit.wikimedia.org/r/630809

Change 630809 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] openstack: cloudgw: add hiera keys for basic network bits

https://gerrit.wikimedia.org/r/630809

Script wmf-auto-reimage was launched by aborrero on cumin2001.codfw.wmnet for hosts:

labtestvirt2003.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202009291118_aborrero_29762_labtestvirt2003_codfw_wmnet.log.

Completed auto-reimage of hosts:

['labtestvirt2003.codfw.wmnet']

Of which those FAILED:

['labtestvirt2003.codfw.wmnet']

Change 630812 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] openstack: cloudgw: introduce native vlan for easier reimaging

https://gerrit.wikimedia.org/r/630812

Change 630812 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] openstack: cloudgw: introduce native vlan for easier reimaging

https://gerrit.wikimedia.org/r/630812

Script wmf-auto-reimage was launched by aborrero on cumin2001.codfw.wmnet for hosts:

labtestvirt2003.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202010011534_aborrero_15026_labtestvirt2003_codfw_wmnet.log.

Completed auto-reimage of hosts:

['labtestvirt2003.codfw.wmnet']

Of which those FAILED:

['labtestvirt2003.codfw.wmnet']

Script wmf-auto-reimage was launched by aborrero on cumin2001.codfw.wmnet for hosts:

labtestvirt2003.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202010011535_aborrero_15187_labtestvirt2003_codfw_wmnet.log.

Completed auto-reimage of hosts:

['labtestvirt2003.codfw.wmnet']

Of which those FAILED:

['labtestvirt2003.codfw.wmnet']

Change 631500 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] openstack: cloudgw: replace dots with colons when building host IPv6

https://gerrit.wikimedia.org/r/631500

Change 631500 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] openstack: cloudgw: replace dots with colons when building host IPv6

https://gerrit.wikimedia.org/r/631500

Script wmf-auto-reimage was launched by aborrero on cumin2001.codfw.wmnet for hosts:

labtestvirt2003.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202010020916_aborrero_15594_labtestvirt2003_codfw_wmnet.log.

Completed auto-reimage of hosts:

['labtestvirt2003.codfw.wmnet']

Of which those FAILED:

['labtestvirt2003.codfw.wmnet']

Mentioned in SAL (#wikimedia-cloud) [2020-10-02T09:16:54Z] <arturo> [codfw1dev] trying the labtestvirt2003 (cloudgw) reimage again (T261724)

Script wmf-auto-reimage was launched by aborrero on cumin2001.codfw.wmnet for hosts:

labtestvirt2003.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202010020917_aborrero_15759_labtestvirt2003_codfw_wmnet.log.

Completed auto-reimage of hosts:

['labtestvirt2003.codfw.wmnet']

Of which those FAILED:

['labtestvirt2003.codfw.wmnet']

Script wmf-auto-reimage was launched by aborrero on cumin2001.codfw.wmnet for hosts:

labtestvirt2003.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202010081028_aborrero_31551_labtestvirt2003_codfw_wmnet.log.

Mentioned in SAL (#wikimedia-cloud) [2020-10-08T10:28:30Z] <arturo> [codfw1dev] reimaging labtestvirt2003 (cloudgw) T261724

Completed auto-reimage of hosts:

['labtestvirt2003.codfw.wmnet']

Of which those FAILED:

['labtestvirt2003.codfw.wmnet']

Change 632904 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] hieradata: labtestvirt2003: refresh network data for cloudgw PoC with latest allocations

https://gerrit.wikimedia.org/r/632904

Change 632904 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: refresh network config for the PoC

https://gerrit.wikimedia.org/r/632904

Change 632945 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: interfaces: don't bring up interfaces recursively

https://gerrit.wikimedia.org/r/632945

Change 632945 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: interfaces: don't bring up interfaces recursively

https://gerrit.wikimedia.org/r/632945

Mentioned in SAL (#wikimedia-cloud) [2020-10-09T08:47:40Z] <arturo> [codfw1dev] root@cloudcontrol2001-dev:~# openstack subnet delete a5ab5362-4ffb-4059-9ff7-391e22dcf3bc (T261724)

Mentioned in SAL (#wikimedia-cloud) [2020-10-09T08:49:24Z] <arturo> [codfw1dev] root@cloudcontrol2001-dev:~# openstack subnet create --network wan-transport-codfw --gateway 185.15.57.9 --no-dhcp --subnet-range 185.15.57.8/30 cloud-gw-transport-codfw (T261724)

Mentioned in SAL (#wikimedia-cloud) [2020-10-09T09:10:33Z] <arturo> [codfw1dev] root@cloudcontrol2001-dev:~# openstack router set --external-gateway wan-transport-codfw --fixed-ip subnet=cloud-gw-transport-codfw,ip-address=185.15.57.10 cloudinstances2b-gw (T261724)

Mentioned in SAL (#wikimedia-cloud) [2020-10-09T09:12:48Z] <arturo> [codfw1dev] root@cloudcontrol2001-dev:~# openstack subnet delete 31214392-9ca5-4256-bff5-1e19a35661de (cloud-instances-transport1-b-codfw - 208.80.153.184/29) (T261724)

Change 633155 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] openstack: codfw1dev: refresh external connection for neutron

https://gerrit.wikimedia.org/r/633155

Change 633155 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] openstack: codfw1dev: refresh external connection for neutron

https://gerrit.wikimedia.org/r/633155

Mentioned in SAL (#wikimedia-cloud) [2020-10-09T09:22:44Z] <arturo> [codfwd1dev] rebooting cloudnet boxes for bridge and vlan changes (T261724)

Mentioned in SAL (#wikimedia-cloud) [2020-10-09T10:15:10Z] <arturo> [codfwd1ev] root@cloudcontrol2001-dev:~# openstack router set --disable-snat cloudinstances2b-gw --external-gateway wan-transport-codfw (T261724)

Change 633711 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: refresh routing configuration

https://gerrit.wikimedia.org/r/633711

Change 633711 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: refresh routing configuration

https://gerrit.wikimedia.org/r/633711

Change 633775 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: refresh network configuration

https://gerrit.wikimedia.org/r/633775

Change 633775 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: refresh network configuration

https://gerrit.wikimedia.org/r/633775

Change 634209 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: basefirewall: accept ICMP packets

https://gerrit.wikimedia.org/r/634209

Change 634209 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: basefirewall: accept ICMP packets

https://gerrit.wikimedia.org/r/634209

Change 634223 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] neutron: add option to disable customizations if cloudgw is enabled

https://gerrit.wikimedia.org/r/634223

Change 634223 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] neutron: add option to disable customizations if cloudgw is enabled

https://gerrit.wikimedia.org/r/634223

Mentioned in SAL (#wikimedia-cloud) [2020-10-15T13:56:16Z] <arturo> [codfw1dev] drop neutron l3 agent hacks in cloudnet2002/2003-dev (T261724)

Mentioned in SAL (#wikimedia-cloud) [2020-10-15T15:17:30Z] <arturo> [codfw1dev] try cleaning up anything related to address scopes in the neutron database (T261724)

Change 634470 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: add routes for internal VM addressing

https://gerrit.wikimedia.org/r/634470

Change 634470 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: add routes for internal VM addressing

https://gerrit.wikimedia.org/r/634470

Mentioned in SAL (#wikimedia-cloud) [2020-10-16T09:01:16Z] <arturo> [codfw1dev] basic network connectivity seems stable after cleaning up everything related to address scopes (T261724)

Mentioned in SAL (#wikimedia-cloud) [2020-10-16T09:18:19Z] <arturo> [codfw1dev] live-hacked cloudservices2002-dev /etc/powerdns/recursor.conf file to include cloud-codfw1dev-floating CIDR (185.15.57.0/29) while https://gerrit.wikimedia.org/r/c/operations/puppet/+/634050 is in review, so VMs with a floating IP can query the DNS recursor (T261724)

Change 634490 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: add support for selecting NICs using hiera

https://gerrit.wikimedia.org/r/634490

Change 634490 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: add support for selecting NICs using hiera

https://gerrit.wikimedia.org/r/634490

Change 634492 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] hieradata: labtestvirt2003: cloudgw: fix NIC names

https://gerrit.wikimedia.org/r/634492

Change 634492 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] hieradata: labtestvirt2003: cloudgw: fix NIC names

https://gerrit.wikimedia.org/r/634492

Script wmf-auto-reimage was launched by aborrero on cumin2001.codfw.wmnet for hosts:

labtestvirt2003.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202010201448_aborrero_17241_labtestvirt2003_codfw_wmnet.log.

Mentioned in SAL (#wikimedia-cloud) [2020-10-20T14:49:06Z] <arturo> [codfw1dev] reimaging labtestvirt2003 (cloudgw) to test puppet code (T261724)

Change 635314 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] openstack: pdns recursor: allow querys from extra CIDRs

https://gerrit.wikimedia.org/r/635314

Change 635314 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] openstack: pdns recursor: allow querys from extra CIDRs

https://gerrit.wikimedia.org/r/635314

Mentioned in SAL (#wikimedia-cloud) [2020-10-20T15:47:05Z] <arturo> changing DNS recursor ACLs (https://gerrit.wikimedia.org/r/c/operations/puppet/+/635314) this can be reverted any time if it causes problems (T261724)

Completed auto-reimage of hosts:

['labtestvirt2003.codfw.wmnet']

Of which those FAILED:

['labtestvirt2003.codfw.wmnet']

Script wmf-auto-reimage was launched by aborrero on cumin2001.codfw.wmnet for hosts:

labtestvirt2003.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202010210855_aborrero_18426_labtestvirt2003_codfw_wmnet.log.

Mentioned in SAL (#wikimedia-cloud) [2020-10-21T08:56:38Z] <arturo> [codfw1dev] reimaging labtestvirt2003 (cloudgw) to test puppet code (T261724)

Change 635514 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: fix unset variable in interfaces template

https://gerrit.wikimedia.org/r/635514

Change 635514 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: fix unset variable in interfaces template

https://gerrit.wikimedia.org/r/635514

Completed auto-reimage of hosts:

['labtestvirt2003.codfw.wmnet']

and were ALL successful.

Change 635519 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: force installation of latest kernel

https://gerrit.wikimedia.org/r/635519

Change 635519 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: force installation of latest kernel

https://gerrit.wikimedia.org/r/635519

Script wmf-auto-reimage was launched by aborrero on cumin2001.codfw.wmnet for hosts:

labtestvirt2003.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202010211030_aborrero_5050_labtestvirt2003_codfw_wmnet.log.

Mentioned in SAL (#wikimedia-cloud) [2020-10-21T10:31:23Z] <arturo> [codfw1dev] reimaging labtestvirt2003 (cloudgw) to test puppet code (T261724)

Completed auto-reimage of hosts:

['labtestvirt2003.codfw.wmnet']

Of which those FAILED:

['labtestvirt2003.codfw.wmnet']

Change 635555 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/puppet@production] cloudgw: install latest nftables package

https://gerrit.wikimedia.org/r/635555

Change 635555 merged by Arturo Borrero Gonzalez:
[operations/puppet@production] cloudgw: install latest nftables package

https://gerrit.wikimedia.org/r/635555

Script wmf-auto-reimage was launched by aborrero on cumin2001.codfw.wmnet for hosts:

labtestvirt2003.codfw.wmnet

The log can be found in /var/log/wmf-auto-reimage/202010221106_aborrero_17826_labtestvirt2003_codfw_wmnet.log.

Completed auto-reimage of hosts:

['labtestvirt2003.codfw.wmnet']

Of which those FAILED:

['labtestvirt2003.codfw.wmnet']

Change 635965 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/dns@master] wikimediacloud.org: refresh codfw1dev addresses with cloudgw changes

https://gerrit.wikimedia.org/r/635965

Change 635965 merged by Arturo Borrero Gonzalez:
[operations/dns@master] wikimediacloud.org: refresh codfw1dev addresses with cloudgw changes

https://gerrit.wikimedia.org/r/635965

Mentioned in SAL (#wikimedia-cloud) [2020-10-23T10:09:04Z] <arturo> [codf1dev] doing DNS changes for the cloudgw PoC, including designate and https://gerrit.wikimedia.org/r/c/operations/dns/+/635965 (T261724)

Change 635990 had a related patch set uploaded (by Arturo Borrero Gonzalez; owner: Arturo Borrero Gonzalez):
[operations/dns@master] templates/57.15.185.in-addr.arpa: add missing PTR record for neutron virtual address

https://gerrit.wikimedia.org/r/635990

Change 635990 merged by Arturo Borrero Gonzalez:
[operations/dns@master] templates/57.15.185.in-addr.arpa: add missing PTR record for neutron virtual address

https://gerrit.wikimedia.org/r/635990

Mentioned in SAL (#wikimedia-cloud) [2020-10-23T11:30:56Z] <arturo> [codfw1dev] openstack --os-project-id cloudinfra-codfw1dev recordset create --type PTR --record nat.cloudgw.codfw1dev.wikimediacloud.org. --description "created by hand" 0-29.57.15.185.in-addr.arpa. 1.0-29.57.15.185.in-addr.arpa. (T261724)

aborrero added a subtask: Unknown Object (Task).Nov 30 2020, 10:47 AM

We consider this evaluation to be successfully completed.