Page MenuHomePhabricator

Andrew (Andrew Bogott)
User

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Sunday

  • Clear sailing ahead.

User Details

User Since
Nov 2 2014, 11:35 PM (448 w, 4 d)
Availability
Available
IRC Nick
andrewbogott
LDAP User
Unknown
MediaWiki User
Andrewbogott [ Global Accounts ]

Recent Activity

Today

Andrew added a comment to T338520: Shellbox is broken on wikitech-static due to disk fullness.

A ton of files in /srv/mediawiki/images/wikitech/archive but deleteArchivedFiles.php --delete says there's nothing to delete. It's tempting to just rm that directory anyway but it would be nice to know what's happening first...

Fri, Jun 9, 1:34 AM · serviceops, Shellbox, wikitech.wikimedia.org
Andrew added a comment to T338520: Shellbox is broken on wikitech-static due to disk fullness.

I will look if I can get an ssh connection. Worst case we can resize the instance and increase our monthly bill be a few bucks. Thanks for noticing!

Fri, Jun 9, 1:11 AM · serviceops, Shellbox, wikitech.wikimedia.org

Yesterday

Andrew added a comment to T336685: Request increased quota for wikiwho Cloud VPS project.

Thank you! I've reverted the quota change.

Thu, Jun 8, 7:20 PM · WikiWho, Community-Tech, Cloud-VPS (Quota-requests)
Andrew added a comment to T302154: quarry-nfs-1 went down; quarry is offline.

Ah, sorry, I should've read back further in the task! Yes, that host can+should be deleted.

Thu, Jun 8, 4:54 PM · cloud-services-team (Kanban), Quarry
Andrew added a comment to T328691: [toolsdb] Migrate linkwatcher db to Trove.

FYI, the thing with docker not starting is upstream bug https://storyboard.openstack.org/#!/story/2010599 which could use a comment or two in support

Thu, Jun 8, 4:33 PM · linkwatcher, cloud-services-team (FY2022/2023-Q4), Toolforge
Andrew added a comment to T302154: quarry-nfs-1 went down; quarry is offline.

Mentioned in SAL (#wikimedia-cloud) [2022-02-20T19:49:50Z] < @Andrew > moving nfs service from quarry-nfs-1 (bullseye) to quarry-nfs-2 (buster), testing to see if T302154 is a kernal or nfs-version issue

@Andrew can quarry-nfs-1 be deleted then? I don't see anything stored on it, /srv/quarry/project is empty and has no volume mounted.
All data is on quarry-nfs-2 and this one has the external volume mounted on it.
I turned off the instance to verify it's not used

Thu, Jun 8, 2:34 PM · cloud-services-team (Kanban), Quarry
Andrew added a comment to T338320: wmcs-image-create fails because of changes (breakage?) in VM snapshots.

The proposed fix works! I've submitted it upstream

Thu, Jun 8, 1:04 PM · Patch-For-Review, cloud-services-team, Cloud-VPS

Wed, Jun 7

Andrew added a comment to T338320: wmcs-image-create fails because of changes (breakage?) in VM snapshots.

I reduced the rdb chunk size in glance-api.conf but that didn't resolve the issue... now I see

Wed, Jun 7, 6:11 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
Andrew added a comment to T338320: wmcs-image-create fails because of changes (breakage?) in VM snapshots.

possibly https://bugs.launchpad.net/glance/+bug/1916482

Wed, Jun 7, 4:10 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
Andrew added a comment to T338320: wmcs-image-create fails because of changes (breakage?) in VM snapshots.

While creating the snapshot, I see these errors:

Wed, Jun 7, 4:05 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
Andrew added a comment to T338320: wmcs-image-create fails because of changes (breakage?) in VM snapshots.

I downloaded a variety of images ('openstack image save') and it's only the recent snapshot of a VM that seems broken:

Wed, Jun 7, 4:05 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
Andrew added a comment to T338262: Cinder volume stuck in Detaching state.
Wed, Jun 7, 3:58 PM · Upstream, cloud-services-team, Cloud-VPS
Andrew added a comment to T338262: Cinder volume stuck in Detaching state.

I downloaded a variety of images ('openstack image save') and it's only the recent snapshot of a VM that seems broken:

Wed, Jun 7, 1:14 PM · Upstream, cloud-services-team, Cloud-VPS
Andrew updated the task description for T338320: wmcs-image-create fails because of changes (breakage?) in VM snapshots.
Wed, Jun 7, 12:54 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
Andrew created T338320: wmcs-image-create fails because of changes (breakage?) in VM snapshots.
Wed, Jun 7, 12:40 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
Andrew closed T338195: puppet package versioning on Bookworm for cloud-vps, a subtask of T338188: Create Bookworm image, as Resolved.
Wed, Jun 7, 12:37 PM · cloud-services-team, Cloud-VPS
Andrew closed T338195: puppet package versioning on Bookworm for cloud-vps as Resolved.
Wed, Jun 7, 12:37 PM · Puppet, cloud-services-team, Cloud-VPS

Tue, Jun 6

Andrew closed T337882: Cannot create trove db in horizon/terraform, a subtask of T337559: TF code to test all the things, as Resolved.
Tue, Jun 6, 7:07 PM · Cloud-VPS
Andrew closed T337882: Cannot create trove db in horizon/terraform as Resolved.

'failed to create' generally signifies a quota issue. Indeed, the 'trove' project is out of security groups. I've increased the quota from 40 to 100.

Tue, Jun 6, 7:07 PM · Cloud-VPS
Andrew added a comment to T338262: Cinder volume stuck in Detaching state.

Getting things truly detached and ready for attachment required me to remove things from the database as well as detach in the CLI.

Tue, Jun 6, 7:03 PM · Upstream, cloud-services-team, Cloud-VPS
Andrew claimed T338262: Cinder volume stuck in Detaching state.
Tue, Jun 6, 6:59 PM · Upstream, cloud-services-team, Cloud-VPS
Andrew added a comment to T338262: Cinder volume stuck in Detaching state.

This is probably https://bugs.launchpad.net/charm-nova-compute/+bug/2019888. It doesn't just affect wikiwho volumes, I was able to reproduce in testlabs.

Tue, Jun 6, 6:44 PM · Upstream, cloud-services-team, Cloud-VPS
Andrew added a comment to T338195: puppet package versioning on Bookworm for cloud-vps.

I think this is a missing dependency in the package.

Tue, Jun 6, 5:33 PM · Puppet, cloud-services-team, Cloud-VPS
Andrew added a comment to T338195: puppet package versioning on Bookworm for cloud-vps.

I now have the proper version installing via cloud-init () but now when puppet is invoked it says:

Tue, Jun 6, 5:32 PM · Puppet, cloud-services-team, Cloud-VPS
Andrew added a comment to T338195: puppet package versioning on Bookworm for cloud-vps.

Thanks @Dzahn ! The challenge is to encode that in cloud-init yaml (which may or may not be possible)

Tue, Jun 6, 12:48 PM · Puppet, cloud-services-team, Cloud-VPS
Andrew renamed T338195: puppet package versioning on Bookworm for cloud-vps from puppet package versioning on Bookworm to puppet package versioning on Bookworm for cloud-vps.
Tue, Jun 6, 1:04 AM · Puppet, cloud-services-team, Cloud-VPS
Andrew created T338195: puppet package versioning on Bookworm for cloud-vps.
Tue, Jun 6, 1:04 AM · Puppet, cloud-services-team, Cloud-VPS
Andrew closed T338192: Failure to resolve 'puppet' in DNS on bookworm as Resolved.

I think the above patch is adequate for this.

Tue, Jun 6, 12:54 AM · Infrastructure-Foundations, cloud-services-team, Cloud-VPS
Andrew closed T338192: Failure to resolve 'puppet' in DNS on bookworm, a subtask of T338188: Create Bookworm image, as Resolved.
Tue, Jun 6, 12:54 AM · cloud-services-team, Cloud-VPS
Andrew added a comment to T338192: Failure to resolve 'puppet' in DNS on bookworm.

This is apparently known behavior:

Tue, Jun 6, 12:26 AM · Infrastructure-Foundations, cloud-services-team, Cloud-VPS

Mon, Jun 5

Andrew updated the task description for T338192: Failure to resolve 'puppet' in DNS on bookworm.
Mon, Jun 5, 11:28 PM · Infrastructure-Foundations, cloud-services-team, Cloud-VPS
Andrew created T338192: Failure to resolve 'puppet' in DNS on bookworm.
Mon, Jun 5, 11:27 PM · Infrastructure-Foundations, cloud-services-team, Cloud-VPS
Andrew claimed T338188: Create Bookworm image.
Mon, Jun 5, 10:02 PM · cloud-services-team, Cloud-VPS
Andrew closed T336670: openstack cli: clarify and document usage, a subtask of T330759: Modernize openstack rbac, as Resolved.
Mon, Jun 5, 7:39 PM · Patch-For-Review, Goal, cloud-services-team (FY2022/2023-Q4), Cloud-VPS
Andrew closed T336670: openstack cli: clarify and document usage as Resolved.

I've tried to describe the best practices here: https://wikitech.wikimedia.org/wiki/Portal:Cloud_VPS/Admin/Openstack_cli

Mon, Jun 5, 7:39 PM · cloud-services-team (FY2022/2023-Q4), Cloud-VPS
Andrew added a comment to T337806: mysterious oom issues on VMs.

This is happening again on tools-sgeweblight-10-14.tools.eqiad1.wikimedia.cloud

Mon, Jun 5, 1:30 PM · cloud-services-team, Cloud-VPS

Fri, Jun 2

Andrew added a comment to T336963: cloudcontrol2001-dev can't reach cloud-vps public IPs.

topranks> Cathal Mooney Pings are being blocked by 185.15.57.5 itself it seems:
1:39 PM https://www.irccloud.com/pastebin/TZm8TF4e/
Plain Text • 4 lines raw | line numbers
1:40 PM i.e. they are getting there but it's sending unreachable messages back
1:40 PM traffic does seem to get beyond the cloudgw
1:41 PM https://www.irccloud.com/pastebin/Dki06Xhv/
Plain Text • 8 lines raw | line numbers
1:46 PM They seem to be making it to cloudnet/neutron, which is generating the rejects:
1:46 PM https://www.irccloud.com/pastebin/PZTwUGW0/
Plain Text • 9 lines raw | line numbers
1:47 PM Not sure if that helps. What I can say is that nothing here is using the 172.20.x addressing, or this is not being affected by the new cloud-private networking.
1:47 PM cloudweb, cloudgw and cloudnet are on their existing addresses that they were prior to starting any of this
1:49 PM Seems there is a NAT rule to forward this traffic to/from VM IP 172.16.128.97
1:49 PM But that IP is unreachable from the cloudnet for some reason
1:50 PM root@cloudnet2005-dev:/home/cmooney# ip neigh show 172.16.128.97
1:50 PM 172.16.128.97 dev qr-21e10025-d4 FAILED
1:51 PM It can ping other VMs so I think the issue isn't with cloudnet2005 connection to the instance vlan
1:51 PM https://www.irccloud.com/pastebin/vSa9SoOM/
Plain Text • 4 lines raw | line numbers
1:53 PM TL;DR - I don't think this is a physical network issue, and it's not using any of the new components
1:57 PM cloudnet2005-dev can't reach VM tools-codfw1dev-bastion-2 for some reason

Fri, Jun 2, 8:02 PM · User-aborrero, Patch-For-Review, cloud-services-team
Andrew committed rCCKB6ba2e9041e2b: Move codfw cloudcontrol nodes to codfw.wmnet (authored by Andrew).
Move codfw cloudcontrol nodes to codfw.wmnet
Fri, Jun 2, 2:51 PM
Andrew added a comment to T337961: Clean up clouddb1021.

@Ladsgroup can I assign this to you?

Fri, Jun 2, 1:13 PM · Data-Engineering, DBA

Thu, Jun 1

Andrew added a comment to T336587: cloudservices[2004/2005]-dev & cloudweb2002-dev: connect them to cloudsw so they can have cloud-private vlan.

I haven't dug much, but designate is currently failing on cloudservices200[45]-dev because the services on that host are unable to contact mysql on cloudcontrols:

Thu, Jun 1, 11:57 PM · ops-codfw, cloud-services-team, SRE, netops, Infrastructure-Foundations, Cloud-VPS

Wed, May 31

Andrew added a comment to T332734: Use postgres instead of sqlite for backy2.

sgtm!

Wed, May 31, 3:46 PM · cloud-services-team (FY2022/2023-Q4), Cloud-VPS
Andrew added a comment to T337744: Request creation of krehel VPS project.

To expand on @Aklapper's comment -- the distinction (for me) is whether the application involves community collaboration, or if the project is essentially a 'laptop in the cloud'. If you're doing things that you could easily do on a local box then we're unlikely to approve. If, on the other hand, you need public-facing things or persistent services (a web service, a subscription to an external event stream, a project that has more than one person working on the same host) then we would be more likely to consider it as a cloud-vps candidate.

Wed, May 31, 3:45 PM · Cloud-VPS (Project-requests)
Andrew added a comment to T336669: Decision request - How to provide a way to install system dependencies for buildpack-based images.

Option 2 seems like the right call to me. I'm curious about the non-free concern... would we be limiting install to particular repos, or would users also be able to inject non-free repos before installing packages?

Wed, May 31, 3:41 PM · Toolforge Build Service (Iteration 15), Cloud Services Proposals
Andrew closed T337732: widespread sssd failures on cloud-vps as Resolved.

Out of an abundance of caution, I fixed these by hand. Everything seems OK now but the issue will likely recur if a grub update is pushed out for Buster again.

Wed, May 31, 3:04 AM · cloud-services-team, Cloud-VPS
Andrew created T337806: mysterious oom issues on VMs.
Wed, May 31, 2:37 AM · cloud-services-team, Cloud-VPS

Tue, May 30

Andrew added a comment to T337732: widespread sssd failures on cloud-vps.

This seems to only happen to hosts with /dev/vda rather than /dev/sda. But it doesn't happen on ALL hosts with /dev/vda.

Tue, May 30, 1:29 PM · cloud-services-team, Cloud-VPS
Andrew added a comment to T337732: widespread sssd failures on cloud-vps.

The list of affected instances (via sudo cumin --force A:all 'dpkg --list | grep grub-pc | grep iF'):

Tue, May 30, 1:12 PM · cloud-services-team, Cloud-VPS
Andrew added a comment to T337732: widespread sssd failures on cloud-vps.

Once the grub failure is dealt with, installing 'apt install libsss-certmap0' fixes puppet and ssh

Tue, May 30, 12:31 PM · cloud-services-team, Cloud-VPS
Andrew added a comment to T337732: widespread sssd failures on cloud-vps.

This is somehow related to grub. If I run 'apt install libnss-sss' it complains about a failure to install grub on /dev/vda3. Bypassing that failure by hand seems to get things unstuck but I'm not sure that unattended upgrades can do that.

Tue, May 30, 12:25 PM · cloud-services-team, Cloud-VPS
Andrew created T337732: widespread sssd failures on cloud-vps.
Tue, May 30, 12:17 PM · cloud-services-team, Cloud-VPS

Fri, May 26

Andrew added a comment to T330759: Modernize openstack rbac.

The openstack rbac work that I've been doing[0] has hit some serious roadbumps, but I'm swiftly approaching a stopping point. Y'all are long overdue for an update, so here's a summary of where I'm at.

Fri, May 26, 7:08 PM · Patch-For-Review, Goal, cloud-services-team (FY2022/2023-Q4), Cloud-VPS
Andrew created T337577: Replace use of openstack environment settings with clouds.yaml.
Fri, May 26, 5:33 PM · Patch-For-Review, cloud-services-team, Goal, Cloud-VPS

Thu, May 25

Andrew added a comment to T332734: Use postgres instead of sqlite for backy2.

The backup nodes now have postgres installed with a 'back2' user and a 'backy2' table and a backy-generated schema. To actually switch backy2 over from sqlite to posgres, merge the following patch:

Thu, May 25, 7:41 PM · cloud-services-team (FY2022/2023-Q4), Cloud-VPS
Andrew closed T337434: Need help resizing existing cinder volumes in wikiapiary project as Resolved.

Great, quota reverted.

Thu, May 25, 1:01 PM · cloud-services-team, Cloud-VPS, WikiApiary

Wed, May 24

Andrew added a comment to T337434: Need help resizing existing cinder volumes in wikiapiary project.

I temporarily increased your storage quota by 100G. That should let you create a new 100G volume, get whatever data you want onto that, and then resize wikidb, and then delete the stray volume left over.

Wed, May 24, 9:57 PM · cloud-services-team, Cloud-VPS, WikiApiary
Andrew closed T337196: Request trove db for Lutz Toolforge tool as Resolved.

@Chicocvenancio you should now have access to a new 'Lutz' project with 80Gb of database storage quota. You're among the first people to follow this workflow so please follow up here if you find things that are broken.

Wed, May 24, 5:53 PM · Toolforge (Quota-requests)
Andrew claimed T337413: Create wmcs cookbook for creating a trove-only cloud-vps project.
Wed, May 24, 4:01 PM · Cloud-VPS, Patch-For-Review, cloud-services-team
Andrew created T337413: Create wmcs cookbook for creating a trove-only cloud-vps project.
Wed, May 24, 4:00 PM · Cloud-VPS, Patch-For-Review, cloud-services-team
Andrew added a comment to T337196: Request trove db for Lutz Toolforge tool.

@Chicocvenancio I'm wrong, you're right... there's a different workflow toolforge+trove. I may to need to do some coding but we'll get this together.

Wed, May 24, 4:00 PM · Toolforge (Quota-requests)
Andrew added a comment to T336685: Request increased quota for wikiwho Cloud VPS project.

+1 approved. Please ping on this ticket when the old VM is removed so we can revert (some of) the quota increase.

Wed, May 24, 3:14 PM · WikiWho, Community-Tech, Cloud-VPS (Quota-requests)
Andrew added a comment to T336478: Request creation of gitlabsonarbot VPS project.

+1 approved, even though we'd rather use you as a test case in toolforge :)

Wed, May 24, 3:14 PM · Cloud-VPS (Project-requests)
Andrew added a parent task for T292195: Unable to create user in Trove postgres DB: T337396: Better support for Postgres on Trove.
Wed, May 24, 1:33 PM · cloud-services-team, Cloud-VPS
Andrew added a subtask for T337396: Better support for Postgres on Trove: T292195: Unable to create user in Trove postgres DB.
Wed, May 24, 1:33 PM · Cloud-VPS, cloud-services-team (FY2022/2023-Q4)
Andrew created T337396: Better support for Postgres on Trove.
Wed, May 24, 1:32 PM · Cloud-VPS, cloud-services-team (FY2022/2023-Q4)
Andrew added a comment to T337196: Request trove db for Lutz Toolforge tool.

Then wikitech is wrong :( Where is the page that sent you here?

Wed, May 24, 1:31 PM · Toolforge (Quota-requests)
Andrew added a comment to T306098: Cloud VPS "swift" project Stretch deprecation.

Hello again! I'm just checking in that someone is still tasked with resolving this after our recent team shuffles.

Wed, May 24, 1:26 PM · SRE-swift-storage, Cloud-VPS (Debian Stretch Deprecation)

Tue, May 23

Andrew reassigned T332734: Use postgres instead of sqlite for backy2 from Andrew to fnegri.

This is running on cloudbackup100[34].

Tue, May 23, 8:05 PM · cloud-services-team (FY2022/2023-Q4), Cloud-VPS
Andrew committed rLPRI568ef416f053: Add more fake backy2 passwords (authored by Andrew).
Add more fake backy2 passwords
Tue, May 23, 7:05 PM
Andrew committed rLPRI2c2a8efdc843: add fake backy2 postgres passwords (authored by Andrew).
add fake backy2 postgres passwords
Tue, May 23, 7:03 PM
Andrew renamed T332734: Use postgres instead of sqlite for backy2 from backy2 schema upgrades to Use postgres instead of sqlite for backy2.
Tue, May 23, 6:16 PM · cloud-services-team (FY2022/2023-Q4), Cloud-VPS
Andrew added a comment to T327726: Define and document the scope of work of the WMCS team..

I worked on this some today! There are still some blanks to fill in. I welcome feedback on the three-star support level column that I added.

Tue, May 23, 3:03 PM · cloud-services-team (FY2022/2023-Q4), wmcs-retrospective
Andrew closed T336104: constructors.json missing from python3-os-client-config debian package as Resolved.
Tue, May 23, 1:47 PM · cloud-services-team, Cloud-VPS
Andrew closed T301280: Move project-specific NFS mounts onto project-local NFS servers as Resolved.
Tue, May 23, 1:46 PM · Goal, cloud-services-team (FY2022/2023-Q4), Patch-For-Review, Cloud-VPS
Andrew closed T301280: Move project-specific NFS mounts onto project-local NFS servers, a subtask of T291405: [NFS] Reduce or eliminate bare-metal NFS servers, as Resolved.
Tue, May 23, 1:46 PM · cloud-services-team
Andrew closed T291405: [NFS] Reduce or eliminate bare-metal NFS servers as Resolved.

I think we're now down to the minimum -- just dumps (which are huge) are on metal and everything is on VMs.

Tue, May 23, 1:46 PM · cloud-services-team
Andrew closed T291405: [NFS] Reduce or eliminate bare-metal NFS servers, a subtask of T272395: Cloud: reduce NAT exceptions from cloud to production, as Resolved.
Tue, May 23, 1:45 PM · cloud-services-team, Epic
Andrew closed T324729: Find a sustainable local storage solution for cloud-vps as Resolved.

Now we have etcd running on cloudvirtlocal100[1-3] and things seem to be working fine.

Tue, May 23, 1:45 PM · Goal, cloud-services-team (FY2022/2023-Q4), Cloud-VPS
Andrew closed T324729: Find a sustainable local storage solution for cloud-vps, a subtask of T313984: cloudvirt1019: hpssacli not found, as Resolved.
Tue, May 23, 1:45 PM · Patch-For-Review, cloud-services-team (Hardware)
Andrew closed T336379: Openstack API slowdowns as Resolved.

real 0m14.480s
user 0m1.966s
sys 0m0.176s
root@cloudcontrol1005:~/foo#
root@cloudcontrol1005:~/foo#
root@cloudcontrol1005:~/foo# time ./bar.py

Tue, May 23, 1:42 PM · cloud-services-team, Cloud-VPS

Mon, May 22

Andrew added a comment to T337269: decommission labstore100[45].eqiad.wmne.

Two things:

Mon, May 22, 8:49 PM · SRE, ops-eqiad, cloud-services-team, decommission-hardware
Andrew assigned T337269: decommission labstore100[45].eqiad.wmne to Jclark-ctr.
Mon, May 22, 8:48 PM · SRE, ops-eqiad, cloud-services-team, decommission-hardware
Andrew updated the task description for T337269: decommission labstore100[45].eqiad.wmne.
Mon, May 22, 8:37 PM · SRE, ops-eqiad, cloud-services-team, decommission-hardware
Andrew updated the task description for T333477: Migrate tools nfs from labstore1004 server to a ceph-backed VM.
Mon, May 22, 8:32 PM · cloud-services-team (FY2022/2023-Q4), Patch-For-Review, Cloud-VPS
Andrew created T337269: decommission labstore100[45].eqiad.wmne.
Mon, May 22, 8:29 PM · SRE, ops-eqiad, cloud-services-team, decommission-hardware
Andrew updated the task description for T330759: Modernize openstack rbac.
Mon, May 22, 7:40 PM · Patch-For-Review, Goal, cloud-services-team (FY2022/2023-Q4), Cloud-VPS
Andrew added a comment to T330759: Modernize openstack rbac.

https://governance.openstack.org/tc/goals/selected/consistent-and-secure-rbac.html#the-issues-we-are-facing-with-scope-concept <- implies that enabling scope now is premature and may never be necessary. I'm torn between mourning the lost effort and cheering the simpler model

Mon, May 22, 7:40 PM · Patch-For-Review, Goal, cloud-services-team (FY2022/2023-Q4), Cloud-VPS

Thu, May 18

Andrew created T336963: cloudcontrol2001-dev can't reach cloud-vps public IPs.
Thu, May 18, 6:42 PM · User-aborrero, Patch-For-Review, cloud-services-team

Wed, May 17

Andrew committed rCCKB9102f6c8a8a1: Pass --os-cloud to openstack commands (authored by Andrew).
Pass --os-cloud to openstack commands
Wed, May 17, 6:01 PM
Andrew added a comment to T336670: openstack cli: clarify and document usage.

I do not think this is a result of my auth refactors, since nothing on the backend has been updated in eqiad1 yet. I suspect instead that this is something that's changed in the standalone nova CLI.

Wed, May 17, 4:50 PM · cloud-services-team (FY2022/2023-Q4), Cloud-VPS
Andrew added a comment to T336808: Inconsistent connectivity between cloudservices200[45]-dev and codfw1dev cloudcontrols.

cloudlb2001-dev seems unable to reach designate. "telnet 208.80.153.43 9001" and "telnet 208.80.153.44 9001" both fail from cloudlb2001-dev. That likely means that haproxy is not pooling designate on cloudlb2001.

Wed, May 17, 12:40 AM · User-aborrero, Patch-For-Review, cloud-services-team
Andrew added a comment to T336808: Inconsistent connectivity between cloudservices200[45]-dev and codfw1dev cloudcontrols.

Removing '185.15.57.24 openstack.codfw1dev.wikimediacloud.org' from /etc/hosts in cloudcontrol2001-dev sfixed the 503 problem. The intermittent timeouts are still happening.

Wed, May 17, 12:12 AM · User-aborrero, Patch-For-Review, cloud-services-team

Tue, May 16

Andrew added a comment to T336808: Inconsistent connectivity between cloudservices200[45]-dev and codfw1dev cloudcontrols.

"wget https://openstack.codfw1dev.wikimediacloud.org:29001" returns 503 no matter whether haproxy is or isn't running on cloudcontrol2005-dev. This surprises me since openstack.codfw1dev.wikimediacloud.org is a CNAME for cloudcontrol2005-dev.wikimedia.org. Some routing thing is happening that I don't understand.

Tue, May 16, 8:27 PM · User-aborrero, Patch-For-Review, cloud-services-team
Andrew added a comment to T336808: Inconsistent connectivity between cloudservices200[45]-dev and codfw1dev cloudcontrols.
telnet cloudservices2005-dev.wikimedia.org 9001
Tue, May 16, 7:40 PM · User-aborrero, Patch-For-Review, cloud-services-team
Andrew created T336808: Inconsistent connectivity between cloudservices200[45]-dev and codfw1dev cloudcontrols.
Tue, May 16, 7:18 PM · User-aborrero, Patch-For-Review, cloud-services-team
Andrew closed T336723: cloudlb vs. password_safelist, a subtask of T332153: cloudlb: prepare backends, as Resolved.
Tue, May 16, 7:13 PM · cloud-services-team (FY2022/2023-Q4)
Andrew closed T336723: cloudlb vs. password_safelist as Resolved.
Tue, May 16, 7:13 PM · Patch-For-Review, cloud-services-team
Andrew added a comment to T336723: cloudlb vs. password_safelist.

should the cidr for these hosts get their own network:constants entry? Otherwise I can pass around the list of lb nodes as parameters but I fear we'll wind up doing that a lot.

Tue, May 16, 1:14 PM · Patch-For-Review, cloud-services-team
Andrew updated the task description for T336723: cloudlb vs. password_safelist.
Tue, May 16, 4:44 AM · Patch-For-Review, cloud-services-team
Andrew created T336723: cloudlb vs. password_safelist.
Tue, May 16, 4:41 AM · Patch-For-Review, cloud-services-team

Mon, May 15

Andrew added a comment to T336104: constructors.json missing from python3-os-client-config debian package.

Confusing because the openstacksdk docs (https://docs.openstack.org/openstacksdk/latest/user/guides/connect_from_config.html) say:

Mon, May 15, 2:13 AM · cloud-services-team, Cloud-VPS