Page MenuHomePhabricator

Andrew (Andrew Bogott)
User

Today

  • No visible events.

Tomorrow

  • No visible events.

Wednesday

  • No visible events.

User Details

User Since
Nov 2 2014, 11:35 PM (597 w, 6 h)
Availability
Available
IRC Nick
andrewbogott
LDAP User
Unknown
MediaWiki User
Andrewbogott [ Global Accounts ]

Recent Activity

Fri, Apr 10

Andrew added a comment to T421911: Keystone logs no longer appearing in logstash.

The root of this seems to be that keystone has stopped logging with the name 'keystone,' instead using the name '<frozen importlib._bootstrap>'

Fri, Apr 10, 9:01 PM · Cloud-VPS, User-aborrero, cloud-services-team
Andrew added a comment to T422801: Consider allowing cumin access to all Cloud VPS VMs.

We can likely have cloudinit add a public cumin key to all hosts on creation, but I have a few thoughts:

Fri, Apr 10, 5:24 PM · tools-platform-team, Cloud-VPS
Andrew closed T416483: openstack flamingo: "'enabled' is a required property" for LDAP-managed users as Resolved.

Upstream has acknowledged this and think it's fixed.

Fri, Apr 10, 4:33 PM · Upstream, cloud-services-team, Cloud-VPS

Thu, Apr 9

Andrew added a comment to T422509: Cloud init and unattended upgrades while bootstrapping Trixie VMs.

We are attempting to only get the puppet package from the wikimedia repo (this is set by cloud-init at creation time)

Thu, Apr 9, 6:16 PM · Patch-For-Review, Cloud-VPS, cloud-services-team
Andrew added a comment to T422509: Cloud init and unattended upgrades while bootstrapping Trixie VMs.

The base image is based on a trixie VM with our puppet classes already applied (that happens at build time). So shouldn't /that/ have already downgraded puppet in the base image?

Thu, Apr 9, 2:22 PM · Patch-For-Review, Cloud-VPS, cloud-services-team

Wed, Apr 8

Andrew renamed T422515: wmcs cookbook "--project" arg is ambiguous, could mean project id or project name from Handle project IDs with dash in cloud cookbooks / openstack API to wmcs cookbook "--project" arg is ambiguous, could mean project id or project name.
Wed, Apr 8, 8:45 PM · Cloud-VPS, cloud-services-team
Andrew triaged T422515: wmcs cookbook "--project" arg is ambiguous, could mean project id or project name as Medium priority.

For starters we should probably look for places that take a --project arg and convert them to either --project-id or --project-name depending on what the code does.

Wed, Apr 8, 8:44 PM · Cloud-VPS, cloud-services-team
Andrew triaged T422538: Connection with `k8s.tools.eqiad1.wikimedia.cloud` hits SSL error as Medium priority.
Wed, Apr 8, 8:42 PM · cloud-services-team, Toolforge
Andrew added a comment to T422538: Connection with `k8s.tools.eqiad1.wikimedia.cloud` hits SSL error.

@Nokib_Sarkar have you seen this happen on multiple occasions, or just several times on the 7th specifically? (I want to make sure it's not a side-effect of maintenance activity.)

Wed, Apr 8, 8:42 PM · cloud-services-team, Toolforge
Andrew triaged T422509: Cloud init and unattended upgrades while bootstrapping Trixie VMs as Medium priority.

Do you have any theory (you being @elukey and @fgiunchedi) about why that happened on this exact instance? I just checked and we have around 100 running Trixie VMs so presumably cloud-init works properly most of the time.

Wed, Apr 8, 8:40 PM · Patch-For-Review, Cloud-VPS, cloud-services-team
Andrew closed T422462: Elasticsearch credential request for techactivity as Resolved.

This is done, and your creds should be in your envvars as TOOL_ELASTICSEARCH_USER and TOOL_ELASTICSEARCH_PASSWORD

Wed, Apr 8, 7:28 PM · Toolforge (Quota-requests)
Andrew added a comment to T422462: Elasticsearch credential request for techactivity.

How much disk is there to play with?

~400T of unreplicated space, split in 3 nodes

Wed, Apr 8, 7:17 PM · Toolforge (Quota-requests)
Andrew closed T420282: cloudcephmon2007-dev service implementation, a subtask of T416396: Q3:rack/setup/install cloudcephmon2007-dev, as Resolved.
Wed, Apr 8, 7:16 PM · SRE, DC-Ops, ops-codfw
Andrew closed T420282: cloudcephmon2007-dev service implementation as Resolved.
Wed, Apr 8, 7:16 PM · cloud-services-team, SRE, DC-Ops, ops-codfw
Andrew added a comment to T376400: Redesign wikitech-static.

The only thing left to do here (that I know if) is relative links being messed up in the initial wikitech-static landing page. Search works, and once you navigate to a valid page the links work.

Wed, Apr 8, 4:12 PM · Patch-For-Review, serviceops-radar, SRE-Unowned, SRE, wikitech.wikimedia.org
Andrew added a comment to T422462: Elasticsearch credential request for techactivity.

This is fine but I have to figure out how to do it! Docs seem to be at https://wikitech.wikimedia.org/wiki/Portal:Toolforge/Admin#Granting_a_tool_write_access_to_Elasticsearch

Wed, Apr 8, 3:47 PM · Toolforge (Quota-requests)
Andrew placed T422437: decommission cloudcephmon2004-dev up for grabs.
Wed, Apr 8, 3:32 PM · SRE, DC-Ops, ops-codfw, decommission-hardware
Andrew added a comment to T417393: Carry out controlled network switch down tests in cloud.

cloudcontrol nodes not in C8 (i.e. 1006/1007) though didn't seem to give up trying to connect to rabbitmq01.eqiad1.wikimediacloud.org:5671 whereas cloudcontrol1011 stopped trying to talk to rabbitmq01 as expected.

Wed, Apr 8, 2:24 PM · Cloud-VPS, cloud-services-team (FY2025/2026-Q3-Q4)

Tue, Apr 7

Andrew closed T361237: [infra] Upgrade Toolforge K8s etcd nodes to Bookworm, a subtask of T387005: [infra] Toolforge: migrate to Debian Bookworm or later, as Resolved.
Tue, Apr 7, 8:40 PM · Cloud-VPS (Debian Bullseye Deprecation), cloud-services-team, Toolforge
Andrew closed T361237: [infra] Upgrade Toolforge K8s etcd nodes to Bookworm as Resolved.
Tue, Apr 7, 8:40 PM · Toolforge (Toolforge iteration 26), User-Raymond_Ndibe, cloud-services-team, Kubernetes
Andrew added a comment to T422437: decommission cloudcephmon2004-dev.

decom script is failing:

Tue, Apr 7, 7:05 PM · SRE, DC-Ops, ops-codfw, decommission-hardware
Andrew added a comment to T421242: New flavor for the integration project with more vCPU and ephemeral disk space.

note to self: the old flavor (g4.cores8.ram24.disk20.ephemeral90.4xiops) is available in other projects (according to tofu-infra: ["integration", "search", "gitlab-runners", "wikiapiary", "zuul3"]) so I'm leaving it as is.

Tue, Apr 7, 2:58 PM · User-zeljkofilipin, Browser Test Platform, Continuous-Integration-Infrastructure, Jenkins, Continuous-Integration-Config, Cloud-VPS (Quota-requests)

Mon, Apr 6

Andrew added a subtask for T420282: cloudcephmon2007-dev service implementation: T422437: decommission cloudcephmon2004-dev.
Mon, Apr 6, 11:42 PM · cloud-services-team, SRE, DC-Ops, ops-codfw
Andrew added a parent task for T422437: decommission cloudcephmon2004-dev: T420282: cloudcephmon2007-dev service implementation.
Mon, Apr 6, 11:42 PM · SRE, DC-Ops, ops-codfw, decommission-hardware
Andrew created T422437: decommission cloudcephmon2004-dev.
Mon, Apr 6, 11:42 PM · SRE, DC-Ops, ops-codfw, decommission-hardware
Andrew closed T421025: Add PTR record for azwikimedia (mail.wikimedia.az), a subtask of T419582: Add floating IP and vanity domain for azwikimedia project, as Resolved.
Mon, Apr 6, 9:39 PM · cloud-services-team, Cloud-VPS (Quota-requests)
Andrew closed T421025: Add PTR record for azwikimedia (mail.wikimedia.az) as Resolved.

I think I've fixed both (!) things that were wiping out your ptr record. Please re-open if it vanishes again!

Mon, Apr 6, 9:39 PM · cloud-services-team, Cloud-VPS (Quota-requests)

Thu, Apr 2

Andrew added a comment to T421025: Add PTR record for azwikimedia (mail.wikimedia.az).

I'm pretty sure the issue was that tofu was removing the by-hand record and then the ip-updater adding the instance- record. I've added this to tofu, let's see if it persists now.

Thu, Apr 2, 12:47 PM · cloud-services-team, Cloud-VPS (Quota-requests)

Tue, Mar 31

Andrew closed T421832: wmcs.openstack.restart_openstack attempts to restart services on decom cloudcontrol1005 as Resolved.

I have not used it before, but 'designate-manage service clean' seems to be the tool needed. Now I see:

Tue, Mar 31, 3:36 PM · Cloud-VPS, tools-infrastructure-team, cloud-services-team (FY2025/2026-Q3-Q4)
Andrew created T421911: Keystone logs no longer appearing in logstash.
Tue, Mar 31, 2:40 PM · Cloud-VPS, User-aborrero, cloud-services-team
Andrew added a comment to T379550: openstack: keystone may be failing to add users to the bastion project in Keystone and/or LDAP.

I just encountered a variation on this: new user dpogorzelski was in the bastion project with the proper role, but 'project-bastion' didn't appear in 'groups dpogorzelski'. It's not unusual for that to take a few minutes but this time it had been hours.

Tue, Mar 31, 2:39 PM · Cloud-VPS, User-aborrero, cloud-services-team
Andrew renamed T377055: Openstack services should use standard HTTPS port from Keystone auth endpoint should use a standard HTTPS port to Openstack services should use standard HTTPS port.
Tue, Mar 31, 1:41 PM · Cloud-VPS, cloud-services-team

Mon, Mar 30

Andrew added a comment to T421739: Improvements to auto-generated floating ip ptr records.

Item #2 is already handled by the code. I don't know how/why my last attempt was clobbered; trying again.

Mon, Mar 30, 7:32 PM · Patch-For-Review, Cloud-VPS, cloud-services-team
Andrew added a comment to T421739: Improvements to auto-generated floating ip ptr records.

Item #2 is already handled by the code. I don't know how/why my last attempt was clobbered; trying again.

Mon, Mar 30, 7:30 PM · Patch-For-Review, Cloud-VPS, cloud-services-team
Andrew added a comment to T421739: Improvements to auto-generated floating ip ptr records.

For #2, taavi points out that there is a boilerplate description for auto-created records

Mon, Mar 30, 3:35 PM · Patch-For-Review, Cloud-VPS, cloud-services-team
Andrew updated the task description for T421739: Improvements to auto-generated floating ip ptr records.
Mon, Mar 30, 3:34 PM · Patch-For-Review, Cloud-VPS, cloud-services-team
Andrew created T421739: Improvements to auto-generated floating ip ptr records.
Mon, Mar 30, 3:10 PM · Patch-For-Review, Cloud-VPS, cloud-services-team

Fri, Mar 27

Andrew reopened T421025: Add PTR record for azwikimedia (mail.wikimedia.az), a subtask of T419582: Add floating IP and vanity domain for azwikimedia project, as Open.
Fri, Mar 27, 9:56 PM · cloud-services-team, Cloud-VPS (Quota-requests)
Andrew reopened T421025: Add PTR record for azwikimedia (mail.wikimedia.az) as "Open".
Fri, Mar 27, 9:56 PM · cloud-services-team, Cloud-VPS (Quota-requests)
Andrew added a comment to T421025: Add PTR record for azwikimedia (mail.wikimedia.az).

Hm, we have a bot that maintains those instance- addresses, it must've clobbered the one I made by hand. I will need to think about this a bit.

Fri, Mar 27, 9:54 PM · cloud-services-team, Cloud-VPS (Quota-requests)
Andrew added a comment to T420737: Support newer database engines on Trove.

Current status:

Fri, Mar 27, 4:24 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
Andrew added a comment to T420737: Support newer database engines on Trove.

Thank you @Don-vip! For better or worse there turns out to be no automated upgrade path; users will basically have to do a dump and import into a new engine. That said, if you want to try that (and, better yet, document it) I will ping you when the new engine is available.

Fri, Mar 27, 4:22 PM · Patch-For-Review, cloud-services-team, Cloud-VPS

Thu, Mar 26

Andrew added a comment to T416707: Sunsetting mirrors.wikimedia.org.

Hi folks! Any idea when this is likely to happen? I will need to coordinate for openstack nodes which use a bespoke openstack repo hosted on the mirror.

Thu, Mar 26, 10:33 PM · Infrastructure-Foundations, SRE
Andrew added a comment to T421242: New flavor for the integration project with more vCPU and ephemeral disk space.

+1 this seems fine; would you like us to also remove the older 8-core flavor or do you expect to use both in the future?

Thu, Mar 26, 4:40 PM · User-zeljkofilipin, Browser Test Platform, Continuous-Integration-Infrastructure, Jenkins, Continuous-Integration-Config, Cloud-VPS (Quota-requests)

Wed, Mar 25

Andrew added a comment to T420611: Disk quota increase for catalyst-dev.

+1 approved

Wed, Mar 25, 2:59 PM · Catalyst, Cloud-VPS (Quota-requests)

Tue, Mar 24

Andrew reassigned T408704: offline rackspace wikitech-static, online aws wikitech-static from Andrew to RobH.
Tue, Mar 24, 1:42 PM · Infrastructure-Foundations
Andrew added a comment to T408704: offline rackspace wikitech-static, online aws wikitech-static.

I've deleted all the things I can find in our rackspace account -- it should be empty or effectively empty. Over to you, Rob!

Tue, Mar 24, 1:41 PM · Infrastructure-Foundations
Andrew added a comment to T421054: Move all openstack rabbitmq queues to quorum.

because they are declared classic and not quorum by default.

Tue, Mar 24, 12:56 PM · cloud-services-team (FY2025/2026-Q3-Q4), Cloud-VPS
Andrew closed T421025: Add PTR record for azwikimedia (mail.wikimedia.az), a subtask of T419582: Add floating IP and vanity domain for azwikimedia project, as Resolved.
Tue, Mar 24, 2:35 AM · cloud-services-team, Cloud-VPS (Quota-requests)
Andrew closed T421025: Add PTR record for azwikimedia (mail.wikimedia.az) as Resolved.
andrew@bookworm:~/tofu-infra/resources/eqiad1-r/cloudinfra$ dig -x 185.15.56.85
Tue, Mar 24, 2:35 AM · cloud-services-team, Cloud-VPS (Quota-requests)
Andrew added a comment to T421025: Add PTR record for azwikimedia (mail.wikimedia.az).

users had not actually allocated a floating IP, but I have now done so. It is: 185.15.56.85

Tue, Mar 24, 2:25 AM · cloud-services-team, Cloud-VPS (Quota-requests)

Mon, Mar 23

Andrew created T420948: Power Supply - Status - issue on cloudbackup2003:9290.
Mon, Mar 23, 3:52 PM · SRE, ops-codfw, cloud-services-team, DC-Ops
Andrew created T420937: experiment with moving rabbitmq behind haproxy.
Mon, Mar 23, 2:50 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
Andrew placed T416394: Q3:rack/setup/install cloudcephosd1053 up for grabs.
Mon, Mar 23, 1:04 PM · ops-eqiad, SRE, DC-Ops
Andrew placed T416395: Q3:rack/setup/install cloudcephosd1054 up for grabs.
Mon, Mar 23, 1:04 PM · ops-eqiad, SRE, DC-Ops
Andrew placed T419892: Q3:rack/setup/install cloudcephosd105[56] up for grabs.
Mon, Mar 23, 1:04 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops

Fri, Mar 20

Andrew created T420737: Support newer database engines on Trove.
Fri, Mar 20, 2:32 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
Andrew added a comment to T419892: Q3:rack/setup/install cloudcephosd105[56].

eqiad folks: these hosts are untested hardware with a novel drive configuration. I do not expect partman to work on the first go!

Fri, Mar 20, 2:24 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
Andrew added a comment to T416394: Q3:rack/setup/install cloudcephosd1053.

eqiad folks: these hosts are untested hardware with a novel drive configuration. I do not expect partman to work on the first go!

Fri, Mar 20, 2:24 PM · ops-eqiad, SRE, DC-Ops
Andrew added a comment to T416395: Q3:rack/setup/install cloudcephosd1054.

eqiad folks: these hosts are untested hardware with a novel drive configuration. I do not expect partman to work on the first go!

Fri, Mar 20, 2:24 PM · ops-eqiad, SRE, DC-Ops
Andrew updated the task description for T419892: Q3:rack/setup/install cloudcephosd105[56].
Fri, Mar 20, 2:10 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
Andrew updated the task description for T416394: Q3:rack/setup/install cloudcephosd1053.
Fri, Mar 20, 2:09 PM · ops-eqiad, SRE, DC-Ops
Andrew updated the task description for T416395: Q3:rack/setup/install cloudcephosd1054.
Fri, Mar 20, 2:09 PM · ops-eqiad, SRE, DC-Ops
Andrew updated the task description for T416394: Q3:rack/setup/install cloudcephosd1053.
Fri, Mar 20, 2:08 PM · ops-eqiad, SRE, DC-Ops
Andrew updated the task description for T419892: Q3:rack/setup/install cloudcephosd105[56].
Fri, Mar 20, 2:08 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
Andrew added a comment to T420532: Request creation of etherpads3 VPS project.

I will say that putting hyphens in the project name makes my brain hurt, but mostly because I still think of the project name as the primary key and I know that the S3 gateway we use does not allow hyphens in names.

Fri, Mar 20, 1:50 PM · Tool-etherpad-backup, Cloud-VPS (Project-requests)

Thu, Mar 19

Andrew added a comment to T420532: Request creation of etherpads3 VPS project.

+1, seems good

Thu, Mar 19, 3:04 PM · Tool-etherpad-backup, Cloud-VPS (Project-requests)

Wed, Mar 18

Andrew triaged T419582: Add floating IP and vanity domain for azwikimedia project as Medium priority.
Wed, Mar 18, 2:50 PM · cloud-services-team, Cloud-VPS (Quota-requests)
Andrew added a comment to T420213: Deprecate and remove 'bastion-restricted' hosts.

FWIW I see some value in having the cumin authorized_keys entries have an IP restriction on a host not accessible to most people, but otherwise agree with the proposal.

Wed, Mar 18, 2:50 PM · Cloud-VPS, cloud-services-team
Andrew triaged T420213: Deprecate and remove 'bastion-restricted' hosts as Low priority.
Wed, Mar 18, 2:48 PM · Cloud-VPS, cloud-services-team
Andrew triaged T420282: cloudcephmon2007-dev service implementation as Medium priority.
Wed, Mar 18, 2:47 PM · cloud-services-team, SRE, DC-Ops, ops-codfw

Tue, Mar 17

Andrew closed T406516: Upgrade openstack to version 'Flamingo' as Resolved.
Tue, Mar 17, 2:05 PM · Cloud-VPS, cloud-services-team
Andrew closed T405117: Update our Horizon release to 2025.2 as Resolved.
Tue, Mar 17, 1:47 PM · cloud-services-team, Horizon
Andrew closed T405117: Update our Horizon release to 2025.2, a subtask of T406516: Upgrade openstack to version 'Flamingo', as Resolved.
Tue, Mar 17, 1:47 PM · Cloud-VPS, cloud-services-team
Andrew added a comment to T406516: Upgrade openstack to version 'Flamingo'.

This is all done except for the cloudvirtlocal hosts.

Tue, Mar 17, 2:43 AM · Cloud-VPS, cloud-services-team

Mon, Mar 16

Andrew created T420282: cloudcephmon2007-dev service implementation.
Mon, Mar 16, 10:02 PM · cloud-services-team, SRE, DC-Ops, ops-codfw
Andrew closed T401899: Reimage cloudgw hosts to Trixie, a subtask of T375217: Complete upgrading WMCS bare metal hosts to Trixie, as Resolved.
Mon, Mar 16, 2:36 PM · Cloud-VPS, cloud-services-team
Andrew closed T401899: Reimage cloudgw hosts to Trixie as Resolved.
Mon, Mar 16, 2:36 PM · Cloud-VPS, cloud-services-team
Andrew created T420213: Deprecate and remove 'bastion-restricted' hosts.
Mon, Mar 16, 1:52 PM · Cloud-VPS, cloud-services-team

Mar 12 2026

Andrew claimed T408704: offline rackspace wikitech-static, online aws wikitech-static.
Mar 12 2026, 7:52 PM · Infrastructure-Foundations
Andrew added a comment to T408704: offline rackspace wikitech-static, online aws wikitech-static.

This VM is now shut off. They will still bill us for the space, but let's give this a week before we delete things and close out the account.

Mar 12 2026, 7:52 PM · Infrastructure-Foundations
Andrew added a comment to T408704: offline rackspace wikitech-static, online aws wikitech-static.

Just as soon as I can get pwstore working I will shut down this host and wait for screams.

Mar 12 2026, 6:41 PM · Infrastructure-Foundations
Andrew added a comment to T419582: Add floating IP and vanity domain for azwikimedia project.

This was discussed and approved during today's weekly meeting.

Mar 12 2026, 6:29 PM · cloud-services-team, Cloud-VPS (Quota-requests)
Andrew closed T371382: Collect access metrics from cloud-vps web proxy as Declined.

It's been a long time since we discussed this and no one is working on it. The privacy implications are a bit messy so I'm just closing.

Mar 12 2026, 4:08 PM · Cloud-VPS, cloud-services-team
Andrew placed T419738: decommission cloudgw2002-dev up for grabs.
Mar 12 2026, 3:53 PM · SRE, DC-Ops, ops-codfw, cloud-services-team, decommission-hardware
Andrew added a comment to T417393: Carry out controlled network switch down tests in cloud.

Thank you, I looked at cloudvirt.drain though I couldn't find an option specifically to make sure the destination host is not in the rack we are draining. Maybe not a huge issue though? The scenario I'm thinking about is we're draining a cloudvirt and all/most VMs migrate to another cloudvirt in the same rack, of course things would converge eventually at the risk of moving VMs a bunch of times.

Mar 12 2026, 1:46 PM · Cloud-VPS, cloud-services-team (FY2025/2026-Q3-Q4)

Mar 11 2026

Andrew updated the task description for T419738: decommission cloudgw2002-dev.
Mar 11 2026, 8:41 PM · SRE, DC-Ops, ops-codfw, cloud-services-team, decommission-hardware
Andrew added a comment to T419738: decommission cloudgw2002-dev.

decom script says:

Mar 11 2026, 8:32 PM · SRE, DC-Ops, ops-codfw, cloud-services-team, decommission-hardware
Andrew added a parent task for T419738: decommission cloudgw2002-dev: Unknown Object (Task).
Mar 11 2026, 5:20 PM · SRE, DC-Ops, ops-codfw, cloud-services-team, decommission-hardware
Andrew created T419738: decommission cloudgw2002-dev.
Mar 11 2026, 5:20 PM · SRE, DC-Ops, ops-codfw, cloud-services-team, decommission-hardware
Andrew closed T418765: cloudgw2004-dev service implementation as Resolved.

2004-dev is up and working now, thanks to @taavi and a reboot.

Mar 11 2026, 5:19 PM · Cloud-VPS, cloud-services-team
Andrew added a comment to T417393: Carry out controlled network switch down tests in cloud.

Oh, to check the maintenance state of a host you want to look at the host aggregates. Docs for that here: https://wikitech.wikimedia.org/wiki/Portal:Cloud_VPS/Admin/Host_aggregates

Mar 11 2026, 3:45 PM · Cloud-VPS, cloud-services-team (FY2025/2026-Q3-Q4)
Andrew added a comment to T417393: Carry out controlled network switch down tests in cloud.

Plan is to grab another announced maint window on Tues March 17th to resume the testing.

I have also opened subtasks for the remaining racks, one notable difference is that those do contain cloudvirt hosts. @Andrew what's the recommended procedure to temporarily drain a rack of VMs and then put them back? So far I found wmcs.openstack.cloudvirt.drain cookbook mentioned on wikitech

Mar 11 2026, 3:44 PM · Cloud-VPS, cloud-services-team (FY2025/2026-Q3-Q4)

Mar 10 2026

Andrew closed T419558: Horizon logins failing in codfw1dev as Resolved.
Mar 10 2026, 5:58 PM · cloud-services-team, Horizon
Andrew created T419558: Horizon logins failing in codfw1dev.
Mar 10 2026, 3:19 PM · cloud-services-team, Horizon
Andrew added a comment to T419508: Debug and understand why bringing down cloud net/gw/lb resulted in cloud vps network down.

It seems that the network services on 1006 were manually (or via cookbook) set to down. That would certainly explain the failover.

Mar 10 2026, 2:50 PM · Cloud-VPS, cloud-services-team (FY2025/2026-Q3-Q4)

Mar 9 2026

Andrew added a comment to T418813: Quota increases for gitlab-runners.

Thanks, @Andrew

Hey folks, sorry about the not-very-coherent response on this. The bottom line is that compute+storage resources are not an issue, we can definitely provide what you need.

The thing that is in flux our commitment to magnum:

Since the resources will be needed regardless of using Magnum (the volume requirements MAY be different without Magnum, not sure) can we go ahead with the quota increase and have the Magnum discussion in a different venue?

Mar 9 2026, 8:13 PM · User-dcaro, Cloud-VPS (Quota-requests)
Andrew added a comment to T419182: Request creation of lingualibre VPS project.

+1

Mar 9 2026, 2:26 PM · User-dcaro, Lingua-Libre, Hackathon-Northwestern-Europe-2026, Cloud-VPS (Project-requests)
Andrew lowered the priority of T416483: openstack flamingo: "'enabled' is a required property" for LDAP-managed users from Medium to Low.

This is resolved in codfw1dev. Task is still open because I'm trying to track (and fix?) the issue upstream.

Mar 9 2026, 2:25 PM · Upstream, cloud-services-team, Cloud-VPS

Mar 6 2026

Andrew added a comment to T417736: Request creation of azwikimedia VPS project.

So -- I reiterate that I don't think a cloud-vps project is the right way to handle things like this. If I were solving the issues you're solving, I would definitely look for a hosted service rather than building my own infra from scratch, and I would start /that/ process by talking to other affiliates and asking how they are solving the problem.

Mar 6 2026, 3:07 PM · User-dcaro, Cloud-VPS (Project-requests)