
Andrew (Andrew Bogott)
User

User Details

User Since
Nov 2 2014, 11:35 PM (580 w, 10 h)
Availability
Available
IRC Nick
andrewbogott
LDAP User
Unknown
MediaWiki User
Andrewbogott

Recent Activity

Today

Andrew added a comment to T412506: Investigation into ToolforgeKubernetesNodeNotReady 2025-12-12 page.

I got a round of these pages this evening. The first page was at 00:10UTC.

Mon, Dec 15, 2:38 AM · Toolforge, cloud-services-team

Fri, Dec 12

Andrew added a comment to T408387: CloudVPS instance for ProVe.

A little more info here: https://wikitech.wikimedia.org/wiki/Help:Cloud_VPS_instances

Fri, Dec 12, 4:30 PM · cloud-services-team (FY2025/26-Q1-Q2), Cloud-VPS (Project-requests)
Andrew added a comment to T408387: CloudVPS instance for ProVe.

You should be able to log in with your developer account credentials on https://horizon.wikimedia.org/project/ -- that is the web interface for managing things in your project.

Fri, Dec 12, 4:29 PM · cloud-services-team (FY2025/26-Q1-Q2), Cloud-VPS (Project-requests)
Andrew added a comment to T408387: CloudVPS instance for ProVe.

@NathanGavenski I'm on the run today but I glanced at the 'prove' project on Horizon and I don't see any VMs there, so it doesn't look like there's anywhere to ssh to. The domain you're seeing on openstack-browser is meant to be a container for future services (e.g. myinternalservice.svc.prove.eqiad1.wikimedia.cloud); it doesn't itself refer to any actual host or destination.

Fri, Dec 12, 10:53 AM · cloud-services-team (FY2025/26-Q1-Q2), Cloud-VPS (Project-requests)

Thu, Dec 11

Andrew updated subscribers of T412428: Wikidata full .json.gz dumps not published since 20250625.

This is likely related to the refactoring done for T352650. WMCS staff is traveling this week so before I dig too deep I'm hoping @BTullis will appear with a quick fix.

Thu, Dec 11, 9:33 PM · Wikidata, Data-Engineering, Dumps-Generation, Wikidata data dumps
Andrew added a comment to T412349: Start collecting uptime metrics for toolforge.

I'm hoping someone will link me to an existing dash on grafana.wmcloud.org and then we'll be done :)

Thu, Dec 11, 9:43 AM · cloud-services-team, Toolforge
Andrew created T412349: Start collecting uptime metrics for toolforge.
Thu, Dec 11, 9:43 AM · cloud-services-team, Toolforge

Sun, Dec 7

Andrew closed T411751: Temporary quota increase for mwoffliner project as Resolved.

Done. Please re-open and follow up on this task when you finish the migration so we can revert the quota change.

Sun, Dec 7, 2:53 PM · cloud-services-team, affects-Kiwix-and-openZIM, Cloud-VPS (Quota-requests)

Fri, Dec 5

Andrew added a comment to T411545: Update make-toolforge-user-list.py.

nice work komla!

Fri, Dec 5, 3:15 PM · cloud-services-team
Andrew added a comment to T411545: Update make-toolforge-user-list.py.

There are 1297 users in eqiad1 with the 'member' role. It's easy for me (or you) to dump the list of userids ("openstack role assignment list --role member") but correlating the usernames back to their email addresses will take a few lines of code.

Fri, Dec 5, 2:07 AM · cloud-services-team
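
The "few lines of code" mentioned in the comment above might look roughly like the sketch below. It assumes the role assignments were dumped as JSON (for example by adding "-f json" to the command quoted above) and that each row carries a "User" key. The email lookup table is hypothetical, since the comment does not say where addresses would come from.

```python
import json


def member_emails(assignments_json, email_by_user_id):
    """Map each distinct user id in a role-assignment dump to an email.

    assignments_json: JSON text, assumed to be a list of rows that each
        have a "User" key (an assumption about the dump format).
    email_by_user_id: hypothetical lookup table from user id to email.
    Users without a known address map to None.
    """
    rows = json.loads(assignments_json)
    # A user can hold the role in many projects, so deduplicate first.
    # Rows with an empty "User" (e.g. group assignments) are skipped.
    user_ids = sorted({row["User"] for row in rows if row.get("User")})
    return {uid: email_by_user_id.get(uid) for uid in user_ids}


# Made-up sample data, for illustration only.
sample = json.dumps([
    {"Role": "member", "User": "alice", "Project": "tools"},
    {"Role": "member", "User": "bob", "Project": "tools"},
    {"Role": "member", "User": "alice", "Project": "prove"},
    {"Role": "member", "User": "", "Group": "somegroup", "Project": "tools"},
])
lookup = {"alice": "alice@example.org"}
result = member_emails(sample, lookup)
```

Users with no entry in the lookup table come back as None, which makes it easy to spot accounts that still need a manual check.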

Thu, Dec 4

Andrew added a comment to T361237: [infra] Upgrade Toolforge K8s etcd nodes to Bookworm.

I'm partway into this process but everyone is about to travel so I'm rolling things back to Bullseye everywhere.

Thu, Dec 4, 3:40 PM · User-Raymond_Ndibe, cloud-services-team, Kubernetes, Toolforge

Tue, Dec 2

Andrew created T411545: Update make-toolforge-user-list.py.
Tue, Dec 2, 7:25 PM · cloud-services-team
Andrew claimed T375217: Complete upgrading WMCS bare metal hosts to Trixie.
Tue, Dec 2, 5:05 PM · Cloud-VPS, cloud-services-team
Andrew closed T376277: Reimage cloudweb hosts to trixie, a subtask of T375217: Complete upgrading WMCS bare metal hosts to Trixie, as Resolved.
Tue, Dec 2, 5:04 PM · Cloud-VPS, cloud-services-team
Andrew closed T376277: Reimage cloudweb hosts to trixie as Resolved.
Tue, Dec 2, 5:04 PM · Striker, Horizon, cloud-services-team, wikitech.wikimedia.org
Andrew closed T410846: wmcs Trixie kernel reboots as Resolved.
Tue, Dec 2, 5:03 PM · Cloud-VPS, cloud-services-team

Mon, Dec 1

Andrew added a comment to T409579: Upgrade cloud-vps hosts to Debian Trixie.

Just now I ran into this error during reimage:

RuntimeError: Host is in BIOS mode but needs to be UEFI as it is connected to a Nokia switch

Is that right? Do we need to convert preseed to uefi recipes before reimaging for everything plugged into a nokia?

Mon, Dec 1, 2:29 PM · Cloud-VPS, cloud-services-team
Andrew added a comment to T410846: wmcs Trixie kernel reboots.

The remaining reboots are blocked by VMs that can't be drained. I hope to have that resolved tomorrow when mass reboots are scheduled.

Mon, Dec 1, 2:24 PM · Cloud-VPS, cloud-services-team
Andrew added a comment to T409579: Upgrade cloud-vps hosts to Debian Trixie.

Just now I ran into this error during reimage:

Mon, Dec 1, 12:35 AM · Cloud-VPS, cloud-services-team

Sun, Nov 30

Andrew placed T410403: Q2:rack/setup/install Toolforge up for grabs.
Sun, Nov 30, 10:46 PM · SRE, DC-Ops, ops-eqiad

Wed, Nov 26

Andrew added a comment to T410265: [tofu-infra] "tofu plan" failing in codfw.

This is probably unrelated, but it /is/ a concern with Ceph and trixie (right now the ceph hosts themselves are running bookworm but the radosgw is on Trixie.)

Wed, Nov 26, 11:17 PM · Cloud-VPS, cloud-services-team
Andrew added a comment to T410265: [tofu-infra] "tofu plan" failing in codfw.

Tests suggest that the 'NoneType' error happens 100% of the time when cloudcontrol2005-dev is the radosgw backend, and 0% of the time with either other cloudcontrol as the backend.

Wed, Nov 26, 9:04 PM · Cloud-VPS, cloud-services-team
Andrew closed T411025: eqiad row C/D cloud hosts pending migration, a subtask of T404609: eqiad: rows C/D Upgrade Tracking, as Resolved.
Wed, Nov 26, 7:06 PM · SRE, Infrastructure-Foundations, netops, DC-Ops, ops-eqiad
Andrew closed T411025: eqiad row C/D cloud hosts pending migration as Resolved.
Wed, Nov 26, 7:06 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), cloud-services-team, ops-eqiad, DC-Ops, SRE
Andrew updated the task description for T411025: eqiad row C/D cloud hosts pending migration.
Wed, Nov 26, 4:47 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), cloud-services-team, ops-eqiad, DC-Ops, SRE
Andrew updated the task description for T411025: eqiad row C/D cloud hosts pending migration.
Wed, Nov 26, 4:07 PM · Data-Platform-SRE (2025.11.07 - 2025.11.28), cloud-services-team, ops-eqiad, DC-Ops, SRE
Andrew added a comment to T410265: [tofu-infra] "tofu plan" failing in codfw.

Tests suggest that the 'NoneType' error happens 100% of the time when cloudcontrol2005-dev is the radosgw backend, and 0% of the time with either other cloudcontrol as the backend.

Wed, Nov 26, 4:15 AM · Cloud-VPS, cloud-services-team

Tue, Nov 25

Andrew added a comment to T410403: Q2:rack/setup/install Toolforge.

I think this is ready for dcops now but please lmk what I forgot!

Tue, Nov 25, 7:45 PM · SRE, DC-Ops, ops-eqiad
Andrew updated the task description for T410403: Q2:rack/setup/install Toolforge.
Tue, Nov 25, 7:37 PM · SRE, DC-Ops, ops-eqiad
Andrew updated the task description for T410403: Q2:rack/setup/install Toolforge.
Tue, Nov 25, 7:29 PM · SRE, DC-Ops, ops-eqiad
Andrew updated the task description for T410846: wmcs Trixie kernel reboots.
Tue, Nov 25, 12:22 AM · Cloud-VPS, cloud-services-team

Mon, Nov 24

Andrew updated the task description for T410846: wmcs Trixie kernel reboots.
Mon, Nov 24, 10:43 PM · Cloud-VPS, cloud-services-team
Andrew closed T397648: Keystone not cleaning up ldap groups on project delete as Resolved.

This is now taken care of by the project deletion cookbook.

Mon, Nov 24, 10:32 PM · Cloud-VPS, cloud-services-team
Andrew added a comment to T410403: Q2:rack/setup/install Toolforge.

Assigning to myself pending a decision about hostnames

Mon, Nov 24, 10:10 PM · SRE, DC-Ops, ops-eqiad
Andrew updated the task description for T410403: Q2:rack/setup/install Toolforge.
Mon, Nov 24, 10:09 PM · SRE, DC-Ops, ops-eqiad
Andrew updated the task description for T410403: Q2:rack/setup/install Toolforge.
Mon, Nov 24, 10:06 PM · SRE, DC-Ops, ops-eqiad
Andrew added a comment to T410846: wmcs Trixie kernel reboots.

this is the price I pay for being an early adopter

Mon, Nov 24, 2:15 PM · Cloud-VPS, cloud-services-team

Sun, Nov 23

Andrew closed T410784: cloudcephosd bios upgrades as Resolved.
Sun, Nov 23, 8:59 PM · Cloud-VPS, cloud-services-team
Andrew updated the task description for T410784: cloudcephosd bios upgrades.
Sun, Nov 23, 8:25 PM · Cloud-VPS, cloud-services-team
Andrew updated the task description for T410846: wmcs Trixie kernel reboots.
Sun, Nov 23, 7:47 PM · Cloud-VPS, cloud-services-team
Andrew updated the task description for T410784: cloudcephosd bios upgrades.
Sun, Nov 23, 7:27 PM · Cloud-VPS, cloud-services-team
Andrew added a project to T410846: wmcs Trixie kernel reboots: Cloud-VPS.
Sun, Nov 23, 7:23 PM · Cloud-VPS, cloud-services-team
Andrew created T410846: wmcs Trixie kernel reboots.
Sun, Nov 23, 7:23 PM · Cloud-VPS, cloud-services-team

Fri, Nov 21

Andrew created T410784: cloudcephosd bios upgrades.
Fri, Nov 21, 8:18 PM · Cloud-VPS, cloud-services-team
Andrew added a comment to T410294: Site: codfw 1 VM request for codfw1dev CAS test/dev, hostname: cloudidp2001-dev.

(I'm moving this to a private address; lots of cookbook things to come)

Fri, Nov 21, 5:24 PM · Cloud-VPS, cloud-services-team, vm-requests, Infrastructure-Foundations, SRE

Thu, Nov 20

Andrew created T410659: Standardize on opentofu management of all projects in the default keystone domain.
Thu, Nov 20, 4:12 PM · Patch-For-Review, Cloud-VPS, cloud-services-team

Tue, Nov 18

Andrew created T410470: cloudvirt1071 crash.
Tue, Nov 18, 11:48 PM · cloud-services-team (FY2025/26-Q1-Q2), Cloud-VPS

Mon, Nov 17

Andrew renamed T410294: Site: codfw 1 VM request for codfw1dev CAS test/dev, hostname: cloudidp2001-dev from Site: codfw 1 VM request for codfw1dev CAS test/dev, hostname: cloudidp to Site: codfw 1 VM request for codfw1dev CAS test/dev, hostname: cloudidp2001-dev.
Mon, Nov 17, 8:07 PM · Cloud-VPS, cloud-services-team, vm-requests, Infrastructure-Foundations, SRE
Andrew updated the task description for T410294: Site: codfw 1 VM request for codfw1dev CAS test/dev, hostname: cloudidp2001-dev.
Mon, Nov 17, 6:26 PM · Cloud-VPS, cloud-services-team, vm-requests, Infrastructure-Foundations, SRE
Andrew added a comment to T410294: Site: codfw 1 VM request for codfw1dev CAS test/dev, hostname: cloudidp2001-dev.

Looks good, but you definitely don't need 8G of RAM, 4G should be more than enough for this workload.

Mon, Nov 17, 6:25 PM · Cloud-VPS, cloud-services-team, vm-requests, Infrastructure-Foundations, SRE
Andrew renamed T410294: Site: codfw 1 VM request for codfw1dev CAS test/dev, hostname: cloudidp2001-dev from Site: 1 VM %request for codfw1dev CAS test/dev, hostname: cloudidp to Site: codfw 1 VM request for codfw1dev CAS test/dev, hostname: cloudidp.
Mon, Nov 17, 5:25 PM · Cloud-VPS, cloud-services-team, vm-requests, Infrastructure-Foundations, SRE
Andrew added a comment to T409328: sso failure in codfw1dev (labtesthorizon.wikimedia.org).

I'm leaning towards moving this service to a separate host. Ganeti request is T410294

Mon, Nov 17, 5:23 PM · Infrastructure-Foundations, CAS-SSO, cloud-services-team, Cloud-VPS
Andrew created T410294: Site: codfw 1 VM request for codfw1dev CAS test/dev, hostname: cloudidp2001-dev.
Mon, Nov 17, 5:22 PM · Cloud-VPS, cloud-services-team, vm-requests, Infrastructure-Foundations, SRE
Andrew added a comment to T408543: MTU setting in IPv6 VMs causes issues with Docker.

ec318e06-1ddc-4856-8e37-17a2a5aeb0b3 | tcp-proxy-test on cloudvirt1044 is showing the migration issue.

Mon, Nov 17, 2:30 PM · Patch-For-Review, cloud-services-team, Cloud-VPS

Sun, Nov 16

Andrew added a comment to T408543: MTU setting in IPv6 VMs causes issues with Docker.
sudo cumin --backend openstack "*" 'ip addr | grep "mtu 1450"'
Sun, Nov 16, 11:42 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
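
For anyone post-processing the output of that cumin run, a minimal sketch of pulling interface names out of "ip addr" text is below. The function name and sample output are made up for illustration; only the "ip addr" line layout is taken as given. Unlike the plain grep, it compares the MTU value exactly.

```python
import re

# Matches interface header lines such as:
#   2: ens3: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1450 qdisc ...
# and captures the interface name (without any "@peer" suffix) and the MTU.
MTU_LINE = re.compile(r"^\d+:\s+([^:@\s]+)(?:@\S+)?:\s+<[^>]*>\s+mtu\s+(\d+)")


def interfaces_with_mtu(ip_addr_output, mtu):
    """Return the names of interfaces whose MTU equals `mtu`."""
    matches = []
    for line in ip_addr_output.splitlines():
        m = MTU_LINE.match(line)
        if m and int(m.group(2)) == mtu:
            matches.append(m.group(1))
    return matches


# Made-up sample output, for illustration only.
sample = """\
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
    inet 127.0.0.1/8 scope host lo
2: ens3: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1450 qdisc fq_codel state UP group default qlen 1000
    inet 172.16.0.10/21 brd 172.16.7.255 scope global dynamic ens3
"""
stale = interfaces_with_mtu(sample, 1450)
```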

Nov 15 2025

Andrew added a comment to T408543: MTU setting in IPv6 VMs causes issues with Docker.

As @taavi predicted, a reboot --hard of that server reset the MTU and allowed it to migrate. So that's good, and suggests that maybe we only need to reboot a select subset of VMs to get everyone on the same page mtu-wise.

Nov 15 2025, 12:01 AM · Patch-For-Review, cloud-services-team, Cloud-VPS

Nov 14 2025

Andrew added a comment to T330759: Modernize openstack rbac.

I think I care about deprecation warnings when they apply to our custom policies, but don't care when keystone is issuing warnings about policies that shipped directly from keystone upstream. I'm happy assuming they're approximately a 'note to self' from the keystone team and ignoring them unless you think I'm missing something.

Nov 14 2025, 11:40 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
Andrew added a comment to T408543: MTU setting in IPv6 VMs causes issues with Docker.

Today I'm draining a cloudvirt and I see this error in the logs (along with a failed migration):

Nov 14 2025, 11:33 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
Andrew closed T330759: Modernize openstack rbac as Resolved.

Keystone logs are still fairly full of warnings like

Nov 14 2025, 8:21 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
Andrew closed T330759: Modernize openstack rbac, a subtask of T276018: Investigate new roles and policies in openstack Xena, as Resolved.
Nov 14 2025, 8:21 PM · cloud-services-team, Cloud-VPS
Andrew updated the task description for T330759: Modernize openstack rbac.
Nov 14 2025, 7:19 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
Andrew closed T273150: OpenStack services should use system users to talk to Keystone, a subtask of T330759: Modernize openstack rbac, as Resolved.
Nov 14 2025, 7:18 PM · Patch-For-Review, cloud-services-team, Cloud-VPS
Andrew closed T273150: OpenStack services should use system users to talk to Keystone as Resolved.

Refactoring Neutron is scary, and splitting out a new user for Neutron won't really enhance security so I'm declaring this to be done enough.

Nov 14 2025, 7:18 PM · Cloud-VPS, cloud-services-team
Andrew added a comment to T385604: Decision Request - How openstack projects relate to tofu-infra.

Seems like consensus around option 1 -- let's close this next week if no one objects.

Nov 14 2025, 7:14 PM · Cloud-VPS, cloud-services-team, User-aborrero, Cloud Services Proposals

Nov 13 2025

Andrew updated subscribers of T409328: sso failure in codfw1dev (labtesthorizon.wikimedia.org).

@taavi this is one of the codfw1dev issues that has me blocked. I've spent a while messing with the envoy config but at this point I'm not even sure how this is meant to work.

Nov 13 2025, 10:39 PM · Infrastructure-Foundations, CAS-SSO, cloud-services-team, Cloud-VPS
Andrew added a comment to T409328: sso failure in codfw1dev (labtesthorizon.wikimedia.org).

<andrewbogott> Andrew Bogott moritzm: do you still aspire to look at https://phabricator.wikimedia.org/T409328 or should I take another stab?
4:14 PM
<moritzm> Moritz Mühlenhoff andrewbogott: I had a quick look yesterday and the CAS part looks all fine in the logs
4:14 PM I miss some context what's actually the finer details of the intended setup and I currently have some more pressing things to look at
4:15 PM so please take another stab, otherwise I'll try to make some time for it next week

Nov 13 2025, 5:41 PM · Infrastructure-Foundations, CAS-SSO, cloud-services-team, Cloud-VPS
Andrew closed T343362: Magnum UI should offer full kube config, a subtask of T328711: Magnum in Horizon (magnum-ui) in codfw1dev, as Resolved.
Nov 13 2025, 4:01 PM · cloud-services-team (FY2023/2024-Q1-Q2), Goal, Openstack-Magnum
Andrew closed T343362: Magnum UI should offer full kube config as Resolved.

Unless I'm missing something, this feature is now available on Horizon via 'Get Cluster Config'. There's also an API for this.

Nov 13 2025, 4:01 PM · cloud-services-team, Openstack-Magnum
Andrew added a comment to T409029: Flapping wikitech-static icinga alert.

I've just switched the mod_evasive settings to be more aggressive than the defaults:

Nov 13 2025, 3:15 PM · wikitech.wikimedia.org, cloud-services-team

Nov 12 2025

Andrew placed T409162: Q2:rack/setup/install clouddb1026-1033 up for grabs.
Nov 12 2025, 9:40 PM · ops-eqiad, cloud-services-team (Hardware), DC-Ops, SRE
Andrew added a comment to T407586: latest Trixie image (as of 2025-10-16) grub failure on R450 hardware.

I just ran a couple more tests:

Nov 12 2025, 8:23 PM · Upstream, cloud-services-team, SRE
Andrew added a comment to T408387: CloudVPS instance for ProVe.

Hi Andrew,

Many thanks for your response. We are trying to find out whom to contact to move this forward and understand all that is involved. Do you know who we could contact to create a VPS project and get it approved? Also, who is responsible for signing off gadgets, etc?

Nov 12 2025, 7:39 PM · cloud-services-team (FY2025/26-Q1-Q2), Cloud-VPS (Project-requests)
Andrew added a comment to T399180: Cloudcephosd: migrate to single network uplink.

OSD nodes up through 1034 are scheduled for decom in 2026. Unless there's an urgent port shortage, we should only retcon 1035 and above to avoid sending DC ops on multiple visits to the older hosts.

Nov 12 2025, 1:53 PM · netops, SRE, Infrastructure-Foundations

Nov 10 2025

Andrew added a comment to T407586: latest Trixie image (as of 2025-10-16) grub failure on R450 hardware.

On @fgiunchedi's request I tried dd'ing every drive on a server before reimaging but grub still exhibits the issue.

Nov 10 2025, 8:38 PM · Upstream, cloud-services-team, SRE
Andrew added a comment to T395255: codfw1dev has seen neutron metadata agents down since epoxy upgrade.

This wasn't applied yet on codfw1dev but now it is.

Nov 10 2025, 2:42 PM · Upstream, cloud-services-team, Cloud-VPS
Andrew closed T409580: Prepare cloud-vps haproxy configs to work on debian trixie as Resolved.
Nov 10 2025, 12:05 AM · Cloud-VPS, cloud-services-team
Andrew closed T409580: Prepare cloud-vps haproxy configs to work on debian trixie, a subtask of T409579: Upgrade cloud-vps hosts to Debian Trixie, as Resolved.
Nov 10 2025, 12:05 AM · Cloud-VPS, cloud-services-team

Nov 7 2025

Andrew claimed T409162: Q2:rack/setup/install clouddb1026-1033.

You're right!

Nov 7 2025, 10:53 PM · ops-eqiad, cloud-services-team (Hardware), DC-Ops, SRE
Andrew added a comment to T409580: Prepare cloud-vps haproxy configs to work on debian trixie.

https://www.claudiokuenzler.com/blog/1498/haproxy-option-httpchk-headers-body-end-version-string-unsupported shows a before:

Nov 7 2025, 6:47 PM · Cloud-VPS, cloud-services-team
Andrew added a comment to T409580: Prepare cloud-vps haproxy configs to work on debian trixie.

An example of an offending line is

Nov 7 2025, 6:44 PM · Cloud-VPS, cloud-services-team
Andrew created T409580: Prepare cloud-vps haproxy configs to work on debian trixie.
Nov 7 2025, 6:40 PM · Cloud-VPS, cloud-services-team
Andrew created T409579: Upgrade cloud-vps hosts to Debian Trixie.
Nov 7 2025, 6:39 PM · Cloud-VPS, cloud-services-team
Andrew added a comment to T407586: latest Trixie image (as of 2025-10-16) grub failure on R450 hardware.

I've just noticed that there are quite a few 2-drive r450s that reimaged without trouble, for example cloudrabbit200[123]-dev.

Nov 7 2025, 5:11 PM · Upstream, cloud-services-team, SRE
Andrew placed T409162: Q2:rack/setup/install clouddb1026-1033 up for grabs.
Nov 7 2025, 2:28 PM · ops-eqiad, cloud-services-team (Hardware), DC-Ops, SRE
Andrew updated the task description for T409162: Q2:rack/setup/install clouddb1026-1033.
Nov 7 2025, 2:23 PM · ops-eqiad, cloud-services-team (Hardware), DC-Ops, SRE
Andrew added a comment to T409162: Q2:rack/setup/install clouddb1026-1033.

For my reference, the following will be the redundant pairs according to T401295

Nov 7 2025, 2:13 PM · ops-eqiad, cloud-services-team (Hardware), DC-Ops, SRE

Nov 6 2025

Andrew added a comment to T407586: latest Trixie image (as of 2025-10-16) grub failure on R450 hardware.

For future debug research: We can prevent the final reboot after a reimage like this:

Nov 6 2025, 7:18 PM · Upstream, cloud-services-team, SRE
Andrew assigned T409328: sso failure in codfw1dev (labtesthorizon.wikimedia.org) to MoritzMuehlenhoff.

@MoritzMuehlenhoff has offered to take a look at this.

Nov 6 2025, 6:52 PM · Infrastructure-Foundations, CAS-SSO, cloud-services-team, Cloud-VPS
Andrew added a comment to T409365: Grant zuul project access to `fast-iops` volume type and `4xiops` instance flavor.

not sure if there is a separate setting for the fast-iops bit

My understanding is that volume types are only relevant for additional volumes (created with "Volumes" in Horizon, or with the volumes API), whereas the attributes of the root volumes of instances (including the IOPS limits) are controlled by the flavor. @Andrew is that correct?

Nov 6 2025, 3:57 PM · Cloud-VPS (Quota-requests)

Nov 5 2025

Andrew created T409328: sso failure in codfw1dev (labtesthorizon.wikimedia.org).
Nov 5 2025, 6:02 PM · Infrastructure-Foundations, CAS-SSO, cloud-services-team, Cloud-VPS
Andrew added a comment to T376400: Redesign wikitech-static.

Can you point me to some specific examples? My half-baked spot checks (e.g. http://ec2-54-81-201-239.compute-1.amazonaws.com/wiki/Eqiad_data_center.html#/media/File:Eqiad_logical.png) seem to be hosted locally (or I am misunderstanding something fundamental).

Sure, for example the first image on http://ec2-54-81-201-239.compute-1.amazonaws.com/wiki/Network_design.html uses src="http://upload.wikimedia.org/wikipedia/labs/thumb/5/5f/Wikimedia_network_overview.png/960px-Wikimedia_network_overview.png" and src="http://upload.wikimedia.org/wikipedia/labs/thumb/5/5f/Wikimedia_network_overview.png/1280px-Wikimedia_network_overview.png" when you click on it.

Nov 5 2025, 4:09 PM · Patch-For-Review, serviceops-radar, SRE-Unowned, SRE, wikitech.wikimedia.org

Nov 3 2025

Andrew added a comment to T408387: CloudVPS instance for ProVe.

Hello again!

Nov 3 2025, 5:10 PM · cloud-services-team (FY2025/26-Q1-Q2), Cloud-VPS (Project-requests)

Oct 30 2025

Andrew added a comment to T408387: CloudVPS instance for ProVe.

we're told Gadgets shouldn't call external services

Oct 30 2025, 3:41 PM · cloud-services-team (FY2025/26-Q1-Q2), Cloud-VPS (Project-requests)
Andrew added a comment to T370037: Cloud VPS: extend tofu-infra coverage.

I think that admin-defined just means 'things in cloud-vps managed and supported by staff rather than by random users'.

Oct 30 2025, 2:50 PM · Cloud-VPS, User-aborrero, Epic, cloud-services-team

Oct 29 2025

Andrew triaged T402806: huggle-nfs volume filling up as High priority.
Oct 29 2025, 3:00 PM · cloud-services-team, Huggle
Andrew triaged T402807: wikidumpparse NFS volume filling up as High priority.
Oct 29 2025, 2:29 PM · VPS-Projects, cloud-services-team

Oct 25 2025

Andrew added a comment to T407586: latest Trixie image (as of 2025-10-16) grub failure on R450 hardware.

Confirmed, when I rolled cloudcontrol1008-dev back to raid10 grub failed again.

Oct 25 2025, 6:17 PM · Upstream, cloud-services-team, SRE
Andrew added a comment to T407586: latest Trixie image (as of 2025-10-16) grub failure on R450 hardware.

Seems like grub works properly without sw raid. cloudcontrol1008-dev with flat.cfg:

Oct 25 2025, 5:45 PM · Upstream, cloud-services-team, SRE

Oct 23 2025

Andrew renamed T407586: latest Trixie image (as of 2025-10-16) grub failure on R450 hardware from latest Trixie image (as of 2025-10-16) grub failure on R540 hardware to latest Trixie image (as of 2025-10-16) grub failure on R450 hardware.
Oct 23 2025, 4:56 PM · Upstream, cloud-services-team, SRE
Andrew updated the task description for T407586: latest Trixie image (as of 2025-10-16) grub failure on R450 hardware.
Oct 23 2025, 4:55 PM · Upstream, cloud-services-team, SRE