Page MenuHomePhabricator

Andrew (Andrew Bogott)
User

Projects (13)

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Thursday

  • Clear sailing ahead.

User Details

User Since
Nov 2 2014, 11:35 PM (498 w, 1 d)
Availability
Available
IRC Nick
andrewbogott
LDAP User
Unknown
MediaWiki User
Andrewbogott [ Global Accounts ]

Recent Activity

Fri, May 17

TBurmeister awarded T325774: Improve UI text and content for "Launch [database] instance" dialogue box in Horizon UI a Love token.
Fri, May 17, 5:05 PM · Horizon

Wed, May 15

Andrew added a comment to T358496: [toolforge,storage] Provide per-tool access to cloud-vps object storage.

This all seems correct, although I reiterate that the interesting part is the scope creation or management. We don't currently have a good way to create databases that are explicitly tied to a particular tool -- even with the bespoke/by hand approach we take now there's nothing to keep the tool and trove ACLS in sync.

Can you elaborate on this?

The current database-for-a-tool solution is that we've created trove-only openstack projects to manage databases used by toolforge tools. Those projects may or may not have the same members as the tool that project is supporting; it's entirely ad-hoc. If we have a model where a tool corresponds directly to an openstack tenant then we can put the trove DBs in there and have consistent access and membership between tools and trove.

Interesting, I thought that the openstack projects created for trove were already mapped to a tool (and the auth was through ldap matching the user to that tool that then matches the openstack tenant).
Just to verify, as of today, the trove database for a tool, is in an arbitrary openstack project that is managed by whomever requested to create the database right? (so completely independent of the LDAP tool group)

That's correct. Of course the owner of the db project can manage project access, so ideally they keep things in sync manually.

Wed, May 15, 1:57 PM · cloud-services-team, Toolforge

Tue, May 14

Andrew added a comment to T358496: [toolforge,storage] Provide per-tool access to cloud-vps object storage.

Here, we are only talking about allowing tools to set up and use s3-style buckets, correct?

Tue, May 14, 4:50 PM · cloud-services-team, Toolforge
Andrew added a comment to T358496: [toolforge,storage] Provide per-tool access to cloud-vps object storage.

This all seems correct, although I reiterate that the interesting part is the scope creation or management. We don't currently have a good way to create databases that are explicitly tied to a particular tool -- even with the bespoke/by hand approach we take now there's nothing to keep the tool and trove ACLS in sync.

Can you elaborate on this?

Tue, May 14, 4:47 PM · cloud-services-team, Toolforge

Mon, May 13

Andrew added a comment to T358496: [toolforge,storage] Provide per-tool access to cloud-vps object storage.

<snip>

I'm still thinking on use the cases

This task is specifically tackling this one:

  • As a tool, I want to be able to access the s3 buckets I created (from horizon) from within toolforge
Mon, May 13, 2:23 PM · cloud-services-team, Toolforge
Andrew added a comment to T364492: Ownership confusion on cloud-local puppet servers.

I'm now learned that new prod puppservers also use the 'gitpuppet' user. So eliminating that user will increase the diff with prod rather than shrink it. So that's not the right path forward. Probably I should just figure out a fix for case 1.

Mon, May 13, 1:55 PM · Patch-For-Review, Puppet-Infrastructure, cloud-services-team

Thu, May 9

Andrew added a comment to T325774: Improve UI text and content for "Launch [database] instance" dialogue box in Horizon UI.

Rivers change course, civilizations rise and fall, and I have finally done some work on this task.

Thu, May 9, 9:51 PM · Horizon
Andrew updated the task description for T364577: decommission cloudcontrol2001-dev.codfw.wmnet.
Thu, May 9, 7:28 PM · decommission-hardware, cloud-services-team
Andrew updated the task description for T364577: decommission cloudcontrol2001-dev.codfw.wmnet.
Thu, May 9, 7:28 PM · decommission-hardware, cloud-services-team
Andrew added a comment to T364559: Create (or teach Andrew how to create) private connections+dns entries for new cloudcontrols.

Reimaging cloudcontrol2006-dev works now, thanks!

Thu, May 9, 7:23 PM · SRE, netops, ops-codfw, Infrastructure-Foundations, cloud-services-team
Andrew created T364577: decommission cloudcontrol2001-dev.codfw.wmnet.
Thu, May 9, 7:18 PM · decommission-hardware, cloud-services-team
Andrew created T364559: Create (or teach Andrew how to create) private connections+dns entries for new cloudcontrols.
Thu, May 9, 4:23 PM · SRE, netops, ops-codfw, Infrastructure-Foundations, cloud-services-team
Andrew closed T324998: Q3:rack/setup/install cloudcephosd10(3[5-9]|40) as Invalid.

I'm closing this as invalid since those hosts have come and gone :)

Thu, May 9, 2:51 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops

Wed, May 8

Andrew closed T302537: Horizon puppet panel needs better exception handling as Resolved.
Wed, May 8, 10:13 PM · cloud-services-team, Horizon
Andrew added a comment to T358496: [toolforge,storage] Provide per-tool access to cloud-vps object storage.

The per-tool service user would not have a password, only app credentials.

If I understand it correctly, this might be entirely impossible in the current Keystone model (it could be a bug or intended behaviour). Is it a problem though to have a password associated with the service tool user, if that password is stored securely and only available to the agent that uses it to create the associated app credentials?

Wed, May 8, 9:51 PM · cloud-services-team, Toolforge
Andrew added a comment to T358496: [toolforge,storage] Provide per-tool access to cloud-vps object storage.

I'm hitting a roadblock with the service user plan -- because of keystone's belt-and-suspenders approach to security, I can override the policy to allow an admin user to create app creds for another user (e.g. novaadmin creating creds for tool.mytool) but there's an explicit check in the code comparing context ID to cred ID and erroring out. IMO this is a keystone bug (https://launchpad.net/bugs/2065212) but it's unlikely to be changed upstream anytime soon.

Wed, May 8, 8:04 PM · cloud-services-team, Toolforge
Andrew claimed T364492: Ownership confusion on cloud-local puppet servers.
Wed, May 8, 5:45 PM · Patch-For-Review, Puppet-Infrastructure, cloud-services-team
Andrew created T364492: Ownership confusion on cloud-local puppet servers.
Wed, May 8, 5:08 PM · Patch-For-Review, Puppet-Infrastructure, cloud-services-team

Tue, May 7

Don-vip awarded T358496: [toolforge,storage] Provide per-tool access to cloud-vps object storage a Fox token.
Tue, May 7, 6:21 PM · cloud-services-team, Toolforge
Andrew added a comment to T358496: [toolforge,storage] Provide per-tool access to cloud-vps object storage.

So many questions!

Tue, May 7, 4:26 PM · cloud-services-team, Toolforge
Andrew updated subscribers of T363125: sustainability of wikitech.wikimedia.org.
Tue, May 7, 1:53 PM · wikitech.wikimedia.org, Security, Epic, cloud-services-team

Mon, May 6

Andrew added a comment to T358496: [toolforge,storage] Provide per-tool access to cloud-vps object storage.

My favorite option is 'Automatic creation of per-tool keystone project'. Since that's a simple extension of 'On-demand creation of per-tool keystone project' I'm going to start with that (with a cli tool rather than an API endpoint for now).

Mon, May 6, 8:10 PM · cloud-services-team, Toolforge
Andrew updated the task description for T358496: [toolforge,storage] Provide per-tool access to cloud-vps object storage.
Mon, May 6, 8:07 PM · cloud-services-team, Toolforge
Andrew added a comment to T358496: [toolforge,storage] Provide per-tool access to cloud-vps object storage.

I'm striking out the 'keystone projects in ldap' option because keystone doesn't really support that one.

Mon, May 6, 8:03 PM · cloud-services-team, Toolforge
Andrew closed T332400: Migrate cloudweb, cloudbackup, cloudmetrics physical servers off buster as Resolved.
Mon, May 6, 3:53 PM · cloud-services-team

Thu, May 2

Andrew created T364047: puppet servers run out of inodes in puppet code volume.
Thu, May 2, 9:04 PM · Patch-For-Review, Puppet-Infrastructure, cloud-services-team, Infrastructure-Foundations

Wed, May 1

Andrew closed T335978: openstack: consider removing references to old hardware from the database as Resolved.

I think this is now cleaned up and resolved for now. In the future, I suspect that deleting canary VMs before deleting hypervisors will prevent them from showing up here, but openstack resource provider delete might be needed.

Wed, May 1, 4:57 PM · cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
Andrew closed T335978: openstack: consider removing references to old hardware from the database, a subtask of T335943: prometheus-openstack-exporter: collected data shows regular null intervals, as Resolved.
Wed, May 1, 4:55 PM · User-aborrero, cloud-services-team (FY2022/2023-Q4), Cloud-VPS
Andrew added a comment to T335978: openstack: consider removing references to old hardware from the database.

Ok, I think I found them! These deleted hosts can be cleaned up with

Wed, May 1, 4:26 PM · cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
Andrew added a comment to T335978: openstack: consider removing references to old hardware from the database.

Removing hardware records from the DB seems a little bit dangerous as that could leave dangling references elsewhere (for instance in the action log which keeps track of any previous actions a VM took, including a reference to where the VM was at the time.)

Wed, May 1, 3:05 PM · cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
Andrew closed T349651: Support Trove + Swift integration as Resolved.

Inasmuch as Trove works for this, the integration is also working.

Wed, May 1, 2:47 PM · Patch-For-Review, Data-Services, Cloud-VPS, User-Marostegui, cloud-services-team
Andrew closed T349651: Support Trove + Swift integration, a subtask of T212595: [Feature request] Database as a Service (Trove) for Cloud VPS projects, as Resolved.
Wed, May 1, 2:45 PM · cloud-services-team (Kanban), Data-Services, Cloud-VPS, User-Marostegui

Fri, Apr 26

Andrew closed T356287: Upgrade cloud-vps openstack to version 'Bobcat' as Resolved.
Fri, Apr 26, 1:18 PM · cloud-services-team (FY2023/2024-Q3-Q4), Goal, Cloud-VPS

Thu, Apr 25

Andrew added a comment to T362449: Taavi knowledge transfer: python-flask-keystone, novaproxy, enc api.

I (Andrew) am accepting this task to investigate deprecation warnings in these services and (probably) take over maintenance of python-flask-keystone.

Thu, Apr 25, 2:46 PM · Toolforge, Cloud-VPS, cloud-services-team
Andrew added a comment to T362449: Taavi knowledge transfer: python-flask-keystone, novaproxy, enc api.

From a meeting about these services today:

Thu, Apr 25, 2:45 PM · Toolforge, Cloud-VPS, cloud-services-team
Andrew renamed T362449: Taavi knowledge transfer: python-flask-keystone, novaproxy, enc api from Taavi knowledge transfer: python-flask-keystone to Taavi knowledge transfer: python-flask-keystone, novaproxy, enc api.
Thu, Apr 25, 2:44 PM · Toolforge, Cloud-VPS, cloud-services-team
Andrew added a comment to T362447: Taavi knowledge transfer: Toolforge misc services (e.g. mail server).

The toolforge exim server is using an experimental feature to support forwarding to gmail. That build is here: https://gitlab.wikimedia.org/repos/sre/exim4-arc -- it will likely become part of the main exim build soon.

Thu, Apr 25, 2:38 PM · Toolforge, Cloud-VPS, cloud-services-team

Wed, Apr 24

Andrew committed rOHMU3d655cf063d2: disable 'Rolling Cluster Upgrade' feature.
disable 'Rolling Cluster Upgrade' feature
Wed, Apr 24, 12:10 AM
Andrew committed rOHMU4a987ee706eb: Revert "_1370_project_container_infra_panel_group.py: change scss includes".
Revert "_1370_project_container_infra_panel_group.py: change scss includes"
Wed, Apr 24, 12:10 AM
Andrew added a reverting change for rOHMU3e33c3b06008: _1370_project_container_infra_panel_group.py: change scss includes: rOHMU4a987ee706eb: Revert "_1370_project_container_infra_panel_group.py: change scss includes".
Wed, Apr 24, 12:10 AM
Andrew committed rOHMU395ec1e4fc9e: requirements.txt: remove Horizon dependency.
requirements.txt: remove Horizon dependency
Wed, Apr 24, 12:10 AM
Andrew committed rOHMU2d813c6266e4: Update .gitreview for stable/zed (authored by OpenStack Release Bot <infra-root@openstack.org>).
Update .gitreview for stable/zed
Wed, Apr 24, 12:10 AM
Andrew committed rOHMU7a46e0f28852: Update TOX_CONSTRAINTS_FILE for stable/zed (authored by OpenStack Release Bot <infra-root@openstack.org>).
Update TOX_CONSTRAINTS_FILE for stable/zed
Wed, Apr 24, 12:10 AM
Andrew committed rOHMU8a0930096969: _1370_project_container_infra_panel_group.py: change scss includes.
_1370_project_container_infra_panel_group.py: change scss includes
Wed, Apr 24, 12:10 AM
Andrew committed rOHMU5b040f3b0fc8: Add MANIFEST.in.
Add MANIFEST.in
Wed, Apr 24, 12:10 AM
Andrew committed rOHMUaab800ceb736: sign-certificate-modal.controller.js: replace success() with then().
sign-certificate-modal.controller.js: replace success() with then()
Wed, Apr 24, 12:10 AM

Tue, Apr 23

Andrew reassigned T348643: cloudcephosd1021-1034: hard drive sector errors increasing from Andrew to dcaro.
Tue, Apr 23, 7:29 PM · cloud-services-team (FY2023/2024-Q3-Q4), SRE, ops-eqiad, DC-Ops, Cloud-VPS
Andrew closed T351450: Migrate Cloud VPS puppet infrastructure to Puppet 7 as Resolved.
Tue, Apr 23, 7:28 PM · Patch-For-Review, cloud-services-team (FY2023/2024-Q3-Q4), Goal, Puppet (Puppet 7.0), Cloud-VPS
Andrew updated the task description for T351452: Migrate per-project Puppet servers to Puppet 7.
Tue, Apr 23, 7:28 PM · VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
Andrew closed T361371: Update mailman project puppetmaster as Resolved.
Tue, Apr 23, 7:28 PM · VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
Andrew updated the task description for T351452: Migrate per-project Puppet servers to Puppet 7.
Tue, Apr 23, 7:27 PM · VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
Andrew closed T361371: Update mailman project puppetmaster, a subtask of T351452: Migrate per-project Puppet servers to Puppet 7, as Resolved.
Tue, Apr 23, 7:27 PM · VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
Andrew closed T361596: Update puppet wikidata-query puppetmaster, a subtask of T351452: Migrate per-project Puppet servers to Puppet 7, as Resolved.
Tue, Apr 23, 7:26 PM · VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
Andrew closed T361596: Update puppet wikidata-query puppetmaster as Resolved.

I've built a new puppetserver in this project, wdqspuppetserver-1. Nothing was using the old one so probably this effort was in vain.

Tue, Apr 23, 7:26 PM · VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
Andrew updated the task description for T351452: Migrate per-project Puppet servers to Puppet 7.
Tue, Apr 23, 6:42 PM · VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
Andrew added a comment to T332400: Migrate cloudweb, cloudbackup, cloudmetrics physical servers off buster .

It is safe to reimage cloudbackup1003 on April 30.

Tue, Apr 23, 4:45 PM · cloud-services-team
Andrew updated the task description for T332400: Migrate cloudweb, cloudbackup, cloudmetrics physical servers off buster .
Tue, Apr 23, 4:43 PM · cloud-services-team
Andrew added a comment to T350807: Package latest version of prometheus-memcached-exporter (v0.14.2).

Everything seems happy now. Thanks!

Tue, Apr 23, 3:37 PM · serviceops
Andrew added a comment to T363164: InterfaceSpeedError brq05a5494a-18 on cloudvirt2001-dev:9100 has the wrong speed: 1.25e+06..

This is taavi messing with ovs

Tue, Apr 23, 3:14 PM · cloud-services-team
Andrew added a comment to T361804: Decision request - Update python team best practices.

I'm late to this, but I also agree with B5. Do we need another rule regarding what to do with existing code when applying new checks or standards and would result in a reformat?

I would say no, specially as this decision does not force anyone to change anything.

A rule of the thumb though would be to separate those changes in it's own MR (if the changes are big), or in a first commit inside the MR (if they are small).

Tue, Apr 23, 1:32 PM · Cloud Services Proposals
Andrew added a comment to T363125: sustainability of wikitech.wikimedia.org.

B: Fishbowl wiki hosted on wikikube, accounts in ldap. This option could be a final state OR a temporary state on the way to the SUL option.
Con (long-term): Allowing r/w ldap access from wikitech/wikikube may continue to introduce surprising edge cases for product maintenance.

I don't think Wikitech will require r/w access after T359544: Disable SSH key management on Wikitech is done.

Tue, Apr 23, 1:21 PM · wikitech.wikimedia.org, Security, Epic, cloud-services-team
Andrew added a parent task for T161859: Make Wikitech an SUL wiki: T363125: sustainability of wikitech.wikimedia.org.
Tue, Apr 23, 3:56 AM · cloud-services-team, Epic, wikitech.wikimedia.org
Andrew added a parent task for T161553: Remove OpenStackManager from Wikitech: T363125: sustainability of wikitech.wikimedia.org.
Tue, Apr 23, 3:56 AM · cloud-services-team, MW-1.35-notes (1.35.0-wmf.8; 2019-11-26), wikitech.wikimedia.org, MediaWiki-extensions-OpenStackManager
Andrew added a parent task for T237773: Move Wikitech onto the production MW cluster: T363125: sustainability of wikitech.wikimedia.org.
Tue, Apr 23, 3:56 AM · cloud-services-team, wikitech.wikimedia.org
Andrew added a parent task for T292707: Migrate Wikitech to Kubernetes: T363125: sustainability of wikitech.wikimedia.org.
Tue, Apr 23, 3:56 AM · wikitech.wikimedia.org, MW-on-K8s, serviceops
Andrew added a parent task for T359544: Disable SSH key management on Wikitech: T363125: sustainability of wikitech.wikimedia.org.
Tue, Apr 23, 3:56 AM · cloud-services-team, wikitech.wikimedia.org
Andrew added subtasks for T363125: sustainability of wikitech.wikimedia.org: T161859: Make Wikitech an SUL wiki, T292707: Migrate Wikitech to Kubernetes, T161553: Remove OpenStackManager from Wikitech, T359551: Replace wikitech as source of two-factor auth protection for developer accounts, T237773: Move Wikitech onto the production MW cluster, T359544: Disable SSH key management on Wikitech.
Tue, Apr 23, 3:56 AM · wikitech.wikimedia.org, Security, Epic, cloud-services-team
Andrew added a parent task for T359551: Replace wikitech as source of two-factor auth protection for developer accounts: T363125: sustainability of wikitech.wikimedia.org.
Tue, Apr 23, 3:56 AM · LDAP, cloud-services-team, wikitech.wikimedia.org
Andrew created T363125: sustainability of wikitech.wikimedia.org.
Tue, Apr 23, 3:54 AM · wikitech.wikimedia.org, Security, Epic, cloud-services-team

Apr 19 2024

Andrew added a comment to T362956: nova-api can get the listen queue of socket full.

...I just checked and Bobcat is still using greenlet 2.0.2 so this is likely not fixed in bobcat :(

Apr 19 2024, 2:41 PM · Cloud-VPS, cloud-services-team
Andrew added a comment to T362956: nova-api can get the listen queue of socket full.

I think this is the same issue (but different log message) as T352635

Apr 19 2024, 2:40 PM · Cloud-VPS, cloud-services-team
Andrew added a comment to T362956: nova-api can get the listen queue of socket full.

I've been seeing this crash periodically since we upgraded to A -- if this is the same failure then believe this is a bug in the python threading library that we're using and the full queue is a symptom of a stuck listener.

Apr 19 2024, 2:35 PM · Cloud-VPS, cloud-services-team

Apr 18 2024

Andrew added a comment to T361804: Decision request - Update python team best practices.

I'm late to this, but I also agree with B5. Do we need another rule regarding what to do with existing code when applying new checks or standards and would result in a reformat?

Apr 18 2024, 6:52 PM · Cloud Services Proposals

Apr 17 2024

Andrew added a comment to T356287: Upgrade cloud-vps openstack to version 'Bobcat'.

codfw1dev is now running bobcat. The only (minor) issue I'm aware of so far is T350807

Apr 17 2024, 2:18 PM · cloud-services-team (FY2023/2024-Q3-Q4), Goal, Cloud-VPS
Andrew added a comment to T350807: Package latest version of prometheus-memcached-exporter (v0.14.2).

Coincidentally, I just did a dist-upgrade that pulled in this new package. The 0.14 package installs its binary here:

Apr 17 2024, 2:13 PM · serviceops
Andrew reopened T350807: Package latest version of prometheus-memcached-exporter (v0.14.2), a subtask of T352885: Enable extstore to a subset of memcached servers (experiment), as Open.
Apr 17 2024, 2:13 PM · serviceops
Andrew reopened T350807: Package latest version of prometheus-memcached-exporter (v0.14.2) as "Open".
Apr 17 2024, 2:13 PM · serviceops
Andrew reopened T350807: Package latest version of prometheus-memcached-exporter (v0.14.2), a subtask of T352891: Upgrade memcache and memcached gutter pools to Bookworm, as Open.
Apr 17 2024, 2:13 PM · serviceops
Andrew closed T356216: Q#:rack/setup/install (2) cloudbackup hosts as Resolved.

These are now in service and working fine.

Apr 17 2024, 1:29 PM · SRE, ops-codfw, cloud-services-team (Hardware), DC-Ops

Apr 16 2024

Andrew committed rCCKB75c77aa95f55: upgrade_openstack_node: Upgrade designate db on cloudcontrols.
upgrade_openstack_node: Upgrade designate db on cloudcontrols
Apr 16 2024, 7:19 PM
Andrew closed T361594: Update mariadbtest project puppetmaster as Resolved.
Apr 16 2024, 6:58 PM · VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
Andrew closed T361594: Update mariadbtest project puppetmaster, a subtask of T351452: Migrate per-project Puppet servers to Puppet 7, as Resolved.
Apr 16 2024, 6:56 PM · VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
Andrew closed T361591: Update pki project puppetmaster as Resolved.

puppetserver is upgraded but everything in this project is Buster so puppet 7 will be unhappy until that's fixed.

Apr 16 2024, 4:58 PM · VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
Andrew closed T361591: Update pki project puppetmaster, a subtask of T351452: Migrate per-project Puppet servers to Puppet 7, as Resolved.
Apr 16 2024, 4:57 PM · VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
Andrew claimed T361591: Update pki project puppetmaster.

This project was managed by jbond -- for now I will do this upgrade.

Apr 16 2024, 4:02 PM · VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
Andrew closed T361593: Update puppet-dev project puppetmaster, a subtask of T351452: Migrate per-project Puppet servers to Puppet 7, as Resolved.
Apr 16 2024, 3:39 PM · VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
Andrew closed T361593: Update puppet-dev project puppetmaster as Resolved.

10:28 AM taavi, jhathaway, moritzm, is the puppet-dev project effectively defunct now that jbond has departed? It's unmarked on the purge page and also has https://phabricator.wikimedia.org/T361593 with no response
10:29 AM
<moritzm> Moritz Mühlenhoff let me have a look
10:31 AM I haven't used it for ages and I think it was mostly used to stage/test the puppet 7. from my PoV it can be phased out unless Jesse or Taavi still use it
10:32 AM <andrewbogott> Andrew Bogott ok, thanks moritzm, let's see if anyone else has an opinion :)
10:34 AM
<jhathaway> Jesse Hathaway I agree with moritzm, I would like to keep the project around, but the instances can be removed
10:35 AM <andrewbogott> Andrew Bogott great, shall I delete things right now?
10:37 AM
<jhathaway> Jesse Hathaway fine by me, unless taavi objects
10:37 AM
<jbond> John Bond ftw also good with me
10:37 AM
<taavi> Taavi Väänänen no objections from me

Apr 16 2024, 3:39 PM · VPS-Projects, Puppet (Puppet 7.0), cloud-services-team
Andrew added a comment to T362438: decommission cloudbackup200[12].codfw.wmnet.

what are we doing with cloudbackup2001-array1 and cloudbackup2002-array1?

Apr 16 2024, 2:03 PM · SRE, ops-codfw, cloud-services-team, decommission-hardware

Apr 15 2024

Andrew added a comment to T360470: Update devtools project puppetmaster.

The latest puppetserver code is prone to gobbling RAM; I'd check for oom messages and see about using profile::puppetserver::java_max_mem

Apr 15 2024, 4:03 PM · VPS-project-devtools, Release-Engineering-Team (Now this 🫠), User-brennen, collaboration-services, Puppet (Puppet 7.0), cloud-services-team

Apr 12 2024

Andrew created T362452: Taavi knowledge transfer: cloud-vps monitoring.
Apr 12 2024, 9:53 PM · User-dcaro, Toolforge, Cloud-VPS, cloud-services-team
Andrew created T362450: Taavi knowledge transfer: Cloud VPS OpenTofu provider.
Apr 12 2024, 9:52 PM · Toolforge, Cloud-VPS, cloud-services-team
Andrew created T362449: Taavi knowledge transfer: python-flask-keystone, novaproxy, enc api.
Apr 12 2024, 9:51 PM · Toolforge, Cloud-VPS, cloud-services-team
Andrew created T362448: Taavi knowledge transfer: rebuild toolforge docker images.
Apr 12 2024, 9:49 PM · Toolforge, Cloud-VPS, cloud-services-team
Andrew created T362447: Taavi knowledge transfer: Toolforge misc services (e.g. mail server).
Apr 12 2024, 9:48 PM · Toolforge, Cloud-VPS, cloud-services-team
Andrew created T362446: Taavi knowledge transfer: toolforge job investigation.
Apr 12 2024, 9:45 PM · User-dcaro, Toolforge, Cloud-VPS, cloud-services-team
Andrew created T362445: Taavi knowledge transfer: Toolforge k8s upgrades.
Apr 12 2024, 9:43 PM · Toolforge, Cloud-VPS, cloud-services-team
Andrew created T362444: Taavi knowledge transfer: maintain-kubeusers .
Apr 12 2024, 9:42 PM · Toolforge, Cloud-VPS, cloud-services-team
Andrew created T362443: Learn how to do what Taavi does.
Apr 12 2024, 9:34 PM · Toolforge, cloud-services-team, Cloud-VPS
Andrew added a comment to T360470: Update devtools project puppetmaster.

It's not really a buster thing -- the puppet code for geoip is entirely different in the puppetserver manifests vs. the old puppetmaster manifests.

Apr 12 2024, 9:11 PM · VPS-project-devtools, Release-Engineering-Team (Now this 🫠), User-brennen, collaboration-services, Puppet (Puppet 7.0), cloud-services-team
Andrew updated the task description for T362438: decommission cloudbackup200[12].codfw.wmnet.
Apr 12 2024, 6:58 PM · SRE, ops-codfw, cloud-services-team, decommission-hardware