The PetScan machine petscan5 was unresponsive, so I tried a soft reboot, but it seems to be stuck. I tried to stop it, but that doesn't work either. Can someone reboot it, please? Thanks!
The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag on this task. Thanks!
UPDATE: The instance has restarted, but apparently it no longer has the key pair associated. I tried to ssh in from login.toolforge.org and bastion.wmflabs.org, without success.
Mentioned in SAL (#wikimedia-cloud) [2025-01-24T15:37:25Z] <dhinus> openstack server migrate {petscan5_id} T384642
Migrating the VM fixed the issue, which was probably caused by T383583: VM nova records attached to incorrect cloudcephmon IPs.
@Magnus uh weird, it was working for my user so I assumed that fixed it for everyone. Looking.
The keys that I would expect to work are the ones you can find listed at https://ldap.toolforge.org/user/magnus
As the proxy error message states, you need to report this directly to the maintainers of Petscan and not to this task which is in the Cloud-VPS infrastructure board. My understanding is that Petscan issues are tracked at https://github.com/magnusmanske/petscan_rs/issues.
This does not appear to be a software issue in PetScan; the frequent need to reboot the VM may instead have something to do with T385288.
I have limited PetScan's RAM via systemctl, which should also restart PetScan after a VM reboot. Please let me know if that doesn't take care of the problem.
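For readers unfamiliar with this kind of setup, a memory cap and restart policy under systemd might look roughly like the drop-in below. This is only a sketch: the unit name `petscan.service`, the file path, and the `8G` figure are assumptions for illustration, not the actual configuration on petscan5.

```ini
# /etc/systemd/system/petscan.service.d/limits.conf
# (hypothetical path and unit name, chosen for illustration)
[Service]
# Hard cap on the memory the service may use; the 8G value is illustrative only.
# MemoryMax= applies on hosts using the unified cgroup (v2) hierarchy.
MemoryMax=8G
# Restart the process if it exits abnormally, e.g. after being killed for exceeding the cap.
Restart=on-failure
RestartSec=5
```

After `systemctl daemon-reload`, running `systemctl enable petscan.service` would make the unit start at boot, provided the unit file has an `[Install]` section. That is one way a service can come back up after a VM reboot.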